A Benchmark Dataset of Check-worthy Factual Claims

Fatma Arslan, Naeemul Hassan, Chengkai Li, Mark Tremayne

In this paper, we present the ClaimBuster dataset of 23,533 statements extracted from all U.S. general election presidential debates and annotated by human coders. The dataset can be used to build computational methods that identify claims worth fact-checking among the many sources of digital and traditional media. The ClaimBuster dataset is publicly available to the research community at http://doi.org/10.5281/zenodo.3609356.
