Altuncu, Enes, Başkent, Can, Bhattacherjee, Sanjay, Li, Shujun, Roy, Dwaipayan (2025) FACTors: A New Dataset for Studying the Fact-checking Ecosystem. In: Proceedings of the 2025 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. . ACM (In press) (doi:10.1145/3726302.3730339) (KAR id:109902)
PDF
Author's Accepted Manuscript
Language: English
This work is licensed under a Creative Commons Attribution 4.0 International License.
|
|
Download this file (PDF/660kB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: https://doi.org/10.1145/3726302.3730339 |
Abstract
Our fight against false information is spearheaded by fact-checkers. They investigate the veracity of claims and document their findings as fact-checking reports. With the rapid increase in the amount of false information circulating online, the use of automation in factchecking processes aims to strengthen this ecosystem by enhancing scalability. Datasets containing fact-checked claims play a key role in developing such automated solutions. However, to the best of our knowledge, there is no fact-checking dataset at the ecosystem level, covering claims from a sufficiently long period of time and sourced from a wide range of actors reflecting the entire ecosystem that admittedly follows widely-accepted codes and principles of fact-checking.
We present a new dataset FACTors, the first to fill this gap by presenting ecosystem-level data on fact-checking. It contains 118,112 claims from 117,993 fact-checking reports in English (co-)authored by 1,953 individuals and published during the period of 1995-2025 by 39 fact-checking organisations that are active signatories of the IFCN (International Fact-Checking Network) and/or EFCSN (European Fact-Checking Standards Network). It contains 7,327 overlapping claims investigated by multiple fact-checking organisations, corresponding to 2,977 unique claims. It allows to conduct new ecosystem-level studies of the fact-checkers (organisations and individuals).
To demonstrate the usefulness of our dataset, we present three example applications. They include a first-of-its-kind statistical analysis of the fact-checking ecosystem, examining the political inclinations of the fact-checking organisations, and attempting to assign a credibility score to each organisation based on the findings of the statistical analysis and political leanings. Our methods for constructing FACTors are generic and can be used to maintain a live dataset that can be updated dynamically.
Item Type: | Conference or workshop item (Paper) |
---|---|
DOI/Identification number: | 10.1145/3726302.3730339 |
Uncontrolled keywords: | false information; misinformation; disinformation; fact-checking; dataset; resources; ecosystem |
Subjects: |
Q Science > QA Mathematics (inc Computing science) Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, > QA76.76 Computer software > QA76.76.I59 Interactive media, hypermedia T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK5101 Telecommunications > TK5105.888 World Wide Web T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7885 Computer engineering. Computer hardware |
Institutional Unit: |
Schools > School of Computing Institutes > Institute of Cyber Security for Society |
Former Institutional Unit: |
Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing University-wide institutes > Institute of Cyber Security for Society
|
Depositing User: | Shujun Li |
Date Deposited: | 15 May 2025 20:18 UTC |
Last Modified: | 20 May 2025 10:29 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/109902 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):