The catalogue contains study descriptions in various languages. The system searches with your search terms from study descriptions available in the language you have selected. The catalogue does not have ‘All languages’ option as due to linguistic differences this would give incomplete results. See the User Guide for more detailed information.
ClaimsKG - A Knowledge Graph of Fact-Checked Claims (January, 2023)
Creator
Gangopadhyay, Susmita ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Schellhammer, Sebastian ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Boland, Katarina ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Schüller, Sascha ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Todorov, Konstantin ( LIRMM / University of Montpellier)
Tchechmedjiev, Andon ( LGI2P / IMT Mines Ales / University of Montpellier)
Zapilko, Benjamin ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Fafalios, Pavlos ( Institute of Computer Science, FORTH-ICS)
Jabeen, Hajira ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Dietze, Stefan ( GESIS - Leibniz-Institut für Sozialwissenschaften & Heinrich-Heine-University Düsseldorf)
Study number / PID
10.7802/2620 (GESIS)
10.7802/2620 (DOI)
Data access
Informationen nicht verfügbar
Series
Nicht verfügbar
Abstract
ClaimsKG is a knowledge graph of metadata information for fact-checked claims scraped from popular fact-checking sites. In addition to providing a single dataset of claims and associated metadata, truth ratings are harmonized and additional information is provided for each claim, e.g., about mentioned entities. Please see ( https://data.gesis.org/claimskg/ ) for further details about the data model, query examples and statistics.
The dataset facilitates structured queries about claims, their truth values, involved entities, authors, dates, and other kinds of metadata. ClaimsKG is generated through a (semi-)automated pipeline, which harvests claim-related data from popular fact-checking web sites, annotates them with related entities from DBpedia/Wikipedia, and lifts all data to RDF using established vocabularies (such as schema.org).
The latest release of ClaimsKG covers 74066 claims and 72127 Claim Reviews. This is the fourth release of the dataset where data was scraped till Jan 31, 2023 containing claims published between 1996 and 2023 from 13 fact-checking websites. The websites are Fullfact, Politifact, TruthOrFiction, Checkyourfact, Vishvanews, AFP (French), AFP, Polygraph, EU factcheck, Factograph, Fatabyyano, Snopes and Africacheck. The claim-review (fact-checking) period for claims ranges between the year 1996 to 2023. Similar to the previous release, the Entity fishing python client ( https://github.com/hirmeos/entity-fishing-client-python ) has been used for entity linking and disambiguation in this release. Improvements have been made in the web scraping and data preprocessing pipeline to extract more entities from both claims and claims reviews. Currently, ClaimsKG contains 3408386 entities detected and referenced with DBpedia.
This latest release of ClaimsKG supersedes the previous versions as it contained all the claims from the previous versions together in addition to the additional new claims as well as improved entity annotation resulting...
Terminology used is generally based on DDI controlled vocabularies: Time Method, Analysis Unit, Sampling Procedure and Mode of Collection, available at CESSDA Vocabulary Service.
Methodology
Data collection period
01/01/2023
Country
Time dimension
Nicht verfügbar
Analysis unit
Nicht verfügbar
Universe
Nicht verfügbar
Sampling procedure
Total Universe / Complete enumeration
Kind of data
Nicht verfügbar
Data collection mode
Web scraping
Access
Publisher
GESIS Datenarchiv für Sozialwissenschaften
Publication year
2023
Terms of data access
Freier Zugang (ohne Registrierung) - Die Forschungsdaten können von jedem direkt heruntergeladen werden.
CC BY-SA 4.0: Attribution – ShareAlike (https://creativecommons.org/licenses/by-sa/4.0/deed.de)