Study title
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 12, Sep 2022 - Jun 2023)
Creator
Schellhammer, Sebastian ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Baran, Erdal ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Bensmann, Felix ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Dimitrov, Dr. Dimitar ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Dietze, Stefan ( GESIS - Leibniz-Institut für Sozialwissenschaften & Heinrich-Heine-University Düsseldorf, Germany & L3S Research Center, Hannover, Germany)
Zhang, Yudong ( GESIS - Leibniz-Institut für Sozialwissenschaften)
Data access
Informationen nicht verfügbar
Abstract
TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for more than 3.1 billion tweets, spanning more than 10 years (February 2013 - June 2023). Metadata information about the tweets as well as extracted entities, sentiments, hashtags, user mentions and URLs are exposed in RDF using established RDF/S vocabularies. For the sake of privacy, we anonymize user IDs and we do not provide the text of the tweets. For a list of the previous dataset parts, example queries and more information see the TweetsKB's home page: https://data.gesis.org/tweetskb/ .