Study title
An archive and corpus of Twitter/X's policies for Tweet redistribution 2006-2023
Creator
Golland, Luisa ( GESIS - Leibniz-Institute for the Social Sciences)
Recker, Jonas ( GESIS - Leibniz-Institute for the Social Sciences)
Schwalbach, Jan ( GESIS - Leibniz-Institute for the Social Sciences)
Data access
Informationen nicht verfügbar
Abstract
When researchers publish results based on the analysis of Tweets, good practice requires sharing the Tweets (or Tweet IDs) to enable reproducibility of the results. What may or may not be shared depends on the Twitter/X terms for the redistribution of the platform’s content. As a complete archive of these terms is currently not easily accessible, we used The Internet Archive's Wayback Machine to gather all documents from Twitter/X that detail the terms and conditions for content redistribution, in effect between 2006 and November 2023.
Based on the relevant conditions for the redistribution of Twitter/X content to third parties a "restriction_score" was assigned to each version of the terms/policies, to express how restrictive the regulations for the redistribution of content are. In addition, to facilitate analyses of the respective documents, a corpus has been created.
The included archive of .html files allows researchers who collected Twitter/X content in the past, and who are now considering sharing this content, to determine the regulation in effect at the time of Tweet collection.
Python scripts and a list of URLs are provided, which can be used to replicate the creation of the .html files published here.
An R script is provided to create the corpus from the .html files.