Summary information

Study title

Geotagged Twitter posts from the United States: A tweet collection to investigate representativeness

Creator

Pfeffer, Jürgen (Carnegie Mellon University)
Morstatter, Fred (Arizona State University)

Study number / PID

10.7802/1166 (GESIS)

10.7802/1166 (DOI)

Data access

Information not available

Series

Not available

Abstract

This dataset consists of IDs of geotagged Twitter posts from within the United States. They are provided as files per day and state as well as per day and county. In addition, files containing the aggregated number of hashtags from these tweets are provided per day and state and per day and county. This data is organized as a ZIP-file per month containing several zip-files per day which hold the txt-files with the ID/hash information. Also part of the dataset are two shapefiles for the US counties and states and Python scripts for the data collection and sorting geotags into counties.

Topics

Not available

Keywords

Not available

Methodology

Data collection period

01/06/2014 - 30/11/2015

Country

United States

Time dimension

Not available

Analysis unit

Not available

Universe

Geotagged tweets from the US

Sampling procedure

No user sampling. Selection of Tweets with geo-location within bounding box for the United States (-128.6, 24.5), (-59, 50). Sampling by Twitter API unknown.

Kind of data

Not available

Data collection mode

Recording

Access

Publisher

GESIS Data Archive for the Social Sciences

Publication year

2016

Terms of data access

Restricted Access - To get access to the research data, the original data depositor's consent is needed. Access to this dataset will be granted only for scientific purposes upon request. Attribution is required. Redistribution is not allowed.

Related publications

Not available