Summary information

Study title

After Woolwich twitter corpus

Creator

Innes, M, Cardiff University

Study number / PID

852078 (UKDA)

10.5255/UKDA-SN-852078 (DOI)

Data access

Open

Series

Not available

Abstract

After Woolwich Twitter Corpus represents social media data collected from Twitter to analyse social reactions to the murder of Drummer Lee Rigby in Woolwich on 22 May 2013. The dataset covers a roughly 12 month span from March 29th 2013 onwards. The data enabled the tracking of the evolution of public perceptions and sentiments in real-time as key events occur. The dataset comprises of a csv format file with Tweet IDs and Date for all collected tweets. All other relevant tweet data have been omitted to comply with the Twitter API Terms of use. In order to recreate the data, utilise the Twitter API to request each tweet by ID.

The research will analyse social reactions to the murder of Drummer Lee Rigby in Woolwich on 22 May 2013 using social media data collected from Twitter, blogs and other sources. Such data uniquely enable the tracking of the evolution of public perceptions and sentiments in real-time as key events occur. They enable us to track the arc of social reactions from the crime scene through to the conclusion of the court case, to understand how public opinion and sentiment is shaped and shifts as events unfold. The work will produce new insights into the social dynamics of collective responses to high profile violent crimes, alongside methodological innovations developing text-mining methods for rigorous social scientific analyses of social media. Using a case study design applying qualitative, quantitative and geo-spatial data analysis techniques, the project will illuminate the signal event, conflict escalation and de-escalation, influence and resilience dynamics that arise in the aftermath of a major crime.

Society and culture

WOOLWICHLEE RIGBYTWITTERPUBLIC DISCUSSION2016

Data collection period

15/02/2014 - 14/08/2015

Country

United Kingdom

Time dimension

Not available

Analysis unit

Text unit

Time unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Text

Data collection mode

Data was collected from Twitters streaming and search APIs using the Sentinel software stack.Sentinel programatically talks to the Twitter streaming and search APIs, which sends data back to Sentinel. This data, after passing through the Sentinel pipeline is then stored in a Mongo database.Data is requested from Twitter by location and by keyword.

Grant number

ES/L008181/1

Publisher

UK Data Service

Publication year

2016

Terms of data access

Not available

Study title

Creator

Study number / PID

Data access

Series

Abstract

Topics

Keywords

Methodology

Data collection period

Country

Time dimension

Analysis unit

Universe

Sampling procedure

Kind of data

Data collection mode

Funding information

Grant number

Access

Publisher

Publication year

Terms of data access

Related publications