Summary information

Study title

After Woolwich twitter corpus

Creator

Innes, M, Cardiff University

Study number / PID

852078 (UKDA)

10.5255/UKDA-SN-852078 (DOI)

Data access

Open

Series

Not available

Abstract

After Woolwich Twitter Corpus represents social media data collected from Twitter to analyse social reactions to the murder of Drummer Lee Rigby in Woolwich on 22 May 2013. The dataset covers a roughly 12 month span from March 29th 2013 onwards. The data enabled the tracking of the evolution of public perceptions and sentiments in real-time as key events occur. The dataset comprises of a csv format file with Tweet IDs and Date for all collected tweets. All other relevant tweet data have been omitted to comply with the Twitter API Terms of use. In order to recreate the data, utilise the Twitter API to request each tweet by ID.

The research will analyse social reactions to the murder of Drummer Lee Rigby in Woolwich on 22 May 2013 using social media data collected from Twitter, blogs and other sources. Such data uniquely enable the tracking of the evolution of public perceptions and sentiments in real-time as key events occur. They enable us to track the arc of social reactions from the crime scene through to the conclusion of the court case, to understand how public opinion and sentiment is shaped and shifts as events unfold. The work will produce new insights into the social dynamics of collective responses to high profile violent crimes, alongside methodological innovations developing text-mining methods for rigorous social scientific analyses of social media. Using a case study design applying qualitative, quantitative and geo-spatial data analysis techniques, the project will illuminate the signal event, conflict escalation and de-escalation, influence and resilience dynamics that arise in the aftermath of a major crime.

Methodology

Data collection period

15/02/2014 - 14/08/2015

Country

United Kingdom

Time dimension

Not available

Analysis unit

Text unit
Time unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Text

Data collection mode

Data was collected from Twitters streaming and search APIs using the Sentinel software stack.Sentinel programatically talks to the Twitter streaming and search APIs, which sends data back to Sentinel. This data, after passing through the Sentinel pipeline is then stored in a Mongo database.Data is requested from Twitter by location and by keyword.

Funding information

Grant number

ES/L008181/1

Access

Publisher

UK Data Service

Publication year

2016

Terms of data access

Not available

Related publications

Not available