Summary information

Study title

Online Hostility Towards UK MPs, 2019-2022

Creator

Pandya, M, University of Sheffield
Bontcheva, K, University of Sheffield
Maynard, D, University of Sheffield

Study number / PID

857099 (UKDA)

10.5255/UKDA-SN-857099 (DOI)

Data access

Restricted

Series

Not available

Abstract

This is a dataset with tweets from X. Each tweet mentions one or more UK MPs from a subset selected for our study to give a diverse representation of political leanings. Each tweet is labelled for hostility and the identity characteristic it targets (religion, race, gender). Each annotator also provides a confidence score for each label. Three annotators annotate each tweet. Annotators are UK-based students from Computer Science and Politics.Toxic and abusive language threaten the integrity of public dialogue and democracy. Abusive language, such as taunts, slurs, racism, extremism, crudeness, provocation and disguise are generally considered offensive and insulting, has been linked to political polarisation and citizen apathy; the rise of terrorism and radicalisation; and cyberbullying. In response, governments worldwide have enacted strong laws against abusive language that leads to hatred, violence and criminal offences against a particular group. This includes legal obligations to moderate (i.e., detection, evaluation, and potential removal or deletion) online material containing hateful or illegal language in a timely manner; and social media companies have adopted even more stringent regulations in their terms of use. The last few years, however, have seen a significant surge in such abusive online behaviour, leaving governments, social media platforms, and individuals struggling to deal with the consequences. The responsible (i.e. effective, fair and unbiased) moderation of abusive language carries significant practical, cultural, and legal challenges. While current legislation and public outrage demand a swift response, we do not yet have effective human or technical processes that can address this need. The widespread deployment of human content moderators is costly and inadequate on many levels: the nature of the work is psychologically challenging, and significant efforts lag behind the deluge of data posted every second. At the same time,...
Read more

Methodology

Data collection period

01/02/2020 - 31/01/2024

Country

United Kingdom

Time dimension

Not available

Analysis unit

Text unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Text

Data collection mode

We collected data from X using the Twitter API v1.1. The collector collected tweets based onUK MPs' user accounts (X handles). Four types of tweets were collected - tweets by the MPs, replies to their tweets, retweets by the MPs, and retweets of their tweets.

Funding information

Grant number

ES/T012714/1

Access

Publisher

UK Data Service

Publication year

2024

Terms of data access

The Data Collection is available from an external repository. Access is available via Related Resources.

Related publications

Not available