Summary information

Study title

Online Hostility Towards UK MPs, 2019-2022

Creator

Pandya, M, University of Sheffield

Bontcheva, K, University of Sheffield

Maynard, D, University of Sheffield

Study number / PID

857099 (UKDA)

10.5255/UKDA-SN-857099 (DOI)

Data access

Restricted

Series

Not available

Abstract

This is a dataset with tweets from X. Each tweet mentions one or more UK MPs from a subset selected for our study to give a diverse representation of political leanings. Each tweet is labelled for hostility and the identity characteristic it targets (religion, race, gender). Each annotator also provides a confidence score for each label. Three annotators annotate each tweet. Annotators are UK-based students from Computer Science and Politics.Toxic and abusive language threaten the integrity of public dialogue and democracy. Abusive language, such as taunts, slurs, racism, extremism, crudeness, provocation and disguise are generally considered offensive and insulting, has been linked to political polarisation and citizen apathy; the rise of terrorism and radicalisation; and cyberbullying. In response, governments worldwide have enacted strong laws against abusive language that leads to hatred, violence and criminal offences against a particular group. This includes legal obligations to moderate (i.e., detection, evaluation, and potential removal or deletion) online material containing hateful or illegal language in a timely manner; and social media companies have adopted even more stringent regulations in their terms of use. The last few years, however, have seen a significant surge in such abusive online behaviour, leaving governments, social media platforms, and individuals struggling to deal with the consequences. The responsible (i.e. effective, fair and unbiased) moderation of abusive language carries significant practical, cultural, and legal challenges. While current legislation and public outrage demand a swift response, we do not yet have effective human or technical processes that can address this need. The widespread deployment of human content moderators is costly and inadequate on many levels: the nature of the work is psychologically challenging, and significant efforts lag behind the deluge of data posted every second. At the same time,...

Media, communications and languagePoliticsScience and technologySociety and culture

SOCIAL MEDIAHOSTILITYCOMPUTER SCIENCE2024

Data collection period

01/02/2020 - 31/01/2024

Country

United Kingdom

Time dimension

Not available

Analysis unit

Text unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Text

Data collection mode

We collected data from X using the Twitter API v1.1. The collector collected tweets based onUK MPs' user accounts (X handles). Four types of tweets were collected - tweets by the MPs, replies to their tweets, retweets by the MPs, and retweets of their tweets.

Grant number

ES/T012714/1

Publisher

UK Data Service

Publication year

2024

Terms of data access

The Data Collection is available from an external repository. Access is available via Related Resources.

Not available

Study title

Creator

Study number / PID

Data access

Series

Abstract

Topics

Keywords

Methodology

Data collection period

Country

Time dimension

Analysis unit

Universe

Sampling procedure

Kind of data

Data collection mode

Funding information

Grant number

Access

Publisher

Publication year

Terms of data access

Related publications