Summary information

Study title

Sarcastic Soulmates: Intimacy and irony markers in social media messaging

Creator

FA Kunneman (Radboud University)

Study number / PID

doi:10.17026/dans-24j-68qr (DOI)

easy-dataset:65746 (DANS-KNAW)

Data access

Information not available

Series

Not available

Abstract

We research the use of sarcasm on Twitter, and show that a computer has more difficulty to detect sarcasm shared among peers than sarcasm shared with any interested audience. This data set features the data used for training machine learning classifiers, and annotations of the output.

- Usercategory (and User) indicates whether the element is a feature of the classifier on the basis of USER-tweets or NOUSER-tweets.
- Featurerankwithincategory indicates the importance of the feature for the respecting classifier.
- Frequencyelementwithinusertweets indicates how often the feature was observed in the complete set with USER-tweets.
- Frequencyelementwithinnouser indicates how often the feature was observed in the complete set with NOUSER-tweets.
- Totalamountofmarkers is the sum of irony markers (Hyperbole, Interjections, Repetition, Hashtag, Capitals, Punctuation Marks and Emoticons).
- The values of each marker indicate the presence of the marker, a 1 indicates the presence and a 0 indicates the absence. The marker ‘polarity’ forms an exception. In this case a value of 0 indicates no evaluation, 1 indicates a negative polarity of the evaluation and 2 indicates a positive polarity of the evaluation.

Radboud University supplied the 'top_features_annotations' file in .xlsx format. For preservation purposes, DANS added the .csv format.

Not available

Arts and HumanitiesArts and HumanitiesTwitterVerbal IronyEvaluative markers

Data collection period

Not available

Country

Time dimension

Not available

Analysis unit

Not available

Universe

Not available

Sampling procedure

Not available

Kind of data

Not available

Data collection mode

Not available

Publisher

DANS Data Station Social Sciences and Humanities

Publication year

2016

Terms of data access

Not available

Study title

Creator

Study number / PID

Data access

Series

Abstract

Topics

Keywords

Methodology

Data collection period

Country

Time dimension

Analysis unit

Universe

Sampling procedure

Kind of data

Data collection mode

Access

Publisher

Publication year

Terms of data access

Related publications