Study title
Sarcastic Soulmates: Intimacy and irony markers in social media messaging
Creator
Study number / PID
doi:10.17026/dans-24j-68qr (DOI)
easy-dataset:65746 (DANS-KNAW)
Data access
Information not available
Series
Abstract
We research the use of sarcasm on Twitter, and show that a computer has more difficulty to detect sarcasm shared among peers than sarcasm shared with any interested audience. This data set features the data used for training machine learning classifiers, and annotations of the output.
- Usercategory (and User) indicates whether the element is a feature of the classifier on the basis of USER-tweets or NOUSER-tweets.
- Featurerankwithincategory indicates the importance of the feature for the respecting classifier.
- Frequencyelementwithinusertweets indicates how often the feature was observed in the complete set with USER-tweets.
- Frequencyelementwithinnouser indicates how often the feature was observed in the complete set with NOUSER-tweets.
- Totalamountofmarkers is the sum of irony markers (Hyperbole, Interjections, Repetition, Hashtag, Capitals, Punctuation Marks and Emoticons).
- The values of each marker indicate the presence of the marker, a 1 indicates the presence and a 0 indicates the absence. The marker ‘polarity’ forms an exception. In this case a value of 0 indicates no evaluation, 1 indicates a negative polarity of the evaluation and 2 indicates a positive polarity of the evaluation.
Radboud University supplied the 'top_features_annotations' file in .xlsx format. For preservation purposes, DANS added the .csv format.
Topics
Keywords
Methodology
Data collection period
Not availableCountry
Time dimension
Not availableAnalysis unit
Not availableUniverse
Not availableSampling procedure
Not availableKind of data
Not availableData collection mode
Not availableAccess
Publisher
DANS Data Station Social Sciences and Humanities
Publication year
2016