Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Sentiment of Emojis (1509.07761v2)

Published 25 Sep 2015 in cs.CL
Sentiment of Emojis

Abstract: There is a new generation of emoticons, called emojis, that is increasingly being used in mobile communications and social media. In the past two years, over ten billion emojis were used on Twitter. Emojis are Unicode graphic symbols, used as a shorthand to express concepts and ideas. In contrast to the small number of well-known emoticons that carry clear emotional contents, there are hundreds of emojis. But what are their emotional contents? We provide the first emoji sentiment lexicon, called the Emoji Sentiment Ranking, and draw a sentiment map of the 751 most frequently used emojis. The sentiment of the emojis is computed from the sentiment of the tweets in which they occur. We engaged 83 human annotators to label over 1.6 million tweets in 13 European languages by the sentiment polarity (negative, neutral, or positive). About 4% of the annotated tweets contain emojis. The sentiment analysis of the emojis allows us to draw several interesting conclusions. It turns out that most of the emojis are positive, especially the most popular ones. The sentiment distribution of the tweets with and without emojis is significantly different. The inter-annotator agreement on the tweets with emojis is higher. Emojis tend to occur at the end of the tweets, and their sentiment polarity increases with the distance. We observe no significant differences in the emoji rankings between the 13 languages and the Emoji Sentiment Ranking. Consequently, we propose our Emoji Sentiment Ranking as a European language-independent resource for automated sentiment analysis. Finally, the paper provides a formalization of sentiment and a novel visualization in the form of a sentiment bar.

Sentiment of Emojis

Overview

The paper by Petra Kralj Novak, Jasmina Smailović, Borut Sluban, and Igor Mozetič presents "Sentiment of Emojis," an in-depth paper aimed at deciphering the emotional content of emojis used in social media, particularly on Twitter. The main contribution of this research is the creation of the Emoji Sentiment Ranking, a sentiment lexicon comprising 751 frequently used emojis. This lexicon enables a nuanced understanding of how these graphical symbols convey emotions within text, extending the scope of traditional sentiment analysis methodologies that primarily rely on textual cues.

Key Contributions

  1. Emoji Sentiment Lexicon: The Emoji Sentiment Ranking is the core artifact of the paper. It was constructed from a dataset of over 1.6 million tweets in 13 European languages, manually annotated by 83 native speakers. This cross-linguistic analysis found that 4% of these annotated tweets contained emojis. Sentiment scores for each emoji were derived based on their contextual usage in tweets, specifically using the sentiment labels of the containing tweets.
  2. Sentiment Distribution:: The paper reveals that most emojis are predominantly positive. This was confirmed through a statistically significant difference: tweets containing emojis averaged a sentiment score of +0.365 compared to +0.106 for tweets without emojis. Furthermore, the position of emojis in tweets tends to influence their perceived sentiment, with emojis occurring more frequently at a tweet’s end showing higher sentiment scores.
  3. Inter-Annotator Agreement: The presence of emojis enhances the inter-annotator agreement, suggesting that emojis provide a clearer emotional cue than the surrounding text alone. Three measures were applied: Krippendorff’s Alpha-reliability, Accuracy, and F1-Score for positive and negative sentiments. All measures demonstrated higher agreement levels for tweets with emojis.
  4. Cross-Language Uniformity: The Emoji Sentiment Ranking demonstrates robustness across the 13 languages studied. High correlation coefficients between the sentiment scores in specific languages and the overall Emoji Sentiment Ranking suggest that the ranking holds universally for European languages, making it a valuable resource for multilingual sentiment analysis tasks.
  5. Emoji Usage Analysis: Most frequently used emojis are more emotionally charged compared to less frequently used ones. The standard deviation in sentiment scores among high-frequency emojis is smaller, implying consistent positive sentiment.

Practical and Theoretical Implications

Practical Implications

The Emoji Sentiment Ranking serves as a vital resource for automated sentiment analysis systems. Given its language-neutral nature within European contexts, it simplifies the integration of emoji sentiment data into existing NLP pipelines, enhancing their accuracy and emotional nuance. Additionally, sentiment analysis systems can leverage this lexicon to improve human-computer interaction, customer feedback interpretations, and social media monitoring tools.

Theoretical Implications

The formalization of sentiment properties for emojis opens new research avenues in emotional AI. It supports the hypothesis that non-textual elements like emojis have a profound impact on the sentiment conveyed in social media. This work encourages further exploration into how non-verbal cues contribute to computational sentiment analysis.

Future Work

Future research could explore extending the Emoji Sentiment Ranking to non-European languages and integrating new emojis as they emerge. Moreover, investigating the temporal evolution of emoji sentiment, as new cultural phenomena and meme dynamics evolve, could provide deeper insights into the digital communication landscape. Another promising direction is to analyze the combined effects of emojis and textual sentiment features to develop more sophisticated emotion-detection models.

In conclusion, the construction of the Emoji Sentiment Ranking and the derived findings underscore the critical role of emojis in digital communication. These insights not only enhance current sentiment analysis methodologies but also lay the groundwork for future advancements in understanding and interpreting emotive content in social media.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Petra Kralj Novak (11 papers)
  2. Jasmina Smailović (4 papers)
  3. Borut Sluban (3 papers)
  4. Igor Mozetič (18 papers)
Citations (794)