Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An LSTM model for Twitter Sentiment Analysis (2212.01791v1)

Published 4 Dec 2022 in cs.CL and cs.SI

Abstract: Sentiment analysis on social media such as Twitter provides organizations and individuals an effective way to monitor public emotions towards them and their competitors. As a result, sentiment analysis has become an important and challenging task. In this work, we have collected seven publicly available and manually annotated twitter sentiment datasets. We create a new training and testing dataset from the collected datasets. We develop an LSTM model to classify sentiment of a tweet and evaluate the model with the new dataset.

Citations (4)

Summary

  • The paper presents an LSTM model that integrates seven manually annotated Twitter datasets to achieve superior accuracy over VADER.
  • It details a robust methodology including data preprocessing, tokenization, and GloVe embeddings to standardize unique Twitter content.
  • The study highlights practical implications, suggesting future improvements with attention mechanisms and enhanced word embeddings.

Evaluation of an LSTM Model for Twitter Sentiment Analysis

Introduction

The prevalence of social media platforms, particularly Twitter, has transformed these spaces into rich sources of real-time public sentiment on various topics. Recognizing the value in extracting these sentiments for organizations and entities has prompted the development of methodologies for Twitter sentiment analysis. This paper presents a paper wherein seven manually annotated Twitter sentiment datasets are merged to create a comprehensive training and testing set. An LSTM (Long Short-Term Memory) model is then developed and evaluated on this dataset, offering insights into the capabilities and limitations of machine learning approaches in analyzing Twitter sentiments.

Data Collection and Preprocessing

The paper begins by detailing the collection of seven distinct, manually annotated datasets previously utilized in sentiment analysis research. These datasets encompass a variety of sentiment labels, including positive, negative, and neutral, among others. A significant contribution of this paper is the effort in preprocessing and combining these datasets into a unified training and testing set, thus providing a more robust evaluation environment for the LSTM model. Preprocessing steps, crucial in normalizing tweet data for analysis, addressed elements unique to Twitter, such as hashtags, user handles, URLs, and emoticons. Each of these was standardized to ensure consistency across the dataset.

LSTM Model Architecture

The paper's core contribution is the development of an LSTM-based model tailored for sentiment analysis of Twitter data. The model architecture is described in detail, starting with word tokenization and followed by an embedding layer leveraging GloVe word embeddings. Subsequent layers include an LSTM layer, a fully connected layer, and a sigmoid activation layer, culminating in the model's output. The model's architecture is designed to process and classify tweets into sentiment categories effectively.

Experimental Results

The experimental evaluation showcased the LSTM model's performance in comparison to VADER, a lexicon and rule-based sentiment analysis tool. The LSTM model demonstrated a superior accuracy rate across all sentiment categories, with particularly noteworthy performance in identifying positive sentiments. However, it's also noted that the model's runtime, though longer than VADER's, resulted in significantly higher accuracy rates, making it a more effective tool for sentiment analysis despite the time investment.

Discussion and Future Directions

This paper's results affirm the potential of machine learning, and LSTM models specifically, in outperforming traditional lexicon-based approaches in sentiment analysis tasks. The authors suggest avenues for further improving LSTM model performance, such as employing word embeddings that incorporate emoticons and exploring attention mechanisms. These enhancements could address some of the current model's limitations, such as its relatively moderate performance in classifying neutral and negative tweets.

Concluding Remarks

The research presented provides a valuable exploration into the application of LSTM models for Twitter sentiment analysis, highlighting both the challenges and potential improvements in this space. By incorporating a broad dataset and detailing a methodical approach to model development and evaluation, this work contributes to the broader understanding of sentiment analysis methodologies and their practical applications. Future developments, as suggested by the authors, may pave the way for even more sophisticated and accurate sentiment analysis tools, further leveraging the wealth of data available through social media platforms.