Abstract
This research focuses on obtaining effective models that can predict a person’s sentiments from written messages. While the initially proposed models were within acceptable ranges as classifiers, applying quality metrics revealed a new perspective. For this research a dataset from comments from Facebook were collected and these comments were processed in two manners. the text was first processed as a normal bag of words and the second process was a TF-IDF. Both methods were used to train 3 different models the first model is a support vector machine, the second model is a linear regression, and the third model was long-short term memory model. Upon processing our dataset with the best model, the LSTM, which achieved an accuracy of 92% and obtained the highest scores in quality metrics. Although the model was not perfect, an interesting phenomenon emerged regarding sentiment classification. A considerable percentage of comments were classified as positive concerning the search hashtag, suggesting possible support for political actors. However, the vast majority of classified comments expressed negative sentiments. This leads to a significant conclusion that evaluating sentiment in social media comments is a complex challenge due to the presence of emojis and informal writing with spelling errors and inconsistencies, which impact model results.
Original language | English |
---|---|
Title of host publication | Information Technology and Systems - ICITS 2024 |
Editors | Alvaro Rocha, Jorge Hochstetter Diez, Carlos Ferras, Mauricio Dieguez Rebolledo |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 278-287 |
Number of pages | 10 |
ISBN (Print) | 9783031542343 |
DOIs | |
State | Published - 2024 |
Event | International Conference on Information Technology and Systems, ICITS 2024 - Temuco, Chile Duration: 24 Jan 2024 → 26 Jan 2024 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 932 LNNS |
ISSN (Print) | 2367-3370 |
ISSN (Electronic) | 2367-3389 |
Conference
Conference | International Conference on Information Technology and Systems, ICITS 2024 |
---|---|
Country/Territory | Chile |
City | Temuco |
Period | 24/01/24 → 26/01/24 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
Keywords
- Corruption
- Linear Regression
- LSTM
- Sentiment Analysis
- SVM