Enhancing Sentiment Analysis of Corruption-Related Comments on Social Networks in Ecuador Using Techniques of Machine Learning: LSTM, Linear Regression, and Support Vector Machine

Paul S. Idrovo-Berrezueta, Denys A. Dutan-Sanchez, Gabriel A. León-Paredes

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This research focuses on obtaining effective models that can predict a person’s sentiments from written messages. While the initially proposed models were within acceptable ranges as classifiers, applying quality metrics revealed a new perspective. For this research a dataset from comments from Facebook were collected and these comments were processed in two manners. the text was first processed as a normal bag of words and the second process was a TF-IDF. Both methods were used to train 3 different models the first model is a support vector machine, the second model is a linear regression, and the third model was long-short term memory model. Upon processing our dataset with the best model, the LSTM, which achieved an accuracy of 92% and obtained the highest scores in quality metrics. Although the model was not perfect, an interesting phenomenon emerged regarding sentiment classification. A considerable percentage of comments were classified as positive concerning the search hashtag, suggesting possible support for political actors. However, the vast majority of classified comments expressed negative sentiments. This leads to a significant conclusion that evaluating sentiment in social media comments is a complex challenge due to the presence of emojis and informal writing with spelling errors and inconsistencies, which impact model results.

Original languageEnglish
Title of host publicationInformation Technology and Systems - ICITS 2024
EditorsAlvaro Rocha, Jorge Hochstetter Diez, Carlos Ferras, Mauricio Dieguez Rebolledo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages278-287
Number of pages10
ISBN (Print)9783031542343
DOIs
StatePublished - 2024
EventInternational Conference on Information Technology and Systems, ICITS 2024 - Temuco, Chile
Duration: 24 Jan 202426 Jan 2024

Publication series

NameLecture Notes in Networks and Systems
Volume932 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceInternational Conference on Information Technology and Systems, ICITS 2024
Country/TerritoryChile
CityTemuco
Period24/01/2426/01/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

Keywords

  • Corruption
  • Linear Regression
  • LSTM
  • Sentiment Analysis
  • SVM

Fingerprint

Dive into the research topics of 'Enhancing Sentiment Analysis of Corruption-Related Comments on Social Networks in Ecuador Using Techniques of Machine Learning: LSTM, Linear Regression, and Support Vector Machine'. Together they form a unique fingerprint.

Cite this