Presumptive Detection of Cyberbullying on Twitter through Natural Language Processing and Machine Learning in the Spanish Language

Gabriel A. Leon-Paredes, Wilson F. Palomeque-Leon, Pablo L. Gallegos-Segovia, Paul E. Vintimilla-Tapia, Jack F. Bravo-Torres, Liliana I. Barbosa-Santillan, Maria M. Paredes-Pinos

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

The constant development of information and communication technologies (ICTs) has changed interpersonal interaction, allowing real experiences to be transferred to a virtualized medium such as the Internet. Although this breaks the space-time barriers of traditional communication and strengthens social relationships, problems related to adverse behaviors may arise. Bullying, defined as an act that threatens a person's holistic well-being, becomes cyberbullying when it is carried out over the Internet, and it can cause anxiety, depression, and even suicide attempts. For this reason, it is essential to detect this type of behavior in time. This research deploys a Spanish cyberbullying prevention system (SPC), which relies on Natural Language Processing (NLP) methods and three machine learning techniques (Naive Bayes, Support Vector Machine, and Logistic Regression), using Twitter as the source from which the knowledge bases, or corpora, are extracted. Several precision metrics and corpora of varying size are used for training. The trained models reach a maximum accuracy of 93%, verified through three case studies.
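As a rough illustration of the kind of classifier the abstract names, the following is a minimal sketch of a multinomial Naive Bayes text classifier with add-one smoothing, written with the Python standard library only. It is not the authors' SPC system: the tokenizer, the `NaiveBayes` class, the label names, and the four toy training sentences are all assumptions invented for this example, and the record does not describe the preprocessing or features actually used.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep runs of letters, including Spanish accents."""
    return re.findall(r"[a-záéíóúüñ]+", text.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)        # documents per class
        self.word_counts = defaultdict(Counter)    # token counts per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        # Tokens never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            total_words = sum(self.word_counts[label].values())
            for t in tokens:
                # Smoothed log likelihood of the token given the class.
                count = self.word_counts[label][t] + 1
                score += math.log(count / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy corpus, invented for illustration (not data from the paper).
TRAIN = [
    ("eres un idiota y nadie te quiere", "bullying"),
    ("cállate estúpido das asco", "bullying"),
    ("qué buen partido el de hoy", "neutral"),
    ("gracias por tu ayuda amigo", "neutral"),
]

texts, labels = zip(*TRAIN)
model = NaiveBayes().fit(texts, labels)
print(model.predict("nadie te quiere idiota"))
print(model.predict("gracias amigo buen partido"))
```

A realistic system would train on a far larger labeled corpus and report accuracy on held-out tweets. The Support Vector Machine and Logistic Regression models mentioned in the abstract would replace the `NaiveBayes` class while consuming the same tokenized input.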

Original language: English
Title of host publication: IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728131856
DOIs
State: Published - Nov 2019
Event: 2019 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2019 - Valparaiso, Chile
Duration: 13 Nov 2019 to 27 Nov 2019

Publication series

Name: IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2019

Conference

Conference: 2019 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, CHILECON 2019
Country/Territory: Chile
City: Valparaiso
Period: 13/11/19 to 27/11/19

Bibliographical note

Funding Information:
ACKNOWLEDGMENT: The authors would like to thank the Universidad Politécnica Salesiana, Sede Cuenca, and especially its Research Groups GIHP4C and GITEL, for the support provided during this research.

Publisher Copyright:
© 2019 IEEE.

Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.

Keywords

  • Cyberbullying
  • Expert System
  • Natural Language Processing
  • Semantics
  • Sentiment Analysis
  • Spanish Language Processing
