Implementation of Machine Learning Models to Classify Security Incidents in Industrial Systems

David Andres Caiza Chafla, William Manuel Montalvo Lopez

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This research is focused on classifying security data in industrial control systems (ICS) using machine learning models. Currently, ICS mainly focus on the technical and industrial operation of technological infrastructures, neglecting their security. This practice is dangerous as it affects various critical sectors of society. Due to the scarce information and difficult access to security incident data in industrial systems, this study employed web scraping to create a data set called 'SI ICS UPS 2023' with 2914 records of non-null security incidents in text format. For the labeling phase, regular expressions were applied to standardize the data set and propose two main classes of interest in this study. Data cleaning and processing stages were implemented, followed by the training of four machine learning models from scratch. The best-performing model in terms of the area under the curve (AUC) was the Random Forest with a score of 0.76 and an accuracy of 71.20%. These results demonstrate the efficiency of automating processes for the collection and classification of cyber incident data in industrial environments using techniques like web scraping and the utilization of machine learning models.

Original languageEnglish
Title of host publicationChileCon 2023 - 2023 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350369533
DOIs
StatePublished - 2023
Event2023 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, ChileCon 2023 - Hybrid, Valdivia, Chile
Duration: 5 Dec 20237 Dec 2023

Publication series

NameProceedings - IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, ChileCon
ISSN (Print)2832-1529
ISSN (Electronic)2832-1537

Conference

Conference2023 IEEE CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies, ChileCon 2023
Country/TerritoryChile
CityHybrid, Valdivia
Period5/12/237/12/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • ICS
  • incidents
  • industrial systems
  • machine learning
  • security
  • web scraping

Fingerprint

Dive into the research topics of 'Implementation of Machine Learning Models to Classify Security Incidents in Industrial Systems'. Together they form a unique fingerprint.

Cite this