Optimization of an Analysis Method for Diabetes Prediction Using Classical and Ensemble Machine Learning Techniques

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Nowadays, diabetes has become a prevalent and significant illness worldwide, causing harm to the circulatory system and leading to complications such as vision loss, kidney problems, and heart disorders. Detecting diabetes early on is crucial in order to implement more effective treatments, control blood sugar levels, and reduce the risk of associated complications affecting both small and large blood vessels. It also provides an opportunity to make lifestyle changes and use targeted medications before irreversible damage occurs in organs and tissues. To achieve this, a method based on CRISP-DM is proposed, which utilizes five traditional machine learning algorithms and ensemble techniques, including RandomForest, DecisionTree, XGboost, Logistic Regression, and Neural Networks. These algorithms are applied to a dataset containing 15,000 records from the National Institute of Diabetes and Digestive and Kidney Diseases [1]. To assess the effectiveness of the predictive models, quality measures such as Accuracy, Precision, Recall, and F1-Score are used for comparison.

Original languageEnglish
Title of host publicationProceedings of 9th International Congress on Information and Communication Technology - ICICT 2024
EditorsXin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages527-536
Number of pages10
ISBN (Print)9789819735587
DOIs
StatePublished - 2024
Event9th International Congress on Information and Communication Technology, ICICT 2024 - London, United Kingdom
Duration: 19 Feb 202422 Feb 2024

Publication series

NameLecture Notes in Networks and Systems
Volume1013 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference9th International Congress on Information and Communication Technology, ICICT 2024
Country/TerritoryUnited Kingdom
CityLondon
Period19/02/2422/02/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Keywords

  • Cross-validation
  • Decision tree
  • Diabetes prediction
  • Logistic regression
  • Machine learning
  • Neural network
  • Random forest
  • XGBoost

Cite this