An analysis method for predicting breast cancer using data science processes and machine learning

Juan Jose Cordova Calle, John Xavier Farez Villa, Remigio Ismael Hurtado Ortiz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In two decades, the number of people with breast cancer has almost doubled: in 2000, about 10 million patients had the disease; by 2020, it had reached 19 million. It is estimated that one in five people today will develop some form of cancer in their lifetime. Studies suggest that the number of people diagnosed with cancer will increase in the coming years, being approximately 50% higher in 2040 than in 2020. This article provides an analysis method to predict or diagnose breast cancer using data science processes and machine learning. The analysis method is structured into three phases. The first one is a data preparation phase, the second one is a predictive analysis phase, and the last one is an evaluation metric. Therefore, the predictions are experimented with machine learning techniques, which are: KNN, gradient boosting classifier, and random forest, for which evaluation metrics are presented with the next quality measures: Accuracy, Precision, Recall, and F1-Score. The dataset selected for this phase of analysis is Wisconsin breast cancer [1]. These data analysis techniques can be extended to other learning techniques and can also be used in future scientific work such as disease prediction or medicine in general.

Original languageEnglish
Title of host publication2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665458924
DOIs
StatePublished - 2022
Event2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022 - Ixtapa, Mexico
Duration: 9 Nov 202211 Nov 2022

Publication series

Name2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022

Conference

Conference2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022
Country/TerritoryMexico
CityIxtapa
Period9/11/2211/11/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Fingerprint

Dive into the research topics of 'An analysis method for predicting breast cancer using data science processes and machine learning'. Together they form a unique fingerprint.

Cite this