In two decades, the number of people diagnosed with cancer has almost doubled: in 2000, about 10 million people were diagnosed; by 2020, the figure had reached 19 million. It is estimated that one in five people today will develop some form of cancer in their lifetime. Studies suggest that the number of people diagnosed with cancer will keep increasing in the coming years and will be approximately 50% higher in 2040 than in 2020. This article presents an analysis method to predict or diagnose breast cancer using data science processes and machine learning. The method is structured into three phases: data preparation, predictive analysis, and evaluation. Predictions are made with three machine learning techniques: k-nearest neighbors (KNN), a gradient boosting classifier, and random forest. Each is evaluated with the following quality measures: accuracy, precision, recall, and F1-score. The dataset selected for the analysis is the Wisconsin breast cancer dataset. The same analysis can be extended to other learning techniques and reused in future scientific work, such as disease prediction or medicine in general.
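The four quality measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `binary_metrics` and the label vectors are hypothetical, chosen only to show how accuracy, precision, recall, and F1-score follow from the counts of true and false positives and negatives for a binary diagnosis (1 = malignant, 0 = benign).

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for binary labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts with respect to the positive class.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    # Guard every ratio against a zero denominator.
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels for ten cases (assumption: 1 = malignant, 0 = benign).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

scores = binary_metrics(y_true, y_pred)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

In a comparison like the one the abstract describes, `y_pred` would come from each classifier in turn (KNN, gradient boosting, random forest) on the same held-out test split, so that the four scores are directly comparable across models.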
|Title of host publication||2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022|
|Publisher||Institute of Electrical and Electronics Engineers Inc.|
|State||Published - 2022|
|Event||2022 IEEE International Autumn Meeting on Power, Electronics and Computing, ROPEC 2022 - Ixtapa, Mexico|
Duration: 9 Nov 2022 → 11 Nov 2022
Bibliographical note
Publisher Copyright:
© 2022 IEEE.