Abstract
In software development, defects are an inevitable part of the process. They can occur at any stage, from requirements gathering to coding to testing. Software defect prediction (SDP) can help minimize costs, guide the testing effort effectively, and ultimately improve overall software quality. This study presents an innovative approach using a machine learning framework with a Voting Ensemble model using k-Nearest Neighbors (KNN) and Support Vector Machines (SVM). In addition, the study delves into the challenge posed by imbalanced data sets, a common problem in SDP, and employs several techniques such as NearMiss, RandomOverSampler, SMOTETomek, and BalancedBaggingClassifier to address this imbalance. The results demonstrate a marked improvement in the detection of defective modules at the cost of a decrease in precision, a trade-off that is considered beneficial in scenarios where detecting all defects is critical. The balanced approach between precision and recall highlights the model’s increased sensitivity and ability to identify critical cases, essential to ensure software quality and reliability. This research contributes significantly to the SDP field, offering a balanced and effective solution for defect detection and test planning optimization in software development.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 9th International Congress on Information and Communication Technology - ICICT 2024 |
| Editors | Xin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 585-595 |
| Number of pages | 11 |
| ISBN (Print) | 9789819735587 |
| DOIs | |
| State | Published - 2024 |
| Event | 9th International Congress on Information and Communication Technology, ICICT 2024 - London, United Kingdom Duration: 19 Feb 2024 → 22 Feb 2024 |
Publication series
| Name | Lecture Notes in Networks and Systems |
|---|---|
| Volume | 1013 LNNS |
| ISSN (Print) | 2367-3370 |
| ISSN (Electronic) | 2367-3389 |
Conference
| Conference | 9th International Congress on Information and Communication Technology, ICICT 2024 |
|---|---|
| Country/Territory | United Kingdom |
| City | London |
| Period | 19/02/24 → 22/02/24 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
Keywords
- Binary classification
- Ensemble
- Software defect prediction
CACES Knowledge Areas
- 8116A Information Systems
- 116A Computer Science
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver