Skip to main navigation Skip to search Skip to main content

Software Defect Prediction: A Machine Learning Approach with Voting Ensemble

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In software development, defects are an inevitable part of the process. They can occur at any stage, from requirements gathering to coding to testing. Software defect prediction (SDP) can help minimize costs, guide the testing effort effectively, and ultimately improve overall software quality. This study presents an innovative approach using a machine learning framework with a Voting Ensemble model using k-Nearest Neighbors (KNN) and Support Vector Machines (SVM). In addition, the study delves into the challenge posed by imbalanced data sets, a common problem in SDP, and employs several techniques such as NearMiss, RandomOverSampler, SMOTETomek, and BalancedBaggingClassifier to address this imbalance. The results demonstrate a marked improvement in the detection of defective modules at the cost of a decrease in precision, a trade-off that is considered beneficial in scenarios where detecting all defects is critical. The balanced approach between precision and recall highlights the model’s increased sensitivity and ability to identify critical cases, essential to ensure software quality and reliability. This research contributes significantly to the SDP field, offering a balanced and effective solution for defect detection and test planning optimization in software development.

Original languageEnglish
Title of host publicationProceedings of 9th International Congress on Information and Communication Technology - ICICT 2024
EditorsXin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages585-595
Number of pages11
ISBN (Print)9789819735587
DOIs
StatePublished - 2024
Event9th International Congress on Information and Communication Technology, ICICT 2024 - London, United Kingdom
Duration: 19 Feb 202422 Feb 2024

Publication series

NameLecture Notes in Networks and Systems
Volume1013 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference9th International Congress on Information and Communication Technology, ICICT 2024
Country/TerritoryUnited Kingdom
CityLondon
Period19/02/2422/02/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Keywords

  • Binary classification
  • Ensemble
  • Software defect prediction

CACES Knowledge Areas

  • 8116A Information Systems
  • 116A Computer Science

Cite this