Skip to main navigation Skip to search Skip to main content

A Novel Fraud Detection Method in Medical Insurance Claims Using Ensemble Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Fraud in medical insurance claims accounts for approximately 5–10% of total claim costs, posing significant financial and reputational risks for insurers. Traditional detection methods, such as manual audits, are inefficient and prone to errors, necessitating the adoption of machine learning-based approaches. This study proposes a hybrid ensemble model for fraud detection, integrating Support Vector Machines, Random Forest, and K-Nearest Neighbors, with a Gradient Boosting meta-model for optimized predictions. The proposed method consists of four main phases: preprocessing (scaling, encoding, and class balancing using SMOTE), modeling (training individual classifiers), optimization (meta-model integration), and evaluation using Accuracy, F1-score, and RMSE. The model was validated on a real-world medical insurance claims dataset, demonstrating a 2.93% increase in accuracy, a 10.37% reduction in RMSE, and a 14.41% improvement in MAE compared to the best individual model. The main contribution is a scalable and interpretable hybrid system that enhances fraud detection in insurance. Future research will focus on expanding feature sets, incorporating external data sources, and applying interpretability techniques such as SHAP to improve transparency and adoption in real-world applications.

Original languageEnglish
Title of host publicationProceedings of 10th International Congress on Information and Communication Technology - ICICT 2025
EditorsXin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages507-516
Number of pages10
ISBN (Print)9789819664405
DOIs
StatePublished - 2025
Event10th International Congress on Information and Communication Technology, ICICT 2025 - London, United Kingdom
Duration: 18 Feb 202521 Feb 2025

Publication series

NameLecture Notes in Networks and Systems
Volume1416 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference10th International Congress on Information and Communication Technology, ICICT 2025
Country/TerritoryUnited Kingdom
CityLondon
Period18/02/2521/02/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords

  • Ensemble learning
  • Fraud detection
  • Gradient boosting
  • Hybrid models
  • Insurance claims

Cite this