Abstract
Fraud in medical insurance claims accounts for approximately 5–10% of total claim costs, posing significant financial and reputational risks for insurers. Traditional detection methods, such as manual audits, are inefficient and prone to errors, necessitating the adoption of machine learning-based approaches. This study proposes a hybrid ensemble model for fraud detection, integrating Support Vector Machines, Random Forest, and K-Nearest Neighbors, with a Gradient Boosting meta-model for optimized predictions. The proposed method consists of four main phases: preprocessing (scaling, encoding, and class balancing using SMOTE), modeling (training individual classifiers), optimization (meta-model integration), and evaluation using Accuracy, F1-score, and RMSE. The model was validated on a real-world medical insurance claims dataset, demonstrating a 2.93% increase in accuracy, a 10.37% reduction in RMSE, and a 14.41% improvement in MAE compared to the best individual model. The main contribution is a scalable and interpretable hybrid system that enhances fraud detection in insurance. Future research will focus on expanding feature sets, incorporating external data sources, and applying interpretability techniques such as SHAP to improve transparency and adoption in real-world applications.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 10th International Congress on Information and Communication Technology - ICICT 2025 |
| Editors | Xin-She Yang, Simon Sherratt, Nilanjan Dey, Amit Joshi |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 507-516 |
| Number of pages | 10 |
| ISBN (Print) | 9789819664405 |
| DOIs | |
| State | Published - 2025 |
| Event | 10th International Congress on Information and Communication Technology, ICICT 2025 - London, United Kingdom Duration: 18 Feb 2025 → 21 Feb 2025 |
Publication series
| Name | Lecture Notes in Networks and Systems |
|---|---|
| Volume | 1416 LNNS |
| ISSN (Print) | 2367-3370 |
| ISSN (Electronic) | 2367-3389 |
Conference
| Conference | 10th International Congress on Information and Communication Technology, ICICT 2025 |
|---|---|
| Country/Territory | United Kingdom |
| City | London |
| Period | 18/02/25 → 21/02/25 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
Keywords
- Ensemble learning
- Fraud detection
- Gradient boosting
- Hybrid models
- Insurance claims
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver