Resumen
This work shows similarity metrics behavior on sparse data for recommender systems (RS). Clustering in RS is an important technique to perform groups of users or items with the purpose of personalization and optimization recommendations. The majority of clustering techniques try to minimize the Euclidean distance between the samples and their centroid, but this technique has a drawback on sparse data because it considers the lack of value as zero. We propose a comparative analysis of similarity metrics like Pearson Correlation, Jaccard, Mean Square Difference, Jaccard Mean Square Difference and Mean Jaccard Difference as an alternative method to Euclidean distance, our work shows results for FilmTrust and MovieLens 100K datasets, these both free and public with high sparsity. We probe that using similarity measures is better for accuracy in terms of Mean Absolute Error and Within-Cluster on sparse data.
| Idioma original | Inglés estadounidense |
|---|---|
| Título de la publicación alojada | A comparative analysis of similarity metrics on sparse data for clustering in recommender systems |
| Editores | Tareq Z. Ahram |
| Páginas | 291-299 |
| Número de páginas | 9 |
| ISBN (versión digital) | 9783319942285 |
| DOI | |
| Estado | Publicada - 1 ene. 2019 |
| Evento | Advances in Intelligent Systems and Computing - , Alemania Duración: 1 ene. 2015 → … |
Serie de la publicación
| Nombre | Advances in Intelligent Systems and Computing |
|---|---|
| Volumen | 787 |
| ISSN (versión impresa) | 2194-5357 |
Conferencia
| Conferencia | Advances in Intelligent Systems and Computing |
|---|---|
| País/Territorio | Alemania |
| Período | 1/01/15 → … |
Areas de Conocimiento del CACES
- 116A Computación
Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver