A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering

Mariela Cerrada, Jose Aguilar, Junior Altamiranda, René Vinicio Sánchez

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

5 Citas (Scopus)

Resumen

Machine Learning is currently an important research field that attracts interest due to its importance for discovering hidden knowledge or patterns from big datasets. In this paper, we propose a heuristic algorithm which can solve problems related to only classification, only clustering, or classification with clustering by creating models with the ability to evolve to another class/cluster configuration without a retraining process for new incoming data. This algorithm combines supervised and unsupervised learning principles for the incremental construction of both classes and clusters, by using the main guidelines from two classical methods of classification based on distance and clustering based on prototypes, such as KNN and K-means. The algorithm is able to deal with labeled and unlabeled samples as inputs in order to create new groups (classes or clusters), merge or reconfigure existing ones. Basically, the creation of new groups follows three sequential steps: (i) locate the provisional group for an input sample using K-means, (ii) using 1NN, locate the nearest sample to the input sample, only considering the samples in the provisional group, and (iii) merge or reconfigure existing groups following specific guidelines. Several benchmarks, related to classification and clustering problems, were evaluated by our proposal; the results were compared with classical algorithms. On the other hand, artificial datasets with labeled and unlabeled samples have been created to show the ability of our algorithm in the hybrid context to solve classification and clustering combined. As a result, the algorithm is able to create clusters and classes, simultaneously, when required. Finally, a real case study of fault diagnosis in rotating machinery is presented for discovering new groups that might represent patterns from unknown data.

Idioma originalInglés
Páginas (desde-hasta)755-798
Número de páginas44
PublicaciónKnowledge and Information Systems
Volumen61
N.º2
DOI
EstadoPublicada - 1 nov. 2019

Huella

Profundice en los temas de investigación de 'A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering'. En conjunto forman una huella única.

Citar esto