A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering

Mariela Cerrada; Jose Aguilar; Junior Altamiranda; René Vinicio Sánchez

doi:10.1007/s10115-019-01336-3

A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering

Mariela Cerrada, Jose Aguilar, Junior Altamiranda, René Vinicio Sánchez

Grupo de Investigación y Desarrollo en Tecnologías Industriales (GIDTEC)

Producción científica: Contribución a una revista › Artículo › revisión exhaustiva

5 Citas (Scopus)

Resumen

Machine Learning is currently an important research field that attracts interest due to its importance for discovering hidden knowledge or patterns from big datasets. In this paper, we propose a heuristic algorithm which can solve problems related to only classification, only clustering, or classification with clustering by creating models with the ability to evolve to another class/cluster configuration without a retraining process for new incoming data. This algorithm combines supervised and unsupervised learning principles for the incremental construction of both classes and clusters, by using the main guidelines from two classical methods of classification based on distance and clustering based on prototypes, such as KNN and K-means. The algorithm is able to deal with labeled and unlabeled samples as inputs in order to create new groups (classes or clusters), merge or reconfigure existing ones. Basically, the creation of new groups follows three sequential steps: (i) locate the provisional group for an input sample using K-means, (ii) using 1NN, locate the nearest sample to the input sample, only considering the samples in the provisional group, and (iii) merge or reconfigure existing groups following specific guidelines. Several benchmarks, related to classification and clustering problems, were evaluated by our proposal; the results were compared with classical algorithms. On the other hand, artificial datasets with labeled and unlabeled samples have been created to show the ability of our algorithm in the hybrid context to solve classification and clustering combined. As a result, the algorithm is able to create clusters and classes, simultaneously, when required. Finally, a real case study of fault diagnosis in rotating machinery is presented for discovering new groups that might represent patterns from unknown data.

Idioma original	Inglés
Páginas (desde-hasta)	755-798
Número de páginas	44
Publicación	Knowledge and Information Systems
Volumen	61
N.º	2
DOI	https://doi.org/10.1007/s10115-019-01336-3
Estado	Publicada - 1 nov. 2019

Acceder al documento

10.1007/s10115-019-01336-3

Otros archivos y enlaces

Citar esto

@article{20a7fc3673b848fc9783608e9706aed9,

title = "A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering",

abstract = "Machine Learning is currently an important research field that attracts interest due to its importance for discovering hidden knowledge or patterns from big datasets. In this paper, we propose a heuristic algorithm which can solve problems related to only classification, only clustering, or classification with clustering by creating models with the ability to evolve to another class/cluster configuration without a retraining process for new incoming data. This algorithm combines supervised and unsupervised learning principles for the incremental construction of both classes and clusters, by using the main guidelines from two classical methods of classification based on distance and clustering based on prototypes, such as KNN and K-means. The algorithm is able to deal with labeled and unlabeled samples as inputs in order to create new groups (classes or clusters), merge or reconfigure existing ones. Basically, the creation of new groups follows three sequential steps: (i) locate the provisional group for an input sample using K-means, (ii) using 1NN, locate the nearest sample to the input sample, only considering the samples in the provisional group, and (iii) merge or reconfigure existing groups following specific guidelines. Several benchmarks, related to classification and clustering problems, were evaluated by our proposal; the results were compared with classical algorithms. On the other hand, artificial datasets with labeled and unlabeled samples have been created to show the ability of our algorithm in the hybrid context to solve classification and clustering combined. As a result, the algorithm is able to create clusters and classes, simultaneously, when required. Finally, a real case study of fault diagnosis in rotating machinery is presented for discovering new groups that might represent patterns from unknown data.",

keywords = "Classification, Clustering, Data mining, Evolving learning, Hybrid learning, Incremental learning",

author = "Mariela Cerrada and Jose Aguilar and Junior Altamiranda and S{\'a}nchez, {Ren{\'e} Vinicio}",

year = "2019",

month = nov,

day = "1",

doi = "10.1007/s10115-019-01336-3",

language = "English",

volume = "61",

pages = "755--798",

journal = "Knowledge and Information Systems",

issn = "0219-1377",

publisher = "Springer Verlag",

number = "2",

}

TY - JOUR

T1 - A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering

AU - Cerrada, Mariela

AU - Aguilar, Jose

AU - Altamiranda, Junior

AU - Sánchez, René Vinicio

PY - 2019/11/1

Y1 - 2019/11/1

N2 - Machine Learning is currently an important research field that attracts interest due to its importance for discovering hidden knowledge or patterns from big datasets. In this paper, we propose a heuristic algorithm which can solve problems related to only classification, only clustering, or classification with clustering by creating models with the ability to evolve to another class/cluster configuration without a retraining process for new incoming data. This algorithm combines supervised and unsupervised learning principles for the incremental construction of both classes and clusters, by using the main guidelines from two classical methods of classification based on distance and clustering based on prototypes, such as KNN and K-means. The algorithm is able to deal with labeled and unlabeled samples as inputs in order to create new groups (classes or clusters), merge or reconfigure existing ones. Basically, the creation of new groups follows three sequential steps: (i) locate the provisional group for an input sample using K-means, (ii) using 1NN, locate the nearest sample to the input sample, only considering the samples in the provisional group, and (iii) merge or reconfigure existing groups following specific guidelines. Several benchmarks, related to classification and clustering problems, were evaluated by our proposal; the results were compared with classical algorithms. On the other hand, artificial datasets with labeled and unlabeled samples have been created to show the ability of our algorithm in the hybrid context to solve classification and clustering combined. As a result, the algorithm is able to create clusters and classes, simultaneously, when required. Finally, a real case study of fault diagnosis in rotating machinery is presented for discovering new groups that might represent patterns from unknown data.

AB - Machine Learning is currently an important research field that attracts interest due to its importance for discovering hidden knowledge or patterns from big datasets. In this paper, we propose a heuristic algorithm which can solve problems related to only classification, only clustering, or classification with clustering by creating models with the ability to evolve to another class/cluster configuration without a retraining process for new incoming data. This algorithm combines supervised and unsupervised learning principles for the incremental construction of both classes and clusters, by using the main guidelines from two classical methods of classification based on distance and clustering based on prototypes, such as KNN and K-means. The algorithm is able to deal with labeled and unlabeled samples as inputs in order to create new groups (classes or clusters), merge or reconfigure existing ones. Basically, the creation of new groups follows three sequential steps: (i) locate the provisional group for an input sample using K-means, (ii) using 1NN, locate the nearest sample to the input sample, only considering the samples in the provisional group, and (iii) merge or reconfigure existing groups following specific guidelines. Several benchmarks, related to classification and clustering problems, were evaluated by our proposal; the results were compared with classical algorithms. On the other hand, artificial datasets with labeled and unlabeled samples have been created to show the ability of our algorithm in the hybrid context to solve classification and clustering combined. As a result, the algorithm is able to create clusters and classes, simultaneously, when required. Finally, a real case study of fault diagnosis in rotating machinery is presented for discovering new groups that might represent patterns from unknown data.

KW - Classification

KW - Clustering

KW - Data mining

KW - Evolving learning

KW - Hybrid learning

KW - Incremental learning

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85061306161&origin=inward

UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85061306161&origin=inward

UR - http://www.mendeley.com/research/hybrid-heuristic-algorithm-evolving-models-simultaneous-scenarios-classification-clustering

U2 - 10.1007/s10115-019-01336-3

DO - 10.1007/s10115-019-01336-3

M3 - Article

SN - 0219-1377

VL - 61

SP - 755

EP - 798

JO - Knowledge and Information Systems

JF - Knowledge and Information Systems

IS - 2

ER -

A hybrid heuristic algorithm for evolving models in simultaneous scenarios of classification and clustering

Resumen

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto