KALAKA-3 Database Language Classifier Through Convolutional Recurrent Neural Network

Producción científica: Capítulo del libro/informe/acta de congresoContribución de conferenciarevisión exhaustiva

Resumen

During the last few years, the field of automatic speech recognition (ASR) has been growing exponentially, due to the diverse applications and solutions it offers. For this reason, this paper presents a multiclass language classifier based on recurrent convolutional neural networks, whose objective is to classify the audios of the KALAKA-3 database, according to their language. To meet this objective, the mel frequency cepstral coefficients (MFCCs) were extracted from each of the audios in the database, with which the training process is carried out. A recurrent convolutional neural network (CRNN) was created for this process, resulting in an accuracy of 98% using the testing data, and 40% using the Eval data. This work sets a precedent for improving real-time translators, since in the future it would be possible to listen to a few seconds of a conversation, identify it, and automatically perform a translation process, which would be very useful in various applications.

Idioma originalInglés
Título de la publicación alojadaProceedings of 8th International Congress on Information and Communication Technology - ICICT 2023
EditoresXin-She Yang, R. Simon Sherratt, Nilanjan Dey, Amit Joshi
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas641-649
Número de páginas9
ISBN (versión impresa)9789819930425
DOI
EstadoPublicada - 2024
Evento8th International Congress on Information and Communication Technology, ICICT 2023 - London, Reino Unido
Duración: 20 feb. 202323 feb. 2023

Serie de la publicación

NombreLecture Notes in Networks and Systems
Volumen695 LNNS
ISSN (versión impresa)2367-3370
ISSN (versión digital)2367-3389

Conferencia

Conferencia8th International Congress on Information and Communication Technology, ICICT 2023
País/TerritorioReino Unido
CiudadLondon
Período20/02/2323/02/23

Nota bibliográfica

Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Huella

Profundice en los temas de investigación de 'KALAKA-3 Database Language Classifier Through Convolutional Recurrent Neural Network'. En conjunto forman una huella única.

Citar esto