Abstract
Over the last few years, the field of automatic speech recognition (ASR) has grown rapidly, driven by the diverse applications and solutions it offers. This paper presents a multiclass language classifier based on recurrent convolutional neural networks that classifies the audio recordings of the KALAKA-3 database by language. To this end, mel-frequency cepstral coefficients (MFCCs) were extracted from each recording in the database and used as training features. A recurrent convolutional neural network (CRNN) was trained on these features, reaching an accuracy of 98% on the testing data and 40% on the Eval data. This work sets a precedent for improving real-time translators: in the future, a system could listen to a few seconds of a conversation, identify its language, and automatically translate it, which would be useful in a variety of applications.
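The abstract names MFCCs as the input features, but this record does not give the extraction settings. As a rough illustration only, the sketch below shows two building blocks that MFCC pipelines commonly rely on: mel-spaced filter center frequencies and a DCT-II over log filterbank energies. The HTK-style mel formula, the filter count, the frequency range, and all function names are assumptions chosen for illustration, not details taken from the paper.

```python
import math

def hz_to_mel(hz):
    """Convert Hz to mels (HTK-style formula; one common convention, assumed here)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_center_frequencies(n_filters, f_min, f_max):
    """Center frequencies (Hz) of n_filters filters spaced evenly on the mel scale."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]

def dct_ii(values, n_coeffs):
    """Unnormalized DCT-II; applied to log filterbank energies to get cepstral coefficients."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (2 * i + 1) / (2 * n)) for i, v in enumerate(values))
        for k in range(n_coeffs)
    ]

# Hypothetical settings: 26 filters between 0 Hz and 8 kHz.
centers = mel_center_frequencies(26, 0.0, 8000.0)
```

In practice a library routine would typically replace these helpers; the point is only that the filters are dense at low frequencies and sparse at high ones, and that the DCT compacts the log energies into a small number of coefficients.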
Original language | English |
---|---|
Title of host publication | Proceedings of 8th International Congress on Information and Communication Technology - ICICT 2023 |
Editors | Xin-She Yang, R. Simon Sherratt, Nilanjan Dey, Amit Joshi |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 641-649 |
Number of pages | 9 |
ISBN (print) | 9789819930425 |
DOI | |
Status | Published - 2024 |
Event | 8th International Congress on Information and Communication Technology, ICICT 2023 - London, United Kingdom. Duration: 20 Feb 2023 → 23 Feb 2023 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 695 LNNS |
ISSN (print) | 2367-3370 |
ISSN (electronic) | 2367-3389 |
Conference
Conference | 8th International Congress on Information and Communication Technology, ICICT 2023 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 20/02/23 → 23/02/23 |
Bibliographical note
Publisher Copyright: © 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.