Abstract
During the last few years, the field of automatic speech recognition (ASR) has been growing exponentially, due to the diverse applications and solutions it offers. For this reason, this paper presents a multiclass language classifier based on recurrent convolutional neural networks, whose objective is to classify the audios of the KALAKA-3 database, according to their language. To meet this objective, the mel frequency cepstral coefficients (MFCCs) were extracted from each of the audios in the database, with which the training process is carried out. A recurrent convolutional neural network (CRNN) was created for this process, resulting in an accuracy of 98% using the testing data, and 40% using the Eval data. This work sets a precedent for improving real-time translators, since in the future it would be possible to listen to a few seconds of a conversation, identify it, and automatically perform a translation process, which would be very useful in various applications.
Original language | English |
---|---|
Title of host publication | Proceedings of 8th International Congress on Information and Communication Technology - ICICT 2023 |
Editors | Xin-She Yang, R. Simon Sherratt, Nilanjan Dey, Amit Joshi |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 641-649 |
Number of pages | 9 |
ISBN (Print) | 9789819930425 |
DOIs | |
State | Published - 2024 |
Event | 8th International Congress on Information and Communication Technology, ICICT 2023 - London, United Kingdom Duration: 20 Feb 2023 → 23 Feb 2023 |
Publication series
Name | Lecture Notes in Networks and Systems |
---|---|
Volume | 695 LNNS |
ISSN (Print) | 2367-3370 |
ISSN (Electronic) | 2367-3389 |
Conference
Conference | 8th International Congress on Information and Communication Technology, ICICT 2023 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 20/02/23 → 23/02/23 |
Bibliographical note
Publisher Copyright:© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Keywords
- Artificial intelligence
- CRNN
- Database
- KALAKA-3
- Neural networks