Incorporation of Language Discriminative Information into Recurrent Neural Networks Models to LID Tasks

Christian Salamea, Ricardo Cordoba, Luis D’Haro, David Romero

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Language Identification (LID) is an essential research topic in the Automatic Recognition Speech area. One of the most important characteristics relative to language is context information. In this article, considering a phonotactic approach where the phonetic units called “phone-grams” are used, in order to introduce such context information, a novel technique is proposed. Language discriminative information has been incorporated in the Recurrent Neural Network Language Models generation (RNNLMs) in the weights initialization stage to improve the Language Identification task. This technique has been evaluated using KALAKA-3 database that contains 108 h of audios of six languages to be recognized. The metric used in this work has been the Average Detection Cost metric Cavg. In relation to the phonetic units called “phone-grams” used in order to incorporate context information in the features used to train the RNNLM, it has been considered phone-grams of two elements “2phone-grams” and three elements “3phone-grams”, obtaining a relative improvement up to 17% and 15,44% respectively compared to the results obtaining using RNNLMs.

Original languageEnglish
Title of host publicationSmart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings
EditorsFabián R. Narváez, Diego F. Vallejo, Paulina A. Morillo, Julio R. Proaño
PublisherSpringer
Pages165-175
Number of pages11
ISBN (Print)9783030467845
DOIs
StatePublished - 1 Jan 2020
Event1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019 - Quito, Ecuador
Duration: 2 Dec 20194 Dec 2019

Publication series

NameCommunications in Computer and Information Science
Volume1154 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019
Country/TerritoryEcuador
CityQuito
Period2/12/194/12/19

Bibliographical note

Publisher Copyright:
© Springer Nature Switzerland AG 2020.

Keywords

  • Automatic Recognition Speech
  • Language discriminative information
  • Language Identification
  • Recurrent Neural Networks

Fingerprint

Dive into the research topics of 'Incorporation of Language Discriminative Information into Recurrent Neural Networks Models to LID Tasks'. Together they form a unique fingerprint.

Cite this