Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

On the Use of Phonotactic Vector Representations with FastText for Language Identification

Producción científica: Capítulo del libro/informe/acta de congresoContribución de conferenciarevisión exhaustiva

Resumen

This paper explores a better way to learn word vector representations for language identification (LID). We have focused on a phonotactic approach using phoneme sequences in order to make phonotactic units (phone-grams) to incorporate context information. In order to take into consideration the morphology of phone-grams, we have considered the use of sub-word information (lower-order n-grams) to learn phone-grams embeddings using FastText. These embeddings are used as input to an i-Vector framework to train a multiclass logistic classifier. Our approach has been compared with a LID system that uses phone-gram embeddings learned through Skipgram that do not implement sub-word information, using Cavg as a metric for our experiments. Our approach to LID to incorporate sub-word information in phone-grams embeddings significantly improves the results obtained by using embeddings that are learned ignoring the structure of phone-grams. Furthermore, we have shown that our system provides complementary information to an acoustic system, improving it through the fusion of both systems.

Idioma originalInglés
Título de la publicación alojadaConversational Dialogue Systems for the Next Decade, IWSDS 2020
EditoresLuis Fernando D’Haro, Zoraida Callejas, Satoshi Nakamura
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas339-348
Número de páginas10
ISBN (versión impresa)9789811583940
DOI
EstadoPublicada - 2021
Evento11th International Workshop on Spoken Dialogue Systems, IWSDS 2020 - Madrid, Espana
Duración: 21 sep. 202023 sep. 2020

Serie de la publicación

NombreLecture Notes in Electrical Engineering
Volumen704

Conferencia

Conferencia11th International Workshop on Spoken Dialogue Systems, IWSDS 2020
País/TerritorioEspana
CiudadMadrid
Período21/09/2023/09/20

Nota bibliográfica

Publisher Copyright:
© 2021, The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.

Areas de Conocimiento del CACES

  • 316A Desarrollo y análisis de software y aplicaciones

Citar esto