Resumen
This paper explores a better way to learn word vector representations for language identification (LID). We have focused on a phonotactic approach using phoneme sequences in order to make phonotactic units (phone-grams) to incorporate context information. In order to take into consideration the morphology of phone-grams, we have considered the use of sub-word information (lower-order n-grams) to learn phone-grams embeddings using FastText. These embeddings are used as input to an i-Vector framework to train a multiclass logistic classifier. Our approach has been compared with a LID system that uses phone-gram embeddings learned through Skipgram that do not implement sub-word information, using Cavg as a metric for our experiments. Our approach to LID to incorporate sub-word information in phone-grams embeddings significantly improves the results obtained by using embeddings that are learned ignoring the structure of phone-grams. Furthermore, we have shown that our system provides complementary information to an acoustic system, improving it through the fusion of both systems.
| Idioma original | Inglés |
|---|---|
| Título de la publicación alojada | Conversational Dialogue Systems for the Next Decade, IWSDS 2020 |
| Editores | Luis Fernando D’Haro, Zoraida Callejas, Satoshi Nakamura |
| Editorial | Springer Science and Business Media Deutschland GmbH |
| Páginas | 339-348 |
| Número de páginas | 10 |
| ISBN (versión impresa) | 9789811583940 |
| DOI | |
| Estado | Publicada - 2021 |
| Evento | 11th International Workshop on Spoken Dialogue Systems, IWSDS 2020 - Madrid, Espana Duración: 21 sep. 2020 → 23 sep. 2020 |
Serie de la publicación
| Nombre | Lecture Notes in Electrical Engineering |
|---|---|
| Volumen | 704 |
Conferencia
| Conferencia | 11th International Workshop on Spoken Dialogue Systems, IWSDS 2020 |
|---|---|
| País/Territorio | Espana |
| Ciudad | Madrid |
| Período | 21/09/20 → 23/09/20 |
Nota bibliográfica
Publisher Copyright:© 2021, The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
Areas de Conocimiento del CACES
- 316A Desarrollo y análisis de software y aplicaciones
Proyectos
- 1 Terminado
-
Desarrollo y evaluación de un sistema inteligente de apoyo basado en algoritmos de procesamiento de señales para la evaluación de pacientes con vitíligo
Calle Ortiz, E. R. (Investigador principal), Chica Ortiz, J. F. (Asistente de Investigación), Salamea Palacios, C. R. (Investigador Secundario), Arias Salcedo, K. A. (Estudiante Investigador), Auquilla Vicuña, J. F. (Estudiante Investigador), Mora Alvarez, J. C. (Estudiante Investigador), Zumba Narvaez, F. P. (Estudiante Investigador) & Zumba Narvaez, E. A. (Estudiante Investigador)
15/06/17 → 22/11/22
Proyecto: Investigación y Desarrollo
Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver