Factors that Affect i-Vectors Based Language Identification Systems

David Romero; Christian Salamea; Fernando Chica; Erick Narvaez

doi:10.1007/978-3-030-46785-2_13

Factors that Affect i-Vectors Based Language Identification Systems

David Romero, Christian Salamea, Fernando Chica, Erick Narvaez

Research Group on Interaction, Robotics and Automatics (GIIRA)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The performance of a language identification (LID) system that uses i-vectors as features depends on several parameters, such as algorithm parameters and data parameters. In this study, an analysis of performance of a language identification system is considered, for which we focused only on data parameters in the “Back End” of the system, analyzing the influence of the amount of data and the speaker variability in the training phases of the UBM and the total variability Matrix T. Also, the Multiclass logistic regression (MLR) classifiers were analyzed, by balancing the classes of the database to train the classifiers on each language. These tests have been carried out in the Kalaka-3 database; we have used the average detection cost function (Cavg) to evaluate the performance. It is shown experimentally that in the training phase of the UBM, speaker variability is more important than a large amount of data. In the training phase of the total variability matrix T a better performance was obtained when a larger number of audios were used. And finally, balancing classes on each language to train the MLR classifiers allowed us to get a better performance only in certain languages. Using all of these proposed variations, we got a Cavg improvement of 37% in a standard language identification system.

Original language	English
Title of host publication	Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings
Editors	Fabián R. Narváez, Diego F. Vallejo, Paulina A. Morillo, Julio R. Proaño
Publisher	Springer
Pages	154-164
Number of pages	11
ISBN (Print)	9783030467845
DOIs	https://doi.org/10.1007/978-3-030-46785-2_13
State	Published - 1 Jan 2020
Event	1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019 - Quito, Ecuador Duration: 2 Dec 2019 → 4 Dec 2019

Publication series

Name	Communications in Computer and Information Science
Volume	1154 CCIS
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019
Country/Territory	Ecuador
City	Quito
Period	2/12/19 → 4/12/19

Bibliographical note

Publisher Copyright:
© Springer Nature Switzerland AG 2020.

Keywords

Data
i-Vector
Language identification

Access to Document

10.1007/978-3-030-46785-2_13

Cite this

Romero, D., Salamea, C., Chica, F., & Narvaez, E. (2020). Factors that Affect i-Vectors Based Language Identification Systems. In F. R. Narváez, D. F. Vallejo, P. A. Morillo, & J. R. Proaño (Eds.), Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings (pp. 154-164). (Communications in Computer and Information Science; Vol. 1154 CCIS). Springer. https://doi.org/10.1007/978-3-030-46785-2_13

Romero, David ; Salamea, Christian ; Chica, Fernando et al. / Factors that Affect i-Vectors Based Language Identification Systems. Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings. editor / Fabián R. Narváez ; Diego F. Vallejo ; Paulina A. Morillo ; Julio R. Proaño. Springer, 2020. pp. 154-164 (Communications in Computer and Information Science).

@inproceedings{419d20c727104ef0a0f38bc84c5bc684,

title = "Factors that Affect i-Vectors Based Language Identification Systems",

abstract = "The performance of a language identification (LID) system that uses i-vectors as features depends on several parameters, such as algorithm parameters and data parameters. In this study, an analysis of performance of a language identification system is considered, for which we focused only on data parameters in the “Back End” of the system, analyzing the influence of the amount of data and the speaker variability in the training phases of the UBM and the total variability Matrix T. Also, the Multiclass logistic regression (MLR) classifiers were analyzed, by balancing the classes of the database to train the classifiers on each language. These tests have been carried out in the Kalaka-3 database; we have used the average detection cost function (Cavg) to evaluate the performance. It is shown experimentally that in the training phase of the UBM, speaker variability is more important than a large amount of data. In the training phase of the total variability matrix T a better performance was obtained when a larger number of audios were used. And finally, balancing classes on each language to train the MLR classifiers allowed us to get a better performance only in certain languages. Using all of these proposed variations, we got a Cavg improvement of 37% in a standard language identification system.",

keywords = "Data, i-Vector, Language identification",

author = "David Romero and Christian Salamea and Fernando Chica and Erick Narvaez",

note = "Publisher Copyright: {\textcopyright} Springer Nature Switzerland AG 2020.; 1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019 ; Conference date: 02-12-2019 Through 04-12-2019",

year = "2020",

month = jan,

day = "1",

doi = "10.1007/978-3-030-46785-2_13",

language = "English",

isbn = "9783030467845",

series = "Communications in Computer and Information Science",

publisher = "Springer",

pages = "154--164",

editor = "Narv{\'a}ez, {Fabi{\'a}n R.} and Vallejo, {Diego F.} and Morillo, {Paulina A.} and Proa{\~n}o, {Julio R.}",

booktitle = "Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings",

}

Romero, D, Salamea, C, Chica, F & Narvaez, E 2020, Factors that Affect i-Vectors Based Language Identification Systems. in FR Narváez, DF Vallejo, PA Morillo & JR Proaño (eds), Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings. Communications in Computer and Information Science, vol. 1154 CCIS, Springer, pp. 154-164, 1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019, Quito, Ecuador, 2/12/19. https://doi.org/10.1007/978-3-030-46785-2_13

Factors that Affect i-Vectors Based Language Identification Systems. / Romero, David; Salamea, Christian; Chica, Fernando et al.
Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings. ed. / Fabián R. Narváez; Diego F. Vallejo; Paulina A. Morillo; Julio R. Proaño. Springer, 2020. p. 154-164 (Communications in Computer and Information Science; Vol. 1154 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Factors that Affect i-Vectors Based Language Identification Systems

AU - Romero, David

AU - Salamea, Christian

AU - Chica, Fernando

AU - Narvaez, Erick

N1 - Publisher Copyright: © Springer Nature Switzerland AG 2020.

PY - 2020/1/1

Y1 - 2020/1/1

N2 - The performance of a language identification (LID) system that uses i-vectors as features depends on several parameters, such as algorithm parameters and data parameters. In this study, an analysis of performance of a language identification system is considered, for which we focused only on data parameters in the “Back End” of the system, analyzing the influence of the amount of data and the speaker variability in the training phases of the UBM and the total variability Matrix T. Also, the Multiclass logistic regression (MLR) classifiers were analyzed, by balancing the classes of the database to train the classifiers on each language. These tests have been carried out in the Kalaka-3 database; we have used the average detection cost function (Cavg) to evaluate the performance. It is shown experimentally that in the training phase of the UBM, speaker variability is more important than a large amount of data. In the training phase of the total variability matrix T a better performance was obtained when a larger number of audios were used. And finally, balancing classes on each language to train the MLR classifiers allowed us to get a better performance only in certain languages. Using all of these proposed variations, we got a Cavg improvement of 37% in a standard language identification system.

AB - The performance of a language identification (LID) system that uses i-vectors as features depends on several parameters, such as algorithm parameters and data parameters. In this study, an analysis of performance of a language identification system is considered, for which we focused only on data parameters in the “Back End” of the system, analyzing the influence of the amount of data and the speaker variability in the training phases of the UBM and the total variability Matrix T. Also, the Multiclass logistic regression (MLR) classifiers were analyzed, by balancing the classes of the database to train the classifiers on each language. These tests have been carried out in the Kalaka-3 database; we have used the average detection cost function (Cavg) to evaluate the performance. It is shown experimentally that in the training phase of the UBM, speaker variability is more important than a large amount of data. In the training phase of the total variability matrix T a better performance was obtained when a larger number of audios were used. And finally, balancing classes on each language to train the MLR classifiers allowed us to get a better performance only in certain languages. Using all of these proposed variations, we got a Cavg improvement of 37% in a standard language identification system.

KW - Data

KW - i-Vector

KW - Language identification

UR - http://www.scopus.com/inward/record.url?scp=85084838232&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/5c9c4c40-8f7f-3ff9-a0f0-801d14bf13ed/

U2 - 10.1007/978-3-030-46785-2_13

DO - 10.1007/978-3-030-46785-2_13

M3 - Conference contribution

AN - SCOPUS:85084838232

SN - 9783030467845

T3 - Communications in Computer and Information Science

SP - 154

EP - 164

BT - Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings

A2 - Narváez, Fabián R.

A2 - Vallejo, Diego F.

A2 - Morillo, Paulina A.

A2 - Proaño, Julio R.

PB - Springer

T2 - 1st International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2019

Y2 - 2 December 2019 through 4 December 2019

ER -

Romero D, Salamea C, Chica F, Narvaez E. Factors that Affect i-Vectors Based Language Identification Systems. In Narváez FR, Vallejo DF, Morillo PA, Proaño JR, editors, Smart Technologies, Systems and Applications - 1st International Conference, SmartTech-IC 2019, Proceedings. Springer. 2020. p. 154-164. (Communications in Computer and Information Science). doi: 10.1007/978-3-030-46785-2_13

Factors that Affect i-Vectors Based Language Identification Systems

Abstract

Publication series

Conference

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this