
A Proposed Methodology for Semantic Alignment and Specialization of Pre-trained Multilingual Embeddings Using Mixture-of-Experts and Contrastive Learning for Legal Text Retrieval in Ecuador

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

Abstract

Legal text retrieval in multilingual contexts, such as those found within Ecuadorian judicial environments, presents significant challenges due to specialized terminology and inherent semantic complexity. This research proposes a methodological framework, currently at an initial doctoral research stage, designed to refine and specialize pre-trained multilingual embeddings specifically for legal text retrieval tasks. By integrating a Mixture-of-Experts (MoE) architecture with contrastive learning techniques applied explicitly at the embedding level, the proposed approach aims to enhance semantic alignment, embedding uniformity, and domain specialization. This integration specifically addresses recognized limitations of multilingual embeddings such as semantic anisotropy and insufficient domain adaptation. Future empirical validations will be conducted using standard retrieval metrics—including Normalized Discounted Cumulative Gain (nDCG@10), Mean Reciprocal Rank (MRR), and Spearman correlation—to rigorously assess anticipated improvements over existing baseline methods.
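As an illustration of two of the retrieval metrics named in the abstract, the following is a minimal sketch of nDCG@10 and MRR in plain Python. It is not taken from the paper: the function names, the linear-gain form of DCG, and the simplification that the ideal ranking is derived from the retrieved list itself (rather than from a full set of relevance judgments) are all assumptions made for the example.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k graded relevance labels.

    Assumption: linear gain (rel), not the exponential variant (2**rel - 1).
    """
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg_at_k(relevances, k=10):
    """nDCG@k: DCG of the given ranking divided by DCG of the ideal ordering.

    Assumption: the ideal ordering is built from the same list of labels,
    i.e. every judged document for the query appears in `relevances`.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


def mean_reciprocal_rank(rankings):
    """MRR: mean over queries of 1/rank of the first relevant document.

    A query with no relevant document contributes 0 to the mean.
    """
    total = 0.0
    for relevances in rankings:
        for rank, rel in enumerate(relevances, start=1):
            if rel > 0:
                total += 1.0 / rank
                break
    return total / len(rankings) if rankings else 0.0
```

Each input list holds the relevance labels of the retrieved documents in ranked order for one query, so `mean_reciprocal_rank([[0, 1, 0], [1, 0, 0]])` averages reciprocal ranks 1/2 and 1.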

Original language: English
Title of host publication: Information and Communication Technologies - 13th Ecuadorian Conference, TICEC 2025, Proceedings
Editors: Santiago Berrezueta, Tatiana Gualotuña, Efrain R. Fonseca C., Germania Rodriguez Morales, Jorge Maldonado-Mahauad
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 450-463
Number of pages: 14
ISBN (print): 9783032083654
DOI
Publication status: Published - 2026
Event: 13th Ecuadorian Conference on Information and Communication Technologies, TICEC 2025 - Quito, Ecuador
Duration: 16 Oct 2025 to 17 Oct 2025

Publication series

Name: Communications in Computer and Information Science
Volume: 2707 CCIS
ISSN (print): 1865-0929
ISSN (electronic): 1865-0937

Conference

Conference: 13th Ecuadorian Conference on Information and Communication Technologies, TICEC 2025
Country/Territory: Ecuador
City: Quito
Period: 16/10/25 to 17/10/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
