Skip to main navigation Skip to search Skip to main content

A Proposed Methodology for Semantic Alignment and Specialization of Pre-trained Multilingual Embeddings Using Mixture-of-Experts and Contrastive Learning for Legal Text Retrieval in Ecuador

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Legal text retrieval in multilingual contexts, such as those found within Ecuadorian judicial environments, presents significant challenges due to specialized terminology and inherent semantic complexity. This research proposes a methodological framework, currently at an initial doctoral research stage, designed to refine and specialize pre-trained multilingual embeddings specifically for legal text retrieval tasks. By integrating a Mixture-of-Experts (MoE) architecture with contrastive learning techniques applied explicitly at the embedding level, the proposed approach aims to enhance semantic alignment, embedding uniformity, and domain specialization. This integration specifically addresses recognized limitations of multilingual embeddings such as semantic anisotropy and insufficient domain adaptation. Future empirical validations will be conducted using standard retrieval metrics—including Normalized Discounted Cumulative Gain (nDCG@10), Mean Reciprocal Rank (MRR), and Spearman correlation—to rigorously assess anticipated improvements over existing baseline methods.

Original languageEnglish
Title of host publicationInformation and Communication Technologies - 13th Ecuadorian Conference, TICEC 2025, Proceedings
EditorsSantiago Berrezueta, Tatiana Gualotuña, Efrain R. Fonseca C., Germania Rodriguez Morales, Jorge Maldonado-Mahauad
PublisherSpringer Science and Business Media Deutschland GmbH
Pages450-463
Number of pages14
ISBN (Print)9783032083654
DOIs
StatePublished - 2026
Event13th Ecuadorian Conference on Information and Communication Technologies, TICEC 2025 - Quito, Ecuador
Duration: 16 Oct 202517 Oct 2025

Publication series

NameCommunications in Computer and Information Science
Volume2707 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference13th Ecuadorian Conference on Information and Communication Technologies, TICEC 2025
Country/TerritoryEcuador
CityQuito
Period16/10/2517/10/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

Fingerprint

Dive into the research topics of 'A Proposed Methodology for Semantic Alignment and Specialization of Pre-trained Multilingual Embeddings Using Mixture-of-Experts and Contrastive Learning for Legal Text Retrieval in Ecuador'. Together they form a unique fingerprint.

Cite this