Resumen
This study evaluates the effectiveness of the Continuous Bag-of-Words (CBOW) and Skip-gram models in capturing semantic relationships within Ecuadorian legal texts. Utilizing a comprehensive corpus that includes the Ecuadorian Constitution, the Comprehensive Organic Criminal Code (COIP), and the General Organic Code of Processes (COGEP), among other national laws, we analyze the models’ ability to represent the complex semantics of legal language. The CBOW model predicts target words based on their surrounding context, while Skip-gram predicts the context from a given target word, making them suitable for identifying intricate patterns in legal documents. A rigorous preprocessing phase was applied to the legal texts, including normalization, stopword removal, and lemmatization, ensuring high-quality input data for training. The models were then evaluated using semantic similarity (Spearman’s correlation) and topic coherence metrics. Results indicate that while both models show potential in capturing semantic relationships, CBOW demonstrated a marginally higher performance with a Spearman correlation of 0.24 and a topic coherence score of 0.6637, compared to Skip-gram’s 0.19 and 0.6573, respectively. Despite these findings, neither model fully captured the complexities inherent in legal language, suggesting a need for further refinement in NLP techniques for legal texts. These findings provide a foundation for improving semantic search and information retrieval systems tailored to the legal domain, offering tools to assist legal professionals in analyzing and understanding complex legal texts.
| Idioma original | Inglés |
|---|---|
| Título de la publicación alojada | Smart Technologies, Systems and Applications - 4th International Conference, SmartTech-IC 2024, Revised Selected Papers |
| Editores | Fabián R. Narváez, Micaela N. Villa, Gloria M. Díaz |
| Editorial | Springer Science and Business Media Deutschland GmbH |
| Páginas | 206-216 |
| Número de páginas | 11 |
| ISBN (versión impresa) | 9783031982866 |
| DOI | |
| Estado | Publicada - 2026 |
| Evento | 4th International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2024 - Quito, Ecuador Duración: 2 dic. 2024 → 4 dic. 2024 |
Serie de la publicación
| Nombre | Communications in Computer and Information Science |
|---|---|
| Volumen | 2392 CCIS |
| ISSN (versión impresa) | 1865-0929 |
| ISSN (versión digital) | 1865-0937 |
Conferencia
| Conferencia | 4th International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2024 |
|---|---|
| País/Territorio | Ecuador |
| Ciudad | Quito |
| Período | 2/12/24 → 4/12/24 |
Nota bibliográfica
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
ODS de las Naciones Unidas
Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible
-
ODS 16: Paz, justicia e instituciones sólidas
Proyectos
- 1 Activo
-
Diseño y Desarrollo de un Traductor de Voz a Lengua de Señas Ecuatoriana para Mejorar la Inclusión de Sordos en el Ecuador
Salamea Palacios, C. R. (Investigador principal) & Viñanzaca Figueroa, F. J. (Estudiante Investigador)
26/09/24 → …
Proyecto: Investigación y Desarrollo
Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver