Abstract
The rapid advancement of generative language models has sparked growing interest in balancing creativity and consistency in text generation. While many of the latest models are publicly accessible, their training methods and datasets remain undisclosed. Older models such as GPT-2, however, provide full documentation of their training process, making them suitable for investigating how hyperparameter configurations influence output quality. This study evaluates the effects of temperature, top-p, and top-k sampling, as well as beam search and greedy search. The outputs were assessed with the Distinct-N and BERTScore metrics, which measure textual diversity and semantic alignment, respectively. Each parameter was systematically varied, and the resulting texts were analyzed to produce visual representations that identify the configurations yielding coherent and diverse outputs. This research contributes to a better understanding of how hyperparameter tuning can enhance the adaptability and output quality of the GPT-2 model.
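As a rough illustration of the quantities named in the abstract, the sketch below shows how temperature, top-k, and top-p reshape a next-token distribution, and how Distinct-N can be computed as the ratio of unique to total n-grams. It is a simplified, dependency-free sketch, not the authors' code: the function names `reshape_distribution` and `distinct_n`, the toy logits, and whitespace tokenization are all assumptions made for this example. BERTScore is omitted because it requires a pretrained model.

```python
import math
from typing import Dict, List


def distinct_n(tokens: List[str], n: int) -> float:
    """Distinct-N: unique n-grams divided by total n-grams (0.0 if none exist)."""
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return len(set(ngrams)) / len(ngrams) if ngrams else 0.0


def reshape_distribution(logits: Dict[str, float], temperature: float = 1.0,
                         top_k: int = 0, top_p: float = 1.0) -> Dict[str, float]:
    """Apply temperature, then top-k, then top-p filtering to toy logits.

    Hypothetical helper for illustration; temperature must be > 0.
    top_k=0 and top_p=1.0 disable the respective filters.
    """
    # Temperature: divide logits before the softmax (low T sharpens, high T flattens).
    scaled = {tok: value / temperature for tok, value in logits.items()}
    peak = max(scaled.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(value - peak) for tok, value in scaled.items()}
    norm = sum(exps.values())
    ranked = sorted(((tok, e / norm) for tok, e in exps.items()),
                    key=lambda pair: pair[1], reverse=True)

    # Top-k: keep only the k most probable tokens, then renormalize.
    if top_k > 0:
        ranked = ranked[:top_k]
        kept_mass = sum(p for _, p in ranked)
        ranked = [(tok, p / kept_mass) for tok, p in ranked]

    # Top-p (nucleus): keep the smallest prefix whose cumulative mass reaches top_p.
    nucleus, cumulative = [], 0.0
    for tok, p in ranked:
        nucleus.append((tok, p))
        cumulative += p
        if cumulative >= top_p:
            break

    total = sum(p for _, p in nucleus)
    return {tok: p / total for tok, p in nucleus}


# Toy example (made-up logits, not model output).
toy_logits = {"cat": 2.0, "dog": 1.0, "fish": 0.0}
sharp = reshape_distribution(toy_logits, temperature=0.1)
flat = reshape_distribution(toy_logits, temperature=10.0)
top2 = reshape_distribution(toy_logits, top_k=2)
nucleus_half = reshape_distribution(toy_logits, top_p=0.5)
diversity = distinct_n("the cat sat on the cat".split(), 2)
```

Greedy search corresponds to always taking the most probable token, while beam search keeps several candidate sequences in parallel. In the Hugging Face `transformers` library these settings are exposed on `generate` as `temperature`, `top_k`, `top_p`, `do_sample`, and `num_beams`; whether the study used that library is not stated in the abstract.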
| Original language | English |
|---|---|
| Title of host publication | Communication and Applied Technologies - Proceedings of ICOMTA 2025 |
| Editors | Paulo Carlos López-López, Matthieu Vernier, Úrsula Freundt-Thurne, Daniel Barredo Ibáñez |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 13-22 |
| Number of pages | 10 |
| ISBN (Print) | 9783032099105 |
| DOI | |
| Publication status | Published - 2026 |
| Event | International Conference on Communication and Applied Technologies, ICOMTA 2025 - Valdivia, Chile. Duration: 2 Sep 2025 → 4 Sep 2025 |
Publication series
| Name | Smart Innovation, Systems and Technologies |
|---|---|
| Volume | 458 SIST |
| ISSN (Print) | 2190-3018 |
| ISSN (Electronic) | 2190-3026 |
Conference
| Conference | International Conference on Communication and Applied Technologies, ICOMTA 2025 |
|---|---|
| Country/Territory | Chile |
| City | Valdivia |
| Period | 2/09/25 → 4/09/25 |
Bibliographical note
Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Fingerprint
Dive into the research topics of 'Hyperparameter Optimization of GPT-2 for Enhanced Text Generation'. Together they form a unique fingerprint.