Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Hyperparameter Optimization of GPT-2 for Enhanced Text Generation

Producción científica: Capítulo del libro/informe/acta de congresoContribución de conferenciarevisión exhaustiva

Resumen

The rapid advancement of generative language models has sparked a growing interest in balancing creativity and consistency in text generation. While many of the latest models are publicly accessible, their training methods and datasets remain undisclosed. However, older models such as GPT-2 provide full documentation on their training process, making them suitable for investigating how hyperparameter configurations influence output quality. This study evaluates the effects of temperature, top-p, top-k, beam-search and greedy-search. To assess the final outputs, Distinct-N and BERTScore metrics have been used, which measure textual diversity and semantic alignment, respectively. Each parameter was systematically varied, and the resulting texts were analyzed to generate visual representations identifying the configurations that yield coherent and diverse outputs. This research contributes to a better understanding of how hyperparameter tuning can enhance the adaptability and output of the GPT-2 model.

Idioma originalInglés
Título de la publicación alojadaCommunication and Applied Technologies - Proceedings of ICOMTA 2025
EditoresPaulo Carlos López-López, Matthieu Vernier, Úrsula Freundt-Thurne, Daniel Barredo Ibáñez
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas13-22
Número de páginas10
ISBN (versión impresa)9783032099105
DOI
EstadoPublicada - 2026
EventoInternational Conference on Communication and Applied Technologies, ICOMTA 2025 - Valdivia, Chile
Duración: 2 sep. 20254 sep. 2025

Serie de la publicación

NombreSmart Innovation, Systems and Technologies
Volumen458 SIST
ISSN (versión impresa)2190-3018
ISSN (versión digital)2190-3026

Conferencia

ConferenciaInternational Conference on Communication and Applied Technologies, ICOMTA 2025
País/TerritorioChile
CiudadValdivia
Período2/09/254/09/25

Nota bibliográfica

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

Huella

Profundice en los temas de investigación de 'Hyperparameter Optimization of GPT-2 for Enhanced Text Generation'. En conjunto forman una huella única.

Citar esto