Abstract
While large language models are constantly evaluated in various skills, such as math, general knowledge, and coding, their ability to understand and follow game rules has not yet been deeply explored. The latter is especially important as it allows testing whether LLMs can operate within predefined limits without deviating or making illogical mistakes. Therefore, this demo paper presents a tool for interacting with LLMs in board games. The tool allows the creation of players with different large language models pitted against each other or to play in human vs. LLM mode. The platform includes rules predefined in prompts for four simple games based on Tic-Tac-Toe and Connect Four. Each player can be evaluated to account for their illegal movements, wins, draws, losses, and response times. The application also allows for the creation of new games, opening up the possibility of examining LLM behavior in situations they have not previously encountered.
| Original language | English |
|---|---|
| Title of host publication | Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track - European Conference, ECML PKDD 2025, Proceedings |
| Editors | Inês Dutra, Alípio M. Jorge, Carlos Soares, João Gama, Mykola Pechenizkiy, Paulo Cortez, Sepideh Pashami, Arian Pasquali, Nuno Moniz, Pedro H. Abreu |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 486-490 |
| Number of pages | 5 |
| ISBN (Print) | 9783032061287 |
| DOIs | |
| State | Published - 2026 |
| Event | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2025 - Porto, Portugal Duration: 15 Sep 2025 → 19 Sep 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 16022 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2025 |
|---|---|
| Country/Territory | Portugal |
| City | Porto |
| Period | 15/09/25 → 19/09/25 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Keywords
- decision-making
- General Game Playing
- LLM evaluation
Fingerprint
Dive into the research topics of 'LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver