
Chess-Llama

We trained a tiny Llama-based decoder-only transformer model for chess play, consisting of 23M parameters. The model was trained on 3 million high-quality chess games from the Lichess Elite Database, on a single Nvidia L4 GPU for 18 hours, using Google Cloud's Vertex AI platform.

View on Hugging Face

Web Version

This model can be run directly in the browser, thanks to Hugging Face's Transformers.js! You can try it here

Performance

The model uses the UCI format for input and output. It was trained with a token indicating the game result prepended to the beginning of each game, in the hope that this would improve performance during actual play. The model achieves an estimated Elo rating of 1400: it easily outperforms Stockfish at skill level 0, but loses when Stockfish is set to any level higher than 1.
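As a rough illustration of the input format described above, a prompt for the model could be assembled like this. This is a minimal sketch: the exact result-token strings and tokenizer details used by chess-llama are assumptions here, not taken from the repository.

```python
# Sketch of prompt construction for a chess language model, assuming the
# scheme described above: a game-result token is prepended, followed by
# the moves played so far in UCI notation (e.g. "e2e4"), separated by
# spaces. The "1-0" result string is an assumed convention.

def build_prompt(result: str, moves: list[str]) -> str:
    """Prepend the game-result token to the space-separated UCI moves."""
    return " ".join([result] + moves)

# Conditioning on a White win; a decoder-only model would then be asked
# to continue this sequence with the next UCI move.
prompt = build_prompt("1-0", ["e2e4", "e7e5", "g1f3"])
print(prompt)  # → 1-0 e2e4 e7e5 g1f3
```

Conditioning generation on the desired result in this way lets the same model be steered toward winning play at inference time, simply by always prompting with the token for a win by the side to move.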

Analysis
