Skip to content

Fix layernorm epsilon for smolgen weights. #1914

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Sep 7, 2023

Conversation

almaudoh
Copy link
Contributor

@almaudoh almaudoh commented Sep 7, 2023

The value of epsilon in the layer norms for smolgen weights are 1e-3, but were coded as 1e-6 in some backends. This may not have a huge impact on Elo, though, but may be worth testing.

@borg323 borg323 merged commit 208e718 into LeelaChessZero:master Sep 7, 2023
PikaCat-OuO pushed a commit to official-pikafish/px0 that referenced this pull request Sep 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants