Skip to content

AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.

License

Notifications You must be signed in to change notification settings

Dicta-Israel-Center-for-Text-Analysis/alephbertgimmel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

alephbertgimmel

AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.

When using AlephBertGimmel, please reference:

Eylon Guetta, Avi Shmidman, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Joshua Guedalia, Moshe Koppel, Dan Bareket, Amit Seker and Reut Tsarfaty, "Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All", Nov 2022 [http://arxiv.org/abs/2211.15199]

About

AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages