Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hallerpatrick/babylm-2024
https://github.com/hallerpatrick/babylm-2024
Last synced: 28 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/hallerpatrick/babylm-2024
- Owner: HallerPatrick
- Created: 2024-09-13T08:47:58.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-12-02T15:34:30.000Z (about 1 month ago)
- Last Synced: 2024-12-08T18:04:54.773Z (about 1 month ago)
- Size: 6.77 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Submission for the BabyLM 2024 Shared Task
> [!IMPORTANT]
> We are happy to announce that our paper got published at the CoNLL 2024 Shared Task! (Proceedings still in progress)> [!IMPORTANT]
> Training pipeline and further evaluation logic cleanup is still WIP# Benchmark Results
We include links to the predictions, as well as the model links.The prediction files can easily be downloaded with:
```bash
# Strict-Small Track
wget https://github.com/HallerPatrick/BabyLM-2024/blob/main/hgrn2_pile_10m_distill_babylm_textonly_predictions.json.gz# Strict Track
wget https://github.com/HallerPatrick/BabyLM-2024/blob/main/hgrn2_pile_100m_distill_babylm_textonly_predictions.json.gz
```OR (Single URLS for the OpenReivew submission)
```bash
wget https://github.com/HallerPatrick/BabyLM-2024/blob/main/predictions.zip
unzip predictions.zip
```# Model Weights
The models are available at HuggingFace:
* Strict-small Track: https://huggingface.co/PatrickHaller/hgrn2_pile_10m_distill_babylm
* Strict Track: https://huggingface.co/PatrickHaller/hgrn2_pile_100m_distill_babylmTo download and use these models, the [fla](https://github.com/sustcsonglin/flash-linear-attention) package has to be installed first:
```bash
pip install -U git+https://github.com/sustcsonglin/flash-linear-attention
```