Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by bigscience-workshop
A curated list of projects in awesome lists by bigscience-workshop.
https://github.com/bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing
Last synced: 31 Jul 2024
https://github.com/bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Last synced: 31 Jul 2024
https://github.com/bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
machine-learning models nlp training
Last synced: 01 Aug 2024
https://github.com/bigscience-workshop/t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
Last synced: 01 Aug 2024
https://github.com/bigscience-workshop/biomedical
Tools for curating biomedical training data for large-scale language modeling
Last synced: 03 Aug 2024
https://github.com/bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
dataset large-language-models multilingual
Last synced: 09 Aug 2024
https://github.com/bigscience-workshop/data_tooling
Tools for managing datasets for governance and training.
Last synced: 31 Jul 2024