Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-huge-models
A collection of AWESOME things about HUGE AI models.
https://github.com/zhengzangw/awesome-huge-models
Survey
- Awesome-LLM
- Open-LLM
- LLMDataHub
- On the Opportunities and Risks of Foundation Models
- Pre-Trained Models: Past, Present and Future
- Open LLM Leaderboard
- A Survey of Large Language Models
- A Dive into Vision-Language Models
- Compute Trends Across Three Eras of Machine Learning [2022.02]
- Vision-and-Language Pretrained Models: A Survey
- A Roadmap to Big Model
- A Survey of Vision-Language Pre-trained Models
- Transformers in Vision: A Survey
Models
Language Model
- [Together]
- [OpenAI]
- [H2O.ai]
- h2oGPT: Democratizing Large Language Models
- [Anthropic]
- [Preprint]
- [Preprint]
- [Preprint]
- [ICLR'23]
- [Preprint]
- [Meta]
- [Preprint]
- [Preprint]
- [EleutherAI]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Inspur]
- [Preprint]
- [Microsoft, Nvidia]
- [Preprint]
- [Baidu]
- [Preprint]
- [EleutherAI]
- [AI21 Labs]
- [Preprint]
- [Preprint]
- [Baidu]
- [Preprint]
- [Preprint]
- [Naver]
- [Preprint]
- [TACL'22]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [Preprint]
- [NeurIPS'20]
- [Meta]
- [Preprint]
- [Microsoft]
- [Preprint]
- [Microsoft]
- [ACL'20]
- [JMLR'19]
- [Meta]
- [Preprint]
- [NeurIPS'19]
- [Preprint]
- [NAACL'18]
- [Preprint]
- [OpenAI]
- [OpenAI]
- [OpenAI]
- [BAAI]
- [Preprint]
- [Preprint]
- [OpenAI]
- [BAAI]
- [Google]
- [OpenAI]
- [Stability-AI]
- [Anthropic]
- [Preprint]
- [Preprint]
- [ICLR'23]
Vision Models
- [Shanghai AI Lab]
- [Preprint]
- [Preprint]
- Scaling Vision Transformers to 22 Billion Parameters
- [Baidu]
- [Preprint]
- [CVPR'23 Highlight]
- [Preprint]
- [Preprint]
- [Preprint]
- [PPoPP'22]
- [Preprint]
- [Preprint]
- [CVPR'22]
- [CASIA]
- [Preprint]
- [NeurIPS'21]
- [NeurIPS'21]
- [BAAI, Alibaba]
- [NeurIPS'21]
- [Alibaba]
- [Preprint]
- [ICML'21]
- [ICML'22]
- [ICLR'20]
- [OpenAI]
- [ICML'20]
- [ICLR'19]
- [OpenAI]
- [OpenAI]
- [OpenAI]
- [OpenAI]
- [Google]
- [Google]
- [OpenAI]
Speech
Science
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [DeepMind
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
- [Nature
Reinforcement Learning
Open LLM Training Dataset
- MNBVC (ongoing, target 40TB), Chinese, MIT License
- SlimPajama
- RefinedWeb: ODC-By 1.0 license (the 5T-token version is private)
- The Pile
- RedPajama
Distributed Training Framework
PyTorch Ecosystem
Other Frameworks
Inference Frameworks
XLA Ecosystem
Recommendation Training Framework
Keys Explanations
Keywords
- deep-learning: 2
- llm: 2
- machine-learning: 2
- dht: 1
- asyncio: 1
- asynchronous-programming: 1
- nlp-machine-learning: 1
- nlp: 1
- corpus-data: 1
- chinese-simplified: 1
- chinese-nlp: 1
- chinese-language: 1
- chinese: 1
- semantic-segmentation: 1
- object-detection: 1
- foundation-model: 1
- deformable-convolution: 1
- backbone: 1
- dataset: 1
- chatgpt: 1
- chatbot: 1
- llms: 1
- large-language-models: 1
- video-processing: 1
- stream-processing: 1
- pipeline-framework: 1
- perception: 1
- mobile-development: 1
- mediapipe: 1
- inference: 1
- graph-framework: 1
- graph-based: 1
- framework: 1
- computer-vision: 1
- calculator: 1
- c-plus-plus: 1
- audio-processing: 1
- android: 1
- volunteer-computing: 1
- pytorch: 1
- neural-networks: 1
- mixture-of-experts: 1
- hivemind: 1
- distributed-training: 1
- distributed-systems: 1
- commercial: 1