Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
https://github.com/XueFuzhao/awesome-mixture-of-experts
- Jan 2024
- Dec 2023
- Dec 2023
- Aug 2023
- Dec 2021
- Feb 2021
- 4 Sep 2022
- 11 Jan 2021
- 13 Dec 2021
- NeurIPS 2021
- 17 Feb 2022
- NeurIPS 2022
- ICML 2023
- 2 Aug 2023
- Aug 2023
- ICML 2023
- ICCV 2023
- EMNLP 2023
- ACL 2023
- ICML 2023
- NeurIPS 2022
- ACL 2022
- ICLR 2022
- AAAI 2022
- NeurIPS 2021
- NeurIPS 2021
- NeurIPS 2021
- ICML 2021
- ICLR 2017
- AI Open
- Artificial Intelligence Review
- 4 Jun 2024
- 23 May 2024
- 19 Jul 2022
- 6 Jul 2022
- 8 Jun 2022
- 6 Jun 2022
- 5 Jun 2022
- 5 Jun 2022
- 1 Jun 2022
- 28 May 2022
- 24 May 2022
- 24 May 2022
- 12 May 2022
- 26 Apr 2022
- 20 Apr 2022
- 16 Apr 2022
- 15 Apr 2022
- 11 Apr 2022
- 14 Mar 2022
- 2 Mar 2022
- 18 Feb 2022
- 17 Feb 2022
- 17 Feb 2022
- 2 Feb 2022
- 28 Jan 2022
- 26 Jan 2022
- 29 Dec 2021
- 20 Dec 2021
- 13 Dec 2021
- 10 Dec 2021
- 23 Nov 2021
- 23 Nov 2021
- 14 Oct 2021
- 8 Oct 2021
- 7 Oct 2021
- 5 Oct 2021
- 5 Sep 2021
- 31 May 2021
- 7 May 2021
- 11 Jan 2021
- 28 Sept 2020
- MLSys 2022
- OSDI 2022
- PPoPP 2022
- PPoPP 2022
- ICLR 2021
- 29 Nov 2022
- 28 Mar 2022
- 20 Mar 2022
- 14 Jan 2022
- 29 Sep 2021
- 24 Mar 2021
- 02 Feb 2023
- 24 Nov 2022
- 31 May 2022
- 18 May 2022
- 5 May 2022
- 20 Mar 2022
- 15 Apr 2022
- Tutel: An efficient mixture-of-experts implementation for large DNN model training
- Mesh-TensorFlow: Deep Learning for Supercomputers
- FastMoE: A Fast Mixture-of-Expert Training System
- DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
Programming Languages
Keywords
- mixture-of-experts: 5
- moe: 3
- pytorch: 2
- deep-learning: 2
- machine-learning: 2
- nlp: 1
- transformer: 1
- billion-parameters: 1
- compression: 1
- data-parallelism: 1
- gpu: 1
- inference: 1
- model-parallelism: 1
- pipeline-parallelism: 1
- trillion-parameters: 1
- zero: 1
- large-language-models: 1
- model-compression: 1
- natural-language-processing: 1
- bert: 1
- multimodal-large-language-models: 1
- phi-2: 1
- qwen: 1
- stablelm: 1
- vision-transformer: 1
- continual-pre-training: 1
- expert-partition: 1
- llama: 1
- llm: 1