An open API service indexing awesome lists of open source software.

https://github.com/ilyalasy/moe-routing

Analysis of token routing for different implementations of Mixture of Experts
https://github.com/ilyalasy/moe-routing

artificial-intelligence deep-learning interpretable-deep-learning mixture-of-experts

Last synced: 10 months ago
JSON representation

Analysis of token routing for different implementations of Mixture of Experts

Awesome Lists containing this project

README

          

# Token Routing Analysis of Mixture of Experts LLMs

## Install
```
pip install -r requirements.txt
cd ..
git clone https://github.com/hpcaitech/ColossalAI
pip install -U ./ColossalAI
cd ColossalAI/examples/language/openmoe
pip install -r requirements.txt
```
## Run OpenMoe Inference on RedPajama

```
./scripts/token-routing.sh
```

## Analyse token routing data

See [EDA notebook](https://github.com/Misterion777/moe-experiments/blob/main/notebooks/routing_eda.ipynb)

## TODO
- [x] Support Mixtral
- [x] Support DeepSeek