https://github.com/srush/annotated-mamba

Annotated version of the Mamba paper

[Go to Blog](https://srush.github.io/annotated-mamba/hard.html) [Notebook](https://github.com/srush/annotated-mamba/blob/main/Scan.ipynb) [Colab](https://colab.research.google.com/github/srush/annotated-mamba/blob/main/Scan.ipynb)

![image](https://github.com/srush/annotated-mamba/assets/35882/d21e62eb-cc3b-4fa2-9e7e-88bbf8d08325)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces
https://arxiv.org/abs/2312.00752

## Challenge

The Triton version is still much slower than Mamba's custom kernel. If anyone has ideas for speeding it up, I would love to hear them. The file `final.py` contains a minimal version with benchmarking.

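```bash
# Install the reference Mamba implementation (provides the custom kernel)
pip install mamba-ssm
# Install a prebuilt Triton 3.0.0 wheel (CPython 3.10, Linux x86_64)
pip install -U http://kermit.bounceme.net:8900/triton-3.0.0-cp310-cp310-linux_x86_64.whl
```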
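
When experimenting with kernel speedups, a slow but obviously correct reference is handy for checking outputs. The following is a minimal sketch in plain Python, assuming the scan computes the first-order recurrence `h_t = a_t * h_{t-1} + b_t` (the form of the discretized state update in the Mamba paper). The function names are hypothetical and are not taken from `final.py`.

```python
from itertools import accumulate


def sequential_scan(a, b, h0=0.0):
    """Reference: h_t = a_t * h_{t-1} + b_t, computed one step at a time."""
    h, out = h0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def combine(left, right):
    """Associative operator on (a, b) pairs: apply `left` first, then `right`."""
    a_l, b_l = left
    a_r, b_r = right
    return (a_l * a_r, a_r * b_l + b_r)


def associative_scan(a, b):
    """Same recurrence (with h0 = 0) expressed as a prefix scan over `combine`."""
    return [h for _, h in accumulate(zip(a, b), combine)]
```

Because `combine` is associative, the prefix scan can in principle be evaluated in a tree order rather than left to right, which is what makes a parallel kernel possible. Comparing a candidate kernel's output against `sequential_scan` on small inputs is a cheap sanity check before benchmarking.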