https://github.com/srush/annotated-mamba
Annotated version of the Mamba paper
- Host: GitHub
- URL: https://github.com/srush/annotated-mamba
- Owner: srush
- License: mit
- Created: 2023-12-26T16:17:10.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-27T22:21:02.000Z (over 2 years ago)
- Last Synced: 2025-03-29T07:05:51.895Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 61.8 MB
- Stars: 478
- Watchers: 22
- Forks: 18
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
README
[Go to Blog](https://srush.github.io/annotated-mamba/hard.html) [Notebook](https://github.com/srush/annotated-mamba/blob/main/Scan.ipynb) [Colab](https://colab.research.google.com/github/srush/annotated-mamba/blob/main/Scan.ipynb)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces
https://arxiv.org/abs/2312.00752
## Challenge
The Triton version is still much slower than Mamba's custom kernel. If you have ideas for speeding it up, I would love to hear them. The file `final.py` contains a minimal version with benchmarking.
```bash
pip install mamba-ssm
pip install -U http://kermit.bounceme.net:8900/triton-3.0.0-cp310-cp310-linux_x86_64.whl
```
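When trying out speedups, it can help to check a candidate kernel against a slow reference first. The following is a minimal pure-Python sketch of the scalar recurrence `h_t = a_t * h_{t-1} + b_t` that underlies the scan, written both as a loop and as a prefix scan with an associative operator. It is not the code in `final.py`; the function names, the scalar setting, and the `benchmark` helper are assumptions made for illustration only.

```python
import time
from itertools import accumulate


def sequential_scan(a, b):
    """Reference recurrence: h_t = a_t * h_{t-1} + b_t, starting from h = 0."""
    h, out = 0.0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def combine(left, right):
    """Associative operator on (a, b) pairs: composes two affine steps."""
    a1, b1 = left
    a2, b2 = right
    return (a1 * a2, a2 * b1 + b2)


def associative_scan(a, b):
    """Same recurrence expressed as a prefix scan over (a, b) pairs."""
    return [h for _, h in accumulate(zip(a, b), combine)]


def benchmark(fn, *args, repeats=5):
    """Hypothetical timing helper: best wall-clock time over a few runs."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best


if __name__ == "__main__":
    a = [0.9] * 10_000
    b = [1.0] * 10_000
    print("sequential :", benchmark(sequential_scan, a, b))
    print("associative:", benchmark(associative_scan, a, b))
```

Because `combine` is associative, the pairs can be reduced in any grouping, which is what lets a GPU kernel evaluate the scan in parallel rather than step by step. In plain Python both versions run serially, so the timings here only exercise the harness and say nothing about Triton or CUDA performance.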