https://github.com/kyegomez/simplemamba

Implementation of a modular, high-performance, and simple Mamba for high-speed applications.

[![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)

# Simple Mamba

## Install
`pip install simple-mamba`

## Usage

### `MambaBlock`
```python
import torch
from simple_mamba import MambaBlock

# Define block parameters
dim = 512
hidden_dim = 128
heads = 8
in_channels = 3
out_channels = 3
kernel_size = 3

# Create an instance of MambaBlock
mamba_block = MambaBlock(
    dim, hidden_dim, heads, in_channels, out_channels, kernel_size
)

# Create a sample input tensor
x = torch.randn(1, dim, dim)  # assumed layout: (batch, seq_len, dim); seq_len here equals dim

# Pass the tensor through the MambaBlock
output = mamba_block(x)
print("Output shape:", output.shape)

```
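
As the example above suggests, the block maps a `(batch, seq_len, dim)` tensor to an output of the same shape, so blocks can be chained. Below is a minimal sketch of a hypothetical `MambaStack` that composes several blocks with residual connections, assuming shape-preserving blocks and the constructor arguments shown above:

```python
import torch
from torch import nn
from simple_mamba import MambaBlock

class MambaStack(nn.Module):
    """Hypothetical wrapper: a stack of MambaBlocks with residual connections."""

    def __init__(self, depth, dim, hidden_dim, heads, in_channels, out_channels, kernel_size):
        super().__init__()
        self.layers = nn.ModuleList(
            MambaBlock(dim, hidden_dim, heads, in_channels, out_channels, kernel_size)
            for _ in range(depth)
        )

    def forward(self, x):
        for layer in self.layers:
            x = x + layer(x)  # residual add assumes each block preserves input shape
        return x

model = MambaStack(depth=4, dim=512, hidden_dim=128, heads=8,
                   in_channels=3, out_channels=3, kernel_size=3)
x = torch.randn(1, 512, 512)
print(model(x).shape)  # torch.Size([1, 512, 512]) if blocks preserve shape
```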

### `SSM`
```python
import torch
from simple_mamba import SSM

# Example usage
vocab_size = 10000  # vocabulary size
embed_dim = 256     # embedding dimension
state_dim = 512     # state dimension
num_layers = 2      # number of state-space layers

model = SSM(vocab_size, embed_dim, state_dim, num_layers)

# Example input (sequence of word indices)
input_seq = torch.randint(
    0, vocab_size, (32, 10)
)  # batch size of 32, sequence length of 10

# Forward pass
logits = model(input_seq)
print(logits.shape) # Should be [32, 10, vocab_size]

```
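
Because `SSM` returns one logit vector per input token, it can be trained like a standard language model with token-level cross-entropy. A minimal sketch, with dummy tensors standing in for real data and for the model's output:

```python
import torch
import torch.nn.functional as F

vocab_size = 10000
logits = torch.randn(32, 10, vocab_size)          # stand-in for model(input_seq)
targets = torch.randint(0, vocab_size, (32, 10))  # token labels, same layout as input_seq

# Flatten batch and sequence dimensions for token-level cross-entropy
loss = F.cross_entropy(logits.reshape(-1, vocab_size), targets.reshape(-1))
print(loss.item())
```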

# License
MIT

# Citation
```bibtex
@misc{gu2023mamba,
  title={Mamba: Linear-Time Sequence Modeling with Selective State Spaces},
  author={Albert Gu and Tri Dao},
  year={2023},
  eprint={2312.00752},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}
```
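
For readers new to the cited paper: its core object is a linear state-space recurrence, `h_t = A h_{t-1} + B x_t` with readout `y_t = C h_t`, which Mamba makes input-dependent ("selective"). A purely illustrative, non-selective sketch of that recurrence (not this library's kernel):

```python
import torch

def ssm_scan(x, A, B, C):
    """Linear SSM recurrence: h_t = A h_{t-1} + B x_t, y_t = C h_t (row-vector form)."""
    batch, seq_len, _ = x.shape
    h = torch.zeros(batch, A.shape[0])
    ys = []
    for t in range(seq_len):
        h = h @ A.T + x[:, t] @ B.T   # state update
        ys.append(h @ C.T)            # readout
    return torch.stack(ys, dim=1)

dim, state_dim = 512, 64
A = 0.01 * torch.randn(state_dim, state_dim)  # state transition
B = 0.01 * torch.randn(state_dim, dim)        # input projection
C = 0.01 * torch.randn(dim, state_dim)        # output projection

x = torch.randn(1, 10, dim)
y = ssm_scan(x, A, B, C)
print(y.shape)  # torch.Size([1, 10, 512])
```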