https://github.com/ChiShengChen/ResVMamba
The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning
https://github.com/ChiShengChen/ResVMamba
deep-learning food-classification mamba
Last synced: about 2 months ago
JSON representation
The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning
- Host: GitHub
- URL: https://github.com/ChiShengChen/ResVMamba
- Owner: ChiShengChen
- Created: 2024-02-20T16:13:57.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-27T15:23:34.000Z (6 months ago)
- Last Synced: 2024-11-27T16:31:41.053Z (6 months ago)
- Topics: deep-learning, food-classification, mamba
- Language: Python
- Homepage:
- Size: 983 KB
- Stars: 56
- Watchers: 2
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-state-space-models - Vision
README
# ResVMamba
[](https://arxiv.org/abs/2402.15761)
[](https://paperswithcode.com/sota/fine-grained-image-recognition-on-cnfood-241?p=res-vmamba-fine-grained-food-category-visual)
[]([https://huggingface.co/ms57rd/Res-VMamba])  The official repository of [Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning](https://arxiv.org/abs/2402.15761) , the most part of code is modified from [VMamba](https://github.com/MzeroMiko/VMamba) .
![]()
## Get started
Please follw the installation flow on [VMamba](https://github.com/MzeroMiko/VMamba).### Pretrained-weight
The Res-VMamba model best weight with VMamba-S as backbone trained on CNFOOD-241-Chen (CNFOOD-241 dataset with the random split in the paper) can be available on the [HuggingFace](https://huggingface.co/ms57rd/Res-VMamba/tree/main) .
The downloaded weight need to put under the folder path:
`./ResVMamba/pretrained_model/vssm_small/default/ckpt_epoch_166.pth`### Run Command
For has only 1 GPU card:
```
python3 -m torch.distributed.launch --nnodes=1 --node_rank=0 --nproc_per_node=1 --master_addr="127.0.0.1" --master_port=29501 main.py --cfg configs/vssm/vssm_small_224.yaml --batch-size 16 --data-path /food_data/CNFOOD-241 --output ./ResVMamba/pretrained_model
```## CNFOOD-241-Chen dataset
![]()
The image list can be found in `CNFOOD241_data_split` folder.
## Training Result on paper
## Star History
[](https://star-history.com/#ChiShengChen/ResVMamba&Date)
## Reference
The original CNFOOD-241 data: https://data.mendeley.com/datasets/fspyss5zbb/1## Citation
Hope this code is helpful. I would appreciate you citing us in your paper. 😊
```
@article{chen2024res,
title={Res-vmamba: Fine-grained food category visual classification using selective state space models with deep residual learning},
author={Chen, Chi-Sheng and Chen, Guan-Ying and Zhou, Dong and Jiang, Di and Chen, Dai-Shi},
journal={arXiv preprint arXiv:2402.15761},
year={2024}
}```