https://github.com/jamesjg/FoodSAM

FoodSAM: Any Food Segmentation
https://github.com/jamesjg/FoodSAM

Last synced: 7 months ago
JSON representation

FoodSAM: Any Food Segmentation

Host: GitHub
URL: https://github.com/jamesjg/FoodSAM
Owner: jamesjg
License: apache-2.0
Created: 2023-08-10T11:35:36.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2023-11-06T05:38:08.000Z (over 1 year ago)
Last Synced: 2023-11-06T06:32:11.668Z (over 1 year ago)
Language: Python
Homepage:
Size: 13.7 MB
Stars: 103
Watchers: 4
Forks: 4
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

Awesome-Segment-Anything - [code

README

        # FoodSAM: Any Food Segmentation

This is the official PyTorch implementation of our paper:

FoodSAM: Any Food Segmentation.

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/foodsam-any-food-segmentation/semantic-segmentation-on-foodseg103)](https://paperswithcode.com/sota/semantic-segmentation-on-foodseg103?p=foodsam-any-food-segmentation)

---

Segment anything Model(SAM) demonstrates significant performance on various segmentation benchmarks, showcasing its impressing zero-shot transfer capabilities on 23 diverse segmentation datasets. However, SAM lacks the class-specific information for each mask. To address the above limitation and explore the zero-shot capability of the SAM for food image segmentation, we propose a novel framework, called FoodSAM. This innovative approach integrates the coarse semantic mask with SAM-generated masks to enhance semantic

segmentation quality. Besides, it can perform instance segmentation on food images. Furthermore, FoodSAM extends its zero-shot capability to encompass panoptic segmentation by incorporating an object detector, which renders FoodSAM to effectively capture non-food object information. Remarkably, this pioneering framework stands as the first-ever work to achieve instance, panoptic, and promptable segmentation on food images. 

[[`Arxiv`](https://arxiv.org/abs/2308.05938)] 

[[`Project`]](https://starhiking.github.io/FoodSAM_Page/)

[[`IEEE TMM`]](https://ieeexplore.ieee.org/document/10306316)

![FoodSAM architecture](assets/foodsam.jpg)

FoodSAM contains three basic models: SAM, semantic segmenter, and object detector. SAM generates many class-agnostic binary masks, the semantic segmenter provides food category labels via mask-category match, and the object detector provides the non-food class for background masks. It then enhances the semantic mask via merge strategy and produces instance and panoptic results. Moreover, a seamless prompt-prior selection is integrated into the object detector to achieve promptable segmentation.

  

## Installation

Please follow our [installation.md](installation.md) to install.

## Getting Started

### Demo shell

You can run the model for semantic and panoptic segmentation in a few command lines.

#### semantic segmentation:

    # semantic segmentation for one img

    python FoodSAM/semantic.py --img_path  --output  

    # semantic segmentation for one folder

    python FoodSAM/semantic.py --data_root  --output 

#### panoptic segmentation:

    # panoptic segmentation for one img

    python FoodSAM/panoptic.py --img_path  --output 

    # panoptic segmentation for one folder

    python FoodSAM/panoptic.py --data_root  --output 

### Evaluation shell

Furthermore, by setting `args.eval` to true, the model can output the semantic masks and evaluate the metrics. 

Here are examples of semantic segmentation and panoptic segmentation on the FoodSeg103 dataset:

```

python FoodSAM/semantic.py --data_root dataset/FoodSeg103/Images --output Output/Semantic_Results --eval 

```

```

python FoodSAM/panoptic.py --data_root dataset/FoodSeg103/Images --output Output/Panoptic_Results

```

## Quantitative results

### FoodSeg103

| Method | mIou | aAcc | mAcc 

| :-: | :- | -: | :-: |  

|[SETR_MLA(baseline)](https://github.com/LARC-CMU-SMU/FoodSeg103-Benchmark-v1) | 45.10 | 83.53 | 57.44

FoodSAM | 46.42 | 84.10 |  58.27

### UECFOODPIXCOMPLETE

| Method | mIou | aAcc | mAcc 

| :-: | :- | -: | :-: |  

|[deeplabV3+ (baseline)](https://github.com/HitBadTrap/Foodseg-uecfoodpix)| 65.61 |88.20| 77.56

FoodSAM | 66.14 |88.47 |78.01

## Qualitative results

### cross domain results

 

### semantic segmentation results 

 

 

---

 

 

### instance segmentation results



### panoptic segmentation results



### promptable segmentation results



## Acknowledgements

A large part of the code is borrowed from the following wonderful works:

1. [Segmentation Anything](https://github.com/facebookresearch/segment-anything)

2. [UniDet](https://github.com/xingyizhou/UniDet)

3. [FoodSeg103](https://github.com/LARC-CMU-SMU/FoodSeg103-Benchmark-v1)

4. [mmsegmentation](https://github.com/open-mmlab/mmsegmentation)

## License

The model is licensed under the [Apache 2.0 license](LICENSE).

## Citation

If you want to cite our work, please use this:

```

@ARTICLE{10306316,

  author={Lan, Xing and Lyu, Jiayi and Jiang, Hanyu and Dong, Kun and Niu, Zehai and Zhang, Yi and Xue, Jian},

  journal={IEEE Transactions on Multimedia}, 

  title={FoodSAM: Any Food Segmentation}, 

  year={2023},

  volume={},

  number={},

  pages={1-14},

  doi={10.1109/TMM.2023.3330047}

}

@misc{lan2023foodsam,

      title={FoodSAM: Any Food Segmentation}, 

      author={Xing Lan and Jiayi Lyu and Hanyu Jiang and Kun Dong and Zehai Niu and Yi Zhang and Jian Xue},

      year={2023},

      eprint={2308.05938},

      archivePrefix={arXiv},

      primaryClass={cs.CV}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jamesjg/FoodSAM

Awesome Lists containing this project

README