https://github.com/showlab/ShowAnything
- Host: GitHub
- URL: https://github.com/showlab/ShowAnything
- Owner: showlab
- Created: 2023-04-11T08:50:35.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-01T07:08:28.000Z (about 2 years ago)
- Last Synced: 2025-04-22T00:38:15.650Z (6 months ago)
- Topics: image-editing, segment-anything, video-editing, video-generation, vision-and-language
- Language: Jupyter Notebook
- Homepage:
- Size: 43 MB
- Stars: 82
- Watchers: 6
- Forks: 3
- Open Issues: 2
Metadata Files:
- Readme: README.md
# ShowAnything
Edit and Generate Anything In Any Image and Video.

**Motivation of this repo**: [Segment Anything](https://github.com/facebookresearch/segment-anything) offers powerful perception capabilities and a prompt interface based on points and boxes. We believe this interactive interface can greatly advance generation and editing applications. We will post application developments from our lab here that combine Segment Anything with generative models.
## ToDo
- [ ] Customization (LoRA or Dreambooth)
- [ ] Video editing with point interaction
- [ ] Hugging Face Demo
- [ ] ...

## Installation
Please follow the [Segment Anything](https://github.com/facebookresearch/segment-anything#model-checkpoints) instructions to create the environment and download a model checkpoint (vit_h, vit_l, or vit_b).

## Image
A [Hugging Face Demo](https://huggingface.co/spaces/weijiawu/ImageEditAnything) is now available. Note that it may run slowly because it currently runs on a low-end CPU.
![]()
### Edit with point click
The ultimate goal of this task is to control the generation of complex scenes, such as dense crowds and department stores. See [ImageEdit](./ImageEdit) for the code. Use points to edit and control. Enjoy it!
![]()
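To make the point-click idea concrete, here is a minimal sketch of how user clicks could be turned into a Segment Anything point prompt and then into a mask. It is not taken from this repository (the actual code lives in `./ImageEdit` and may differ). The `Click` tuple convention and the `clicks_to_prompt` and `segment_with_clicks` helpers are hypothetical names introduced for illustration; only `sam_model_registry`, `SamPredictor`, and the checkpoint file names come from the segment-anything project. Passing the resulting mask to a generative model for editing is not shown.

```python
import os
from typing import List, Sequence, Tuple

# Checkpoint file names as published in the segment-anything README.
SAM_CHECKPOINTS = {
    "vit_h": "sam_vit_h_4b8939.pth",
    "vit_l": "sam_vit_l_0b3195.pth",
    "vit_b": "sam_vit_b_01ec64.pth",
}

# Hypothetical click convention for this sketch: (x, y, is_foreground).
Click = Tuple[int, int, bool]


def clicks_to_prompt(clicks: Sequence[Click]) -> Tuple[List[List[int]], List[int]]:
    """Split clicks into pixel coordinates and SAM labels (1 = keep, 0 = exclude)."""
    coords = [[x, y] for x, y, _ in clicks]
    labels = [1 if is_fg else 0 for _, _, is_fg in clicks]
    return coords, labels


def segment_with_clicks(image_rgb, clicks, checkpoint_dir=".", model_type="vit_b"):
    """Return the highest-scoring SAM mask for the given clicks.

    image_rgb is expected to be an HxWx3 uint8 RGB array.
    """
    # Imported lazily so clicks_to_prompt works without the heavy dependencies.
    import numpy as np
    from segment_anything import SamPredictor, sam_model_registry

    checkpoint = os.path.join(checkpoint_dir, SAM_CHECKPOINTS[model_type])
    sam = sam_model_registry[model_type](checkpoint=checkpoint)
    predictor = SamPredictor(sam)
    predictor.set_image(image_rgb)

    coords, labels = clicks_to_prompt(clicks)
    masks, scores, _ = predictor.predict(
        point_coords=np.array(coords),
        point_labels=np.array(labels),
        multimask_output=True,
    )
    # With multimask_output=True SAM returns several candidates; keep the best one.
    return masks[int(scores.argmax())]


if __name__ == "__main__":
    # One foreground click on the object and one background click to exclude a region.
    coords, labels = clicks_to_prompt([(120, 80, True), (300, 200, False)])
    print(coords, labels)
```

The boolean mask returned by `segment_with_clicks` marks the clicked object, which is the region an editing or inpainting model would then be asked to change.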
### Object Merge
![]()
### Object Remove
![]()
## Video
### Zero-shot Editing Anything in Any Video
Replace the person in the video:
![]()
Customize the clothes of the person in the video:
![]()
![]()
More results:

| Input Video | Output Video | Output Video | Output Video |
| --- | --- | --- | --- |
| "Input Video" | "A soldier is dancing" | "Donald Trump is dancing" | "Iron Man is dancing" |
| "Input Video" | "A man is dancing in skirt and wearing black stockings" | "A man is dancing in jeans" | "A man is dancing in shorts" |

## Transform Image Into Unique Paragraph
Image-to-paragraph conversion is available. Please refer to [Image2Paragraph](https://github.com/showlab/Image2Paragraph), and enjoy!
![]()
## Segment Anything for 3D
Image-to-3D conversion using SAM is available. Please refer to [Anything3D](https://github.com/showlab/Show-Anything-3D), and enjoy!

https://user-images.githubusercontent.com/37300524/232232693-ad3b0329-ab83-48e1-8f7e-dca9dff7b32e.mp4
## Acknowledgements
- [Segment Anything](https://github.com/facebookresearch/segment-anything)
- [Caption-Anything](https://github.com/ttengwang/Caption-Anything) (hugging face demo)
- [Stable Diffusion](https://github.com/CompVis/stable-diffusion)

## Citation
If you find this project helpful for your research, please consider citing the following BibTeX entry.
```BibTex
@article{kirillov2023segany,
title={Segment Anything},
author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
journal={arXiv:2304.02643},
year={2023}
}
```