An open API service indexing awesome lists of open source software.

https://github.com/showlab/ShowAnything


https://github.com/showlab/ShowAnything

image-editing segment-anything video-editing video-generation vision-and-language

Last synced: 5 months ago
JSON representation

Awesome Lists containing this project

README

          





# ShowAnything
Edit and Generate Anything In Any Image and Video.

**Motivation of this repo**: [Segment Anything](https://github.com/facebookresearch/segment-anything) offers powerful perception capabilities and interfaces with points and boxes. We believe that the interface ability has the potential to greatly drive the development of generation and editing applications. We will be posting some application developments from our lab here that are compatible with both Segment Anything and Generation, thanks.

## ToDo
- [ ] Customization (LoRA or Dreambooth)
- [ ] Video editing with point interaction
- [ ] Hugging Face Demo
- [ ] ...

## Installation
Please follow the [segment anything](https://github.com/facebookresearch/segment-anything#model-checkpoints) to creating the environment and download the model checkpoint (vit_l, vit_l or vit_b)

## Image

[Hugging Face Demo](https://huggingface.co/spaces/weijiawu/ImageEditAnything) is now available, but please note that it may run slowly due to being currently executed on a low-end CPU.





### Edit with point click
The ultimate goal of this task is to achieve control over the generation of complex scenes, such as dense crowds and department stores.

See the [here](./ImageEdit). Using points to edit and control. Enjoy it!





### Object Merge





### Object Remove





## Video

### Zero-shot Editing Anything in Any Video

Replace the person in the video:





Customize the clothes of the person in the video:









More results:

Input Video
Output Video




"Input Video"
"A soldier is dancing"
"Donald Trump is dancing"
"Iron Man is dancing"

Input Video
Output Video




"Input Video"
"A man is dancing in skirt and wearing black stockings"
"A man is dancing in jeans"
"A man is dancing in shorts"

## Transform Image Into Unique Paragraph
Image to paragraph is available, please refer to [Image2Paragraph](https://github.com/showlab/Image2Paragraph), and enjoy!





## Segment Anything for 3D
Image to 3D using SAM is available, please refer to [Anything3D](https://github.com/showlab/Show-Anything-3D), and enjoy!

https://user-images.githubusercontent.com/37300524/232232693-ad3b0329-ab83-48e1-8f7e-dca9dff7b32e.mp4

## Acknowledgements
- [Segment Anything](https://github.com/facebookresearch/segment-anything)
- [Caption-Anything](https://github.com/ttengwang/Caption-Anything) (hugging face demo)
- [Stable Diffusion](https://github.com/CompVis/stable-diffusion)

## Citation
If you find this project helpful for your research, please consider citing the following BibTeX entry.
```BibTex
@article{kirillov2023segany,
title={Segment Anything},
author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
journal={arXiv:2304.02643},
year={2023}
}

```