https://github.com/showlab/ShowAnything

image-editing segment-anything video-editing video-generation vision-and-language

Last synced: 6 months ago
JSON representation

Host: GitHub
URL: https://github.com/showlab/ShowAnything
Owner: showlab
Created: 2023-04-11T08:50:35.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-08-01T07:08:28.000Z (over 2 years ago)
Last Synced: 2025-04-22T00:38:15.650Z (7 months ago)
Topics: image-editing, segment-anything, video-editing, video-generation, vision-and-language
Language: Jupyter Notebook
Homepage:
Size: 43 MB
Stars: 82
Watchers: 6
Forks: 3
Open Issues: 2
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-segment-anything-extensions - Repo

README

# ShowAnything
Edit and Generate Anything In Any Image and Video.

**Motivation of this repo**: [Segment Anything](https://github.com/facebookresearch/segment-anything) offers powerful perception capabilities and interfaces with points and boxes. We believe that the interface ability has the potential to greatly drive the development of generation and editing applications. We will be posting some application developments from our lab here that are compatible with both Segment Anything and Generation, thanks.

## ToDo
- [ ] Customization (LoRA or Dreambooth)
- [ ] Video editing with point interaction
- [ ] Hugging Face Demo
- [ ] ...

## Installation
Please follow the [segment anything](https://github.com/facebookresearch/segment-anything#model-checkpoints) to creating the environment and download the model checkpoint (vit_l, vit_l or vit_b)

## Image

[Hugging Face Demo](https://huggingface.co/spaces/weijiawu/ImageEditAnything) is now available, but please note that it may run slowly due to being currently executed on a low-end CPU.

### Edit with point click
The ultimate goal of this task is to achieve control over the generation of complex scenes, such as dense crowds and department stores.

See the [here](./ImageEdit). Using points to edit and control. Enjoy it!

### Object Merge

### Object Remove

## Video

### Zero-shot Editing Anything in Any Video

Replace the person in the video:

Customize the clothes of the person in the video:

More results:

Input Video
Output Video

"Input Video"
"A soldier is dancing"
"Donald Trump is dancing"
"Iron Man is dancing"

Input Video
Output Video

"Input Video"
"A man is dancing in skirt and wearing black stockings"
"A man is dancing in jeans"
"A man is dancing in shorts"

## Transform Image Into Unique Paragraph
Image to paragraph is available, please refer to [Image2Paragraph](https://github.com/showlab/Image2Paragraph), and enjoy!

## Segment Anything for 3D
Image to 3D using SAM is available, please refer to [Anything3D](https://github.com/showlab/Show-Anything-3D), and enjoy!

https://user-images.githubusercontent.com/37300524/232232693-ad3b0329-ab83-48e1-8f7e-dca9dff7b32e.mp4

## Acknowledgements
- [Segment Anything](https://github.com/facebookresearch/segment-anything)
- [Caption-Anything](https://github.com/ttengwang/Caption-Anything) (hugging face demo)
- [Stable Diffusion](https://github.com/CompVis/stable-diffusion)

## Citation
If you find this project helpful for your research, please consider citing the following BibTeX entry.
```BibTex
@article{kirillov2023segany,
title={Segment Anything},
author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
journal={arXiv:2304.02643},
year={2023}
}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/showlab/ShowAnything

Awesome Lists containing this project

README