https://github.com/imane0x/text-based-image-editing
This project explores the intersection of NLP and CV, showcasing the potential of leveraging three powerful models – SAM, Stable Diffusion, and Grounding DINO – to edit manipulate images through textual commands.
https://github.com/imane0x/text-based-image-editing
docker gradio grounding-dino segment-anything stable-diffusion
Last synced: 4 months ago
JSON representation
This project explores the intersection of NLP and CV, showcasing the potential of leveraging three powerful models – SAM, Stable Diffusion, and Grounding DINO – to edit manipulate images through textual commands.
- Host: GitHub
- URL: https://github.com/imane0x/text-based-image-editing
- Owner: imane0x
- Created: 2023-12-02T18:55:53.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-14T17:56:38.000Z (8 months ago)
- Last Synced: 2024-11-28T18:08:25.037Z (6 months ago)
- Topics: docker, gradio, grounding-dino, segment-anything, stable-diffusion
- Language: Python
- Homepage:
- Size: 20 MB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Text-Based Image Editing
This project explores the intersection of Natural Language Processing (NLP) and Computer Vision through advanced text-based image editing techniques. By leveraging three powerful models—Grounding DINO, Segment Anything Model (SAM), and Stable Diffusion—we aim to create an intuitive system that allows users to modify images based on textual input.Grounding DINO: Facilitates object detection and localization within images based on text prompts, allowing for precise object targeting.
Segment Anything Model (SAM): Provides robust segmentation capabilities to isolate specific objects or regions within an image for editing.
Stable Diffusion: Handles image generation and enhancement, enabling creative transformations based on text instructions.
## PrerequisitesBefore you begin, ensure you have the following installed on your system:
- [Docker](https://www.docker.com/)
- [NVIDIA Drivers](https://docs.nvidia.com/datacenter/tesla/tesla-installation-notes/index.html)
- [NVIDIA Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html)## Getting Started
### 1. Build the Docker Image
```bash
docker build -t image_editing .
```### 2. Run the Docker Container with GPU Support
```bash
docker run --gpus all -p 7860:7860 image_editing
```
## Demo
- Replacing lamb with wolf.