https://github.com/aihpi/sketch2image
An interactive AI tool that transforms simple sketches into detailed images in various artistic styles.
- Host: GitHub
- URL: https://github.com/aihpi/sketch2image
- Owner: aihpi
- Created: 2025-04-11T07:21:47.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-06-06T10:29:32.000Z (4 months ago)
- Last Synced: 2025-06-06T11:29:08.974Z (4 months ago)
- Topics: ai-image-generation, controlnet, diffusion-models, excalidraw, react, sketch-to-image, stable-diffusion, t2i-adapter
- Language: Python
- Homepage:
- Size: 29.3 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
# Sketch-to-Image Demonstrator
This project is an interactive tool that transforms simple sketches into detailed images using AI models. Users can draw freehand sketches and convert them into realistic or artistic images in various styles.

## Features
- **Interactive Drawing**: Built with Excalidraw for a natural drawing experience
- **Model Selection**: Choose between two AI models with different capabilities
- **Multiple Styles**: Choose from multiple visual styles for your generated images (Photorealistic, Anime, Oil Painting, Watercolor, and Detailed Sketch)
- **Responsive Design**: Works on tablets and desktop devices
- **GPU Acceleration**: Utilizes NVIDIA GPUs when available for faster image generation

## Setup and Installation
### Prerequisites
- Docker and Docker Compose
- NVIDIA GPU with CUDA support (optional, but recommended for faster performance)

### Quick Start
1. Clone the repository:
```bash
git clone https://github.com/aihpi/sketch2image.git
cd sketch2image
```

2. Run the setup script:
```bash
chmod +x setup.sh
./setup.sh
```

3. Access the application:
- Frontend: http://localhost:3000
- Backend API: http://localhost:8000/api
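Once the containers are running, a quick reachability check from the host can confirm both services are up. The following is a minimal, illustrative Python sketch (not part of the project) using only the standard library; it assumes the ports listed above and treats any HTTP response, even a 404, as "reachable", since the backend's exact routes are not documented here.

```python
# check_services.py -- illustrative only; URLs taken from the Quick Start above
import urllib.error
import urllib.request

SERVICES = {
    "frontend": "http://localhost:3000",
    "backend API": "http://localhost:8000/api",
}

for name, url in SERVICES.items():
    try:
        with urllib.request.urlopen(url, timeout=5) as resp:
            print(f"{name}: reachable (HTTP {resp.status})")
    except urllib.error.HTTPError as exc:
        # Any HTTP status means the server answered, even if this route 404s.
        print(f"{name}: reachable (HTTP {exc.code})")
    except (urllib.error.URLError, OSError) as exc:
        print(f"{name}: NOT reachable ({exc})")
```

Running `python check_services.py` after `./setup.sh` completes should report both services as reachable.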
## User Guide

### Using the Sketch-to-Image Tool
1. **Draw your sketch** on the Excalidraw canvas
   - Keep lines clear and distinct for best results
   - Simple sketches work better than highly detailed ones
2. **Select an AI model**:
   - **SD 1.5 + ControlNet Scribble**: Faster generation (5-15s on GPU)
   - **SDXL + T2I-Adapter Sketch**: Higher quality but slower (10-30s on GPU)
3. **Choose an output style**:
   - **Photorealistic**: Realistic images with photographic details
   - **Anime**: Cartoon/anime style with simplified shapes and bold colors
   - **Oil Painting**: Artistic oil painting look with rich textures
   - **Watercolor**: Soft watercolor art style with gentle color blending
   - **Detailed Sketch**: Enhanced detailed sketch with improved linework and shading
4. **Add a description**
   - Describe what you're drawing for better results
   - Example: "a cat sitting on a windowsill"
   - Include key details you want emphasized
5. **Click "Generate Image"**
   - Wait for the AI to process your sketch (5-30 seconds)
   - The generated image will appear on the right side (an illustrative code sketch of this generation step follows the list)
6. **Managing Results**:
   - Download your image using the download button
   - Use the "Reset All" button to start over with a new sketch
### Recommended Workflow
For best results:
1. Start with a simple sketch with clear outlines
2. Try both models to see which best captures your vision
3. Experiment with different styles
4. Use specific descriptions that emphasize important elements
5. For complex subjects, break them down into simpler components

## Example Results
The following examples showcase how the system transforms simple sketches into various styles:




## Limitations
- **Sketch Clarity**: The system works best with clear, simple line drawings; complex or ambiguous sketches may produce unexpected results.
- **Generation Time**: Processing time increases with sketch complexity and varies by hardware.
- **Style Consistency**: Some styles work better with certain subjects than others. For example, the "anime" style may not always produce consistent anime-style artwork for all sketches.
- **Unusual Subjects**: The models may struggle with abstract or highly unusual sketches that don't resemble common objects.
- **Resolution**: Output images are fixed at 512×512 pixels.
- **Model Limitations**:
  - Both models occasionally ignore certain elements in very complex sketches.
  - Both models sometimes misinterpret the scale or perspective of sketched objects.

## Advanced Configuration
You can modify the application settings by editing the `.env` file or the `docker-compose.yml` file (an illustrative sketch of how these values might be read follows the list):
- `MODEL_ID`: The default model to use
- `NUM_INFERENCE_STEPS`: Number of diffusion steps
- `GUIDANCE_SCALE`: Controls how closely the output follows the prompt
- `OUTPUT_IMAGE_SIZE`: Size of the generated image
- `DEVICE`: Set to "cuda" for GPU or "cpu" for CPU processing
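As a rough illustration of how such settings could be consumed, the sketch below reads them from the environment with fallback defaults. The variable names come from the list above; the default values and the module layout are assumptions, not the project's actual code.

```python
# settings.py -- illustrative sketch; defaults are assumptions, not the project's values
import os

MODEL_ID = os.getenv("MODEL_ID", "controlnet-scribble")             # default model to use
NUM_INFERENCE_STEPS = int(os.getenv("NUM_INFERENCE_STEPS", "30"))   # diffusion steps
GUIDANCE_SCALE = float(os.getenv("GUIDANCE_SCALE", "7.5"))          # prompt adherence
OUTPUT_IMAGE_SIZE = int(os.getenv("OUTPUT_IMAGE_SIZE", "512"))      # square output size in px
DEVICE = os.getenv("DEVICE", "cuda")                                # "cuda" or "cpu"

if __name__ == "__main__":
    print(f"model={MODEL_ID} steps={NUM_INFERENCE_STEPS} "
          f"guidance={GUIDANCE_SCALE} size={OUTPUT_IMAGE_SIZE} device={DEVICE}")
```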
## Troubleshooting
- **Slow Generation**: Try using the ControlNet Scribble model instead of T2I-Adapter
- **Poor Results**: Simplify your sketch and provide a clear description
- **Container Errors**: Check Docker logs with `docker-compose logs`
- **GPU Not Detected**: Ensure NVIDIA drivers and Docker GPU support are correctly installed (see the check below)
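For the GPU case, a small PyTorch check run inside the backend environment (e.g. via `docker-compose exec <backend-service> python gpu_check.py`, where the service name depends on your compose file) makes the failure mode obvious. This is an illustrative sketch, not part of the project:

```python
# gpu_check.py -- illustrative; confirms whether PyTorch can see a CUDA device
import torch

print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))
    print("CUDA runtime bundled with torch:", torch.version.cuda)
else:
    print("No GPU visible; set DEVICE=cpu or fix the NVIDIA driver / Docker GPU setup.")
```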
## Acknowledgements
- [Excalidraw](https://excalidraw.com/) for the drawing interface
- [Hugging Face](https://huggingface.co/) for hosting the pre-trained models
- [ControlNet](https://github.com/lllyasviel/ControlNet) & [T2I-Adapter](https://github.com/TencentARC/T2I-Adapter) for the sketch-to-image technology

---
[KI-Servicezentrum Berlin-Brandenburg](http://hpi.de/kisz) is funded by the Federal Ministry of Education and Research under the funding code 01IS22092.