https://github.com/g-hano/gemini-to-image

A versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation. The project consists of a Streamlit GUI interface where users can interact with the generated content.
https://github.com/g-hano/gemini-to-image

ai gemini google llm rag

Last synced: 5 months ago
JSON representation

Host: GitHub
URL: https://github.com/g-hano/gemini-to-image
Owner: g-hano
License: mit
Created: 2024-03-03T17:54:33.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-03-03T19:04:41.000Z (about 1 year ago)
Last Synced: 2024-11-13T22:32:52.560Z (7 months ago)
Topics: ai, gemini, google, llm, rag
Language: Python
Homepage:
Size: 23.5 MB
Stars: 4
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Gemini-to-Image
## Overview
It is a versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation.
The project consists of a Streamlit GUI interface where users can interact with the generated content. Users can upload their own images and provide prompts to generate customized images and text outputs.
Check [imgs](https://github.com/g-hano/Gemini-to-Image/tree/main/imgs) folder for usage video.

![img2img](imgs/Img2Img.png)

## Features
- Accepts user prompts via text input.
- Utilizes Google's Gemini via Langchain to generate enhanced prompts based on user input.
- Generates images based on user prompts.
- Allows users to upload their own images and provides prompts to generate customized images and text outputs.

## Requirements
- Dependencies listed in `requirements.txt`

## Installation
1. Clone the repository:
```
git clone https://github.com/g-hano/Gemini-to-Image.git
```
2. Navigate to the project directory:
```
cd Gemini-to-Image
```
3. Install dependencies:
```
pip install -r requirements.txt
```

## Usage
1. Run the Streamlit app:
```
streamlit run app.py
```
2. Access the Streamlit interface in your web browser.

## Screenshots
![GUI](imgs/GUI.png)

## Acknowledgements
- [Google Gemini](https://gemini.google.com/app)
- Hugging Face Models
- [Text-to-Image](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0)
- [Image-to-Image](https://huggingface.co/timbrooks/instruct-pix2pix)
- [Streamlit](https://docs.streamlit.io/)

## License
MIT

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/g-hano/gemini-to-image

Awesome Lists containing this project

README