https://github.com/g-hano/gemini-to-image
A versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation. The project consists of a Streamlit GUI interface where users can interact with the generated content.
https://github.com/g-hano/gemini-to-image
ai gemini google llm rag
Last synced: 5 months ago
JSON representation
A versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation. The project consists of a Streamlit GUI interface where users can interact with the generated content.
- Host: GitHub
- URL: https://github.com/g-hano/gemini-to-image
- Owner: g-hano
- License: mit
- Created: 2024-03-03T17:54:33.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-03-03T19:04:41.000Z (about 1 year ago)
- Last Synced: 2024-11-13T22:32:52.560Z (7 months ago)
- Topics: ai, gemini, google, llm, rag
- Language: Python
- Homepage:
- Size: 23.5 MB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Gemini-to-Image
## Overview
It is a versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation.
The project consists of a Streamlit GUI interface where users can interact with the generated content. Users can upload their own images and provide prompts to generate customized images and text outputs.
Check [imgs](https://github.com/g-hano/Gemini-to-Image/tree/main/imgs) folder for usage video.
## Features
- Accepts user prompts via text input.
- Utilizes Google's Gemini via Langchain to generate enhanced prompts based on user input.
- Generates images based on user prompts.
- Allows users to upload their own images and provides prompts to generate customized images and text outputs.## Requirements
- Dependencies listed in `requirements.txt`## Installation
1. Clone the repository:
```
git clone https://github.com/g-hano/Gemini-to-Image.git
```
2. Navigate to the project directory:
```
cd Gemini-to-Image
```
3. Install dependencies:
```
pip install -r requirements.txt
```## Usage
1. Run the Streamlit app:
```
streamlit run app.py
```
2. Access the Streamlit interface in your web browser.## Screenshots
## Acknowledgements
- [Google Gemini](https://gemini.google.com/app)
- Hugging Face Models
- [Text-to-Image](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0)
- [Image-to-Image](https://huggingface.co/timbrooks/instruct-pix2pix)
- [Streamlit](https://docs.streamlit.io/)## License
MIT