An open API service indexing awesome lists of open source software.

https://github.com/souravsuvarna/picinteract

An Image Enabled Interrogation. This Streamlit application enables users to inquire about an uploaded image, receiving responses from a conversational AI agent. Powered by the OpenAI GPT-3.5 Turbo model, the agent generates answers based on both the provided image and user input.
https://github.com/souravsuvarna/picinteract

ai artificial-intelligence chatbot machine-learning python streamlit

Last synced: about 2 months ago
JSON representation

An Image Enabled Interrogation. This Streamlit application enables users to inquire about an uploaded image, receiving responses from a conversational AI agent. Powered by the OpenAI GPT-3.5 Turbo model, the agent generates answers based on both the provided image and user input.

Awesome Lists containing this project

README

          

# PicInteract - An Image Enabled Interrogation

This Streamlit application enables users to inquire about an uploaded image, receiving responses from a conversational AI agent. Powered by the OpenAI GPT-3.5 Turbo model, the agent generates answers based on both the provided image and user input.

## Installation

1. Clone the repository:

git clone https://github.com/your-username/PicInteract.git

2. Change to the project directory:

cd PicInteract

3. Install the required dependencies:

pip install -r requirements.txt

4. Obtain an **OpenAI API key**. You can sign up for an API key at [OpenAI](https://platform.openai.com).

5. Replace the placeholder API key in the main.py file with your actual OpenAI API key:

llm = ChatOpenAI(
openai_api_key='YOUR_API_KEY_HERE',
temperature=0,
model_name="gpt-3.5-turbo"
)

6. Run the Streamlit application:

streamlit run main.py

7. Open your web browser and go to http://localhost:8501 to access the application.

## Usage

1. Upload an image by clicking the file upload button.

2. The uploaded image will be displayed.

3. Enter a question about the image in the text input field.

4. The conversational AI agent will generate a response based on the provided question and image.

5. The response will be displayed below the question input.

## Tools

The application utilizes the following custom tools:

- **ImageCaptionTool**: Generates a textual caption for the uploaded image.
- **ObjectDetectionTool**: Performs object detection on the uploaded image and identifies the objects present.

## Screenshots

[![Pic-Interact3.png](https://i.postimg.cc/C5Hx8W5w/Pic-Interact3.png)](https://postimg.cc/Cn50Trb2)

[![Pic-Interact4.png](https://i.postimg.cc/jj5cFTYk/Pic-Interact4.png)](https://postimg.cc/hXkVXHj9)

## Authors

- [@SriPrada Upadya](https://github.com/sriprada346)

## Reference
[![portfolio](https://encrypted-tbn2.gstatic.com/images?q=tbn:ANd9GcQG-eYMnCL7YUDv-ux9wQUGLRjxud3AaFlgsrE_RnfgHq3csIef)](https://github.com/computervisioneng)