https://github.com/ishitaagl20/genaivision
Exploring Image Processing with Gemini - Image Describer, Object Finder, and Object Tracker
https://github.com/ishitaagl20/genaivision
gemini object-detection object-tracking opencv streamlit
Last synced: about 2 months ago
JSON representation
Exploring Image Processing with Gemini - Image Describer, Object Finder, and Object Tracker
- Host: GitHub
- URL: https://github.com/ishitaagl20/genaivision
- Owner: ishitaagl20
- Created: 2024-11-17T17:29:26.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-11-17T18:13:43.000Z (over 1 year ago)
- Last Synced: 2025-01-21T05:28:15.759Z (over 1 year ago)
- Topics: gemini, object-detection, object-tracking, opencv, streamlit
- Language: Python
- Homepage:
- Size: 10.7 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Image Describer, Object Finder, and Object Tracker
This project consists of three modules that utilize the **Gemini-1.5-Flash** model to analyze and interact with images and videos:
1. **Image Description**: Describes the contents of an uploaded image.
2. **Object Finder**: Finds the location of a specific object in an image.
3. **Object Tracker**: Tracks an object in a video and timestamps when the object appears.
To begin, clone the repository to your local machine:
```bash
git clone https://github.com/ishitaagl20/GenAIVision.git
```
Before running the app, make sure you have the required Python libraries:
```bash
pip install streamlit pillow google-generativeai