Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/smaranjitghose/objectsightai
A powerful and intuitive image analysis interface powered by Google's Gemini Flash 2, built with Python and Streamlit
https://github.com/smaranjitghose/objectsightai
computer-vision gemini gemini-2-0-flash llm python streamlit
Last synced: 25 days ago
JSON representation
A powerful and intuitive image analysis interface powered by Google's Gemini Flash 2, built with Python and Streamlit
- Host: GitHub
- URL: https://github.com/smaranjitghose/objectsightai
- Owner: smaranjitghose
- License: mit
- Created: 2024-12-28T22:16:43.000Z (about 1 month ago)
- Default Branch: master
- Last Pushed: 2024-12-28T22:39:13.000Z (about 1 month ago)
- Last Synced: 2024-12-28T23:20:43.891Z (about 1 month ago)
- Topics: computer-vision, gemini, gemini-2-0-flash, llm, python, streamlit
- Language: Python
- Homepage: https://objectsightai.streamlit.app/
- Size: 0 Bytes
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.MD
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# ObjectSight AI 👁️
A powerful and intuitive image analysis interface powered by Google's Gemini Flash 2, built with Python and Streamlit.![Python](https://img.shields.io/badge/python-3.10+-blue.svg)
![Streamlit](https://img.shields.io/badge/streamlit-latest-red.svg)
![Gemini](https://img.shields.io/badge/gemini--vision-compatible-green.svg)
![License](https://img.shields.io/badge/license-MIT-blue.svg)## 🌟 Features
- 🎯 Real-time object detection and localization
- 📦 Clear bounding box visualization with enhanced labels
- 🔄 Support for common image formats (JPG, JPEG, PNG)
- 🎨 Clean and minimalist user interface
- 💾 Download capability for analyzed images
- 🔑 Secure API key management## 🖼️ Snapshot
![ObjectSight AI Interface](./assets/demo1.png)## 🔧 Prerequisites
- Python 3.10 or higher
- A web Browser
- Google API key from [Google AI Studio](https://makersuite.google.com/app/apikey)## 📥 Installation
1. Clone the repository:
```bash
git clone https://github.com/smaranjitghose/ObjectSightAI.git
cd ObjectSightAI
```2. Create and activate virtual environment:
```bash
# Windows
python -m venv env
.\env\Scripts\activate# Linux/Mac
python3 -m venv env
source env/bin/activate
```3. Install required packages:
```bash
pip install -r requirements.txt
```## 🚀 Usage
1. Start ObjectSight AI:
```bash
streamlit run app.py
```2. Open your browser and navigate to:
```
http://localhost:8501
```## 💡 Quick Start Guide
1. Enter your Google API key in the sidebar
2. Upload an image using the file uploader
3. Write a descriptive prompt about what to analyze
4. Click "Run!" to start the analysis
5. View results and download the analyzed image if desired## 🎯 Example Prompts
```plaintext
"Identify and locate all objects in this image"
"Find and label all people and furniture"
"Detect all electronic devices"
"Locate and identify different types of vehicles"
```## 🛠️ Troubleshooting
### Common Issues
1. **API Key Error**
- Verify API key is entered correctly
- Check if API key has necessary permissions
- Ensure API key is active2. **Image Upload Issues**
- Check if image format is supported
- Ensure image size is under limit
- Verify image is not corrupted3. **Analysis Failures**
- Check internet connection
- Verify API quota hasn't been exceeded
- Ensure prompt is clear and specific## 🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
1. Fork the project
2. Create your feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request## 📝 License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.---
Made with ❤️ by [Smaranjit Ghose](https://github.com/smaranjitghose)