https://github.com/uk0/kmars.ai_ai_image_analyzer
AI Image Analyzer for ollama mistral.rs molmo in Mac M2 max (Screen Capture Analyzer ,Camera Capture Analyzer)
https://github.com/uk0/kmars.ai_ai_image_analyzer
image-to-text ollama
Last synced: about 1 month ago
JSON representation
AI Image Analyzer for ollama mistral.rs molmo in Mac M2 max (Screen Capture Analyzer ,Camera Capture Analyzer)
- Host: GitHub
- URL: https://github.com/uk0/kmars.ai_ai_image_analyzer
- Owner: uk0
- Created: 2024-10-05T09:36:42.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-10-31T04:26:34.000Z (7 months ago)
- Last Synced: 2025-04-10T19:33:15.639Z (about 1 month ago)
- Topics: image-to-text, ollama
- Language: HTML
- Homepage:
- Size: 7.28 MB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
### Kmars.ai_AI_Image_Analyzer
AI Image Analyzer
#### Description
This is a simple AI Image Analyzer that uses the `ollama`,`mistral.rs` API or `transformers` to analyze images. The user can upload an image and the AI will analyze the image and provide a description of the image. The user can also translate the description to a different language.
The original intention of the project is to make a small visual model tool for doing some tests or some simple applications, such as picture description, picture translation, etc.
It may not be perfect but it's a starting point for you to start understanding visual models.If it is helpful to you, I hope you will not hesitate to give me a star. This will be a recognition of my work. If you have interesting ideas, you can submit them to me on issues.
The current test found that only `Molmo-7B-D-0924` will accurately return coordinate-related information(`电饭煲`). Other models seem to be unable to do so. I am trying to let minicpm return spatial information. I believe it will be updated soon.
Since the startup of loading the translation model will be slow, this is normal. Just wait for the web site(`http://127.0.0.1:7777`) to load. Don't worry.
#### model
* allenai/Molmo-7B-D-0924
* Helsinki-NLP/opus-mt-en-zh#### example (only `Molmo-7B-D-0924`)


#### Guide
