https://github.com/ashvardanian/usearch-images
Semantic Search demo featuring UForm, USearch, UCall, and Streamlit, to visualize and retrieve from image datasets, similar to "CLIP Retrieval"
- Host: GitHub
- URL: https://github.com/ashvardanian/usearch-images
- Owner: ashvardanian
- Created: 2023-06-04T13:17:29.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-29T19:38:04.000Z (about 1 year ago)
- Last Synced: 2024-10-31T08:42:46.784Z (3 months ago)
- Topics: ai, clip, clip-model, clip-retrival, demo, demo-app, multi-lingual, multi-modal, semantic-search, streamlit, transformer, vector-search
- Language: Python
- Homepage: https://usearch-images.com
- Size: 10.5 MB
- Stars: 38
- Watchers: 4
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# USearch Images
![USearch Images animated text to image multi-modal AI search](assets/usearch-images-slow.gif)
## Semantic Search Demo with [UForm][uform], [USearch][usearch], and [UCall][ucall]
- [x] Can run the GUI and the search part on the same server or on different servers
- [x] Comes with pre-constructed indexes for large datasets
- [x] Supports text-to-image and image-to-image search

To start the Streamlit demo app locally, you only need to download a couple of files:

```sh
mkdir -p /data/unsplash-25k
wget -O /data/unsplash-25k/images.txt https://huggingface.co/datasets/unum-cloud/gallery-unsplash-25k/resolve/main/images.txt
wget -O /data/unsplash-25k/images.uform-vl-multilingual-v2.fbin https://huggingface.co/datasets/unum-cloud/gallery-unsplash-25k/resolve/main/images.uform-vl-multilingual-v2.fbin

pip install -r requirements.txt
streamlit run streamlit_app.py
```

[uform]: https://github.com/unum-cloud/uform
[usearch]: https://github.com/unum-cloud/usearch
[ucall]: https://github.com/unum-cloud/ucall

## Datasets

The default dataset, Unsplash, contains fewer than 25,000 images.
Still, the demo is easy to extend to other datasets, some of which we already embedded with UForm and indexed with USearch.
All datasets are available on [Unum's HuggingFace page][unum-huggingface] and share an identical format:

- `images.txt` contains newline-delimited URLs or Base64-encoded data-URIs of images.
- `images..fbin` contains a binary matrix with a [UForm][uform] embedding for every image from `images.txt`.
- `images..usearch` contains a binary [USearch][usearch] search index for fast kANN.

Additionally, some image-text paired datasets may provide `texts.txt`, `texts..fbin`, `texts..usearch`, following the same logic.

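
To make the file layout above more concrete, here is a minimal sketch of loading `images.txt` together with its embedding matrix using plain NumPy. It is not taken from the demo's code. It assumes that the `.fbin` file follows the layout common in ANN benchmark tooling: two 32-bit integers (row count, column count) followed by row-major `float32` values. The helper names `read_fbin`, `read_image_refs`, and `load_gallery` are hypothetical.

```python
import numpy as np


def read_fbin(path):
    """Read a float32 matrix from an .fbin file.

    ASSUMPTION: the file starts with two int32 values (rows, cols),
    followed by rows * cols float32 values in row-major order.
    """
    with open(path, "rb") as f:
        header = np.fromfile(f, dtype=np.int32, count=2)
        if header.size != 2:
            raise ValueError("file is too short to contain a header")
        rows, cols = int(header[0]), int(header[1])
        data = np.fromfile(f, dtype=np.float32, count=rows * cols)
    if data.size != rows * cols:
        raise ValueError("file is shorter than its header claims")
    return data.reshape(rows, cols)


def read_image_refs(path):
    """Read newline-delimited image URLs or data-URIs, skipping blank lines."""
    with open(path, "r", encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


def load_gallery(images_txt, embeddings_fbin):
    """Load image references and embeddings, checking that they line up."""
    refs = read_image_refs(images_txt)
    vectors = read_fbin(embeddings_fbin)
    if len(refs) != vectors.shape[0]:
        raise ValueError(
            f"{len(refs)} image references but {vectors.shape[0]} embedding rows"
        )
    return refs, vectors
```

With the Unsplash files downloaded as shown earlier, a call such as `load_gallery("/data/unsplash-25k/images.txt", "/data/unsplash-25k/images.uform-vl-multilingual-v2.fbin")` would return the list of image references and a matrix with one row per reference, provided the layout assumption holds.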
[unum-huggingface]: https://huggingface.co/unum-cloud
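
For small datasets, or to sanity-check the results of an approximate index, an exact search over the embedding matrix can serve as a baseline. The sketch below is a brute-force NumPy illustration, not how USearch works internally. The cosine similarity metric and the function name `top_k_cosine` are assumptions made for this example; the README does not state which metric the pre-built indexes use.

```python
import numpy as np


def top_k_cosine(query, vectors, k=5):
    """Return (indices, scores) of the k rows most similar to the query.

    ASSUMPTION: cosine similarity is the metric of interest.
    This is an exact O(n * d) scan, not an approximate index lookup.
    """
    vectors = np.asarray(vectors, dtype=np.float32)
    query = np.asarray(query, dtype=np.float32)
    if k <= 0 or vectors.shape[0] == 0:
        raise ValueError("k must be positive and vectors must be non-empty")

    # Cosine similarity = dot product divided by the product of norms.
    norms = np.linalg.norm(vectors, axis=1) * np.linalg.norm(query)
    scores = (vectors @ query) / np.maximum(norms, 1e-12)

    # Select the k best candidates, then sort only those by score.
    k = min(k, scores.shape[0])
    best = np.argpartition(-scores, k - 1)[:k]
    best = best[np.argsort(-scores[best])]
    return best, scores[best]
```

For image-to-image search, the query would be the embedding of one of the images; for text-to-image search, it would be a text embedding produced by the same model, so that both live in the same vector space.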