{"id":16084085,"url":"https://github.com/skwzrd/imagesearch","last_synced_at":"2026-01-21T04:33:50.009Z","repository":{"id":257816191,"uuid":"856756495","full_name":"skwzrd/imagesearch","owner":"skwzrd","description":"OCR, EXIF, CLIP, ImageHash processing and search.","archived":false,"fork":false,"pushed_at":"2024-10-13T00:56:49.000Z","size":1204,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-11T15:10:26.750Z","etag":null,"topics":["clip","embedding","exif","exif-data-extraction","flask","image-processing","image-recognition","ocr","ocr-recognition","python","search"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/skwzrd.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-09-13T06:33:10.000Z","updated_at":"2024-10-13T00:56:52.000Z","dependencies_parsed_at":"2025-04-11T15:04:16.316Z","dependency_job_id":"595b026f-2b67-40e1-a7b8-127508cd1fce","html_url":"https://github.com/skwzrd/imagesearch","commit_stats":null,"previous_names":["skwzrd/imagesearch","skwzrd/eclipse_img"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/skwzrd/imagesearch","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skwzrd%2Fimagesearch","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skwzrd%2Fimagesearch/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skwzrd%2Fimagesearch/releases","manifests_url":"h
ttps://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skwzrd%2Fimagesearch/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/skwzrd","download_url":"https://codeload.github.com/skwzrd/imagesearch/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skwzrd%2Fimagesearch/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28626280,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-21T02:47:06.670Z","status":"ssl_error","status_checked_at":"2026-01-21T02:45:44.886Z","response_time":86,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["clip","embedding","exif","exif-data-extraction","flask","image-processing","image-recognition","ocr","ocr-recognition","python","search"],"created_at":"2024-10-09T12:23:40.652Z","updated_at":"2026-01-21T04:33:49.990Z","avatar_url":"https://github.com/skwzrd.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ImageSearch\n\nA program that stitches together many libraries to provide a new image search experience.\n\n![preview](preview.png)\n\nSet up your venv and install the dependencies.\n- Run `python3.12 -m pip install git+https://github.com/openai/CLIP.git` to install CLIP.\n- Run `python3.12 -m pip install -r 
requirements.txt`\n- If you don't have nvidia drivers installed, you can use `sudo ubuntu-drivers autoinstall` and reboot.\n\nSet your configs in `configs.py`.\n\nRun `python3.12 process.py` to crawl images and populate a sqlite database with CLIP embeddings, OCR text, EXIF data, Hashes, Noise, and/or Faces.\n\nRun `python3.12 web.py` to launch the web UI for searching the sqlite database.\n\n\n## Dependencies\n\n- For searching embeddings, we use [https://github.com/openai/CLIP](https://github.com/openai/CLIP) (see `requirements.txt` for install).\n\nIf you want OCR, pick one OCR program.\n\n- ocrs (recommended) (download and compile [https://github.com/robertknight/ocrs](https://github.com/robertknight/ocrs))\n- doctr (see `requirements.txt`)\n- pytesseract (see `requirements.txt`)\n\nIf you want facial detection to search by face counts, install `face_recognition` (see `requirements.txt` ).\n\nNeed help choosing? See [https://neetventures.com/post/50](https://neetventures.com/post/50).\n\n\n## Performance\n\nMy server (see below) does 160 images/second with EXIF, CLIP, and OCR ([https://github.com/robertknight/ocrs](https://github.com/robertknight/ocrs)) engaged.\n\n\n## Hardware Requirements\n\nThese metrics were taken while processing images. I'll let you interpret them.\n\n`sudo lshw -class CPU | grep -i product`\n\n```\nproduct: Intel(R) Core(TM) i5-9400F CPU @ 2.90GHz\n```\n\n`free -mh`\n\n```\n               total        used        free      shared  buff/cache   available\nMem:            31Gi       2.5Gi        20Gi        17Mi       8.0Gi        28Gi\n```\n\n`nvidia-smi`\n\n```\n+-----------------------------------------------------------------------------------------+\n| NVIDIA-SMI 550.107.02             Driver Version: 550.107.02     CUDA Version: 12.4     |\n|-----------------------------------------+------------------------+----------------------+\n| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. 
ECC |\n| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |\n|                                         |                        |               MIG M. |\n|=========================================+========================+======================|\n|   0  NVIDIA GeForce GTX 1660 Ti     Off |   00000000:01:00.0 Off |                  N/A |\n| 23%   41C    P2             28W /  120W |     466MiB /   6144MiB |      5%      Default |\n|                                         |                        |                  N/A |\n+-----------------------------------------+------------------------+----------------------+\n                                                                                         \n+-----------------------------------------------------------------------------------------+\n| Processes:                                                                              |\n|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |\n|        ID   ID                                                               Usage      |\n|=========================================================================================|\n|    0   N/A  N/A     25919      C   python3.12                                    462MiB |\n+-----------------------------------------------------------------------------------------+\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fskwzrd%2Fimagesearch","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fskwzrd%2Fimagesearch","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fskwzrd%2Fimagesearch/lists"}