An open API service indexing awesome lists of open source software.

https://github.com/chrismattmann/imagecat

ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
https://github.com/chrismattmann/imagecat

apache memex oodt oodt-radix solr tika

Last synced: 6 months ago
JSON representation

ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.

Awesome Lists containing this project