An open API service indexing awesome lists of open source software.

https://github.com/sebastian-nagel/warc-crawler

Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
https://github.com/sebastian-nagel/warc-crawler

apache-storm elasticsearch solr stormcrawler warc warc-files web-archives

Last synced: 4 months ago
JSON representation

Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr

Awesome Lists containing this project