Projects in Awesome Lists by USCDataScience
A curated list of USCDataScience projects that appear in awesome lists.
https://github.com/USCDataScience/sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
big-data distributed-systems information-retrieval nutch search search-engine solr spark tika web-crawler
Last synced: 25 Mar 2025