Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by Norconex
A curated list of projects in awesome lists by Norconex .
https://github.com/norconex/crawlers
Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.
collector-fs collector-http crawler crawlers filesystem-crawler flexible java search-engine web-crawler
Last synced: 11 Nov 2024
https://github.com/norconex/importer
Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
extract html java java-library manipulation norconex-importer parse pdf
Last synced: 11 Nov 2024
https://github.com/norconex/collector-filesystem
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
crawler filesystem-crawler java norconex-filesystem-collector search-engine
Last synced: 11 Nov 2024
https://github.com/norconex/commons-lang
Generic library shared between several projects.
Last synced: 11 Nov 2024
https://github.com/norconex/committer-elasticsearch
Implementation of Norconex Committer for Elasticsearch.
committer-elasticsearch norconex-committer
Last synced: 11 Nov 2024
https://github.com/norconex/collector-core
Collector-related code shared between different collector implementations
Last synced: 11 Nov 2024
https://github.com/norconex/committer-core
Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.
Last synced: 11 Nov 2024
https://github.com/norconex/jef-monitor
Web-based application for monitoring jobs progress (created with JEF).
Last synced: 11 Nov 2024
https://github.com/norconex/committer-idol
Autonomy IDOL implementation of Norconex Committer.
idol-committer norconex-committer
Last synced: 11 Nov 2024
https://github.com/norconex/committer-solr
Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.
committer-solr lucidworks norconex-committer solr
Last synced: 11 Nov 2024
https://github.com/norconex/committer-neo4j
Implementation of Norconex Committer for Neo4j.
crawler neo4j neo4j-committer norconex-committer
Last synced: 17 Dec 2024
https://github.com/norconex/committer-sql
Implementation of Norconex Committer for SQL (JDBC) databases.
databases jdbc norconex-committer sql-committer
Last synced: 19 Dec 2024
https://github.com/norconex/committer-azuresearch
Implementation of Norconex Committer for Microsoft Azure Search.
microsoft-azure-search norconex-committer
Last synced: 11 Nov 2024
https://github.com/norconex/committer-cloudsearch
Amazon CloudSearch implementation of Norconex Committer.
amazon-cloudsearch committer-cloudsearch norconex-committer
Last synced: 11 Nov 2024
https://github.com/norconex/commons-maven-parent
Maven parent POM for many Norconex Maven projects.
Last synced: 11 Nov 2024