Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by Norconex

A curated list of projects in awesome lists by Norconex .

https://github.com/norconex/crawlers

Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.

collector-fs collector-http crawler crawlers filesystem-crawler flexible java search-engine web-crawler

Last synced: 11 Nov 2024

https://github.com/norconex/importer

Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.

extract html java java-library manipulation norconex-importer parse pdf

Last synced: 11 Nov 2024

https://github.com/norconex/collector-filesystem

Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.

crawler filesystem-crawler java norconex-filesystem-collector search-engine

Last synced: 11 Nov 2024

https://github.com/norconex/commons-lang

Generic library shared between several projects.

java library

Last synced: 11 Nov 2024

https://github.com/norconex/committer-elasticsearch

Implementation of Norconex Committer for Elasticsearch.

committer-elasticsearch norconex-committer

Last synced: 11 Nov 2024

https://github.com/norconex/collector-core

Collector-related code shared between different collector implementations

Last synced: 11 Nov 2024

https://github.com/norconex/committer-core

Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.

Last synced: 11 Nov 2024

https://github.com/norconex/jef

Job Execution Framework.

Last synced: 11 Nov 2024

https://github.com/norconex/jef-monitor

Web-based application for monitoring jobs progress (created with JEF).

Last synced: 11 Nov 2024

https://github.com/norconex/committer-idol

Autonomy IDOL implementation of Norconex Committer.

idol-committer norconex-committer

Last synced: 11 Nov 2024

https://github.com/norconex/committer-solr

Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.

committer-solr lucidworks norconex-committer solr

Last synced: 11 Nov 2024

https://github.com/norconex/committer-neo4j

Implementation of Norconex Committer for Neo4j.

crawler neo4j neo4j-committer norconex-committer

Last synced: 17 Dec 2024

https://github.com/norconex/committer-sql

Implementation of Norconex Committer for SQL (JDBC) databases.

databases jdbc norconex-committer sql-committer

Last synced: 19 Dec 2024

https://github.com/norconex/committer-azuresearch

Implementation of Norconex Committer for Microsoft Azure Search.

microsoft-azure-search norconex-committer

Last synced: 11 Nov 2024

https://github.com/norconex/committer-cloudsearch

Amazon CloudSearch implementation of Norconex Committer.

amazon-cloudsearch committer-cloudsearch norconex-committer

Last synced: 11 Nov 2024

https://github.com/norconex/commons-maven-parent

Maven parent POM for many Norconex Maven projects.

Last synced: 11 Nov 2024