Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jaeksoft/opensearchserver
Open-source Enterprise Grade Search Engine Software
https://github.com/jaeksoft/opensearchserver
crawler custom-search enterprise indexing java lucene ocr opensearchserver search search-engine synonyms webcrawler webcrawling
Last synced: 7 days ago
JSON representation
Open-source Enterprise Grade Search Engine Software
- Host: GitHub
- URL: https://github.com/jaeksoft/opensearchserver
- Owner: jaeksoft
- License: apache-2.0
- Created: 2013-07-18T16:03:33.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2022-09-03T06:28:22.000Z (about 2 years ago)
- Last Synced: 2024-06-15T12:35:05.291Z (5 months ago)
- Topics: crawler, custom-search, enterprise, indexing, java, lucene, ocr, opensearchserver, search, search-engine, synonyms, webcrawler, webcrawling
- Language: Java
- Homepage: http://www.opensearchserver.com
- Size: 498 MB
- Stars: 499
- Watchers: 77
- Forks: 191
- Open Issues: 224
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
- awesome-repositories - jaeksoft/opensearchserver - Open-source Enterprise Grade Search Engine Software (Java)
README
[OpenSearchServer](http://www.opensearchserver.com)
===================================================[![Build Status](https://travis-ci.org/jaeksoft/opensearchserver.svg?branch=master)](https://travis-ci.org/jaeksoft/opensearchserver)
[![Maven Central](https://maven-badges.herokuapp.com/maven-central/com.jaeksoft/opensearchserver/badge.svg)](https://maven-badges.herokuapp.com/maven-central/com.jaeksoft/opensearchserver)
[![Join the chat at https://gitter.im/jaeksoft/opensearchserver](https://badges.gitter.im/jaeksoft/opensearchserver.svg)](https://gitter.im/jaeksoft/opensearchserver)OpenSearchServer is a powerful, enterprise-class, search engine software based on Lucene.
Using the web user interface, the crawlers (web, file, database, ...) and the JSON webservice you will be able to integrate quickly and easily advanced full-text search capabilities in your application. OpenSearchServer runs on Linux/Unix/BSD/Windows.Quickstart
----------
### Docker imageNot yet there.. coming soon..
### Go with the interface and/or the API
http://localhost:9090Useful links
------------
+ Download binaries: https://www.opensearchserver.com/#download
+ The documentation: https://www.opensearchserver.com/documentation
+ Issues (bugs, enhancements): https://github.com/jaeksoft/opensearchserver/issuesFeatures
--------
### Search functions
- Advanced full-text search features
- Phonetic search
- Advanced boolean search with query language
- Clustered results with faceting and collapsing
- Filter search using sub-requests (including negative filters)
- Geolocation
- Spell-checking
- Relevance customization
- Search suggestion facility (auto-completion)### Indexation
- Supports 18 languages
- Fields schema with analyzers in each language
- Several filters: n-gram, lemmatization, shingle, stripping diacritic from words,…
- Automatic language recognition
- Named entity recognition
- Word synonyms and expression synonyms
- Export indexed terms with frequencies
- Automatic classification### Document supported
- HTML / XHTML
- MS Office documents (Word, Excel, Powerpoint, Visio, Publisher)
- OpenOffice documents
- Adobe PDF (with OCR)
- RTF, Plaintext
- Audio files metadata (wav, mp3, AIFF, Ogg)
- Torrent files
- OCR over images### Crawlers
- The web crawler for internet, extranet and intranet
- The file systems crawler for local and remote files (NFS, SMB/CIFS, FTP, FTPS, SWIFT)
- The database crawler for all JDBC databases (MySQL, PostgreSQL, Oracle, SQL Server, …)
- Filter inclusion or exclusion with wildcards
- Session parameters removal
- SQL join and linked files support
- Screenshot capture### General
- JSON web service
- Index replication and sharding
- Federated searchLicense
-------
Copyright Emmanuel Keller / Jaeksoft (2008-2020)Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License athttp://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.