Projects in Awesome Lists by opensemanticsearch
A curated list of projects in awesome lists by opensemanticsearch .
https://github.com/opensemanticsearch/open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
annotation faceted-search fulltext-search investigative-journalism journalism named-entity-recognition ocr ontologies osint python research-tool search search-engine search-interface semantic skos text-analysis text-mining thesaurus ui
Last synced: 16 May 2025
https://github.com/opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
annotation documents elasticsearch enrichment etl extract extract-information extract-text extractor ingest ingestion-pipeline ingests-documents named-entity-recognition nlp ocr pdf python rdf solr solr-dataimporter
Last synced: 06 Apr 2025
https://github.com/opensemanticsearch/open-semantic-entity-search-api
Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents by linked data knowledge graph like SKOS thesaurus, RDF ontology, database(s) or list(s) of names
api disambiguation entity-extraction knowledge-graph knowledgebase linked-data linked-data-api linkeddata named-entities named-entity-recognition natural-language-processing nlp python reconciliation reconciliation-service rest-api semantic semantic-analysis semantic-annotation thesaurus
Last synced: 16 Mar 2025
https://github.com/opensemanticsearch/open-semantic-search-apps
Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
django django-application named-entities named-entity-recognition ocr ontologies ontology python research-data-management research-tool search search-interface skos solr solr-client solr-dataimporter thesaurus ui user-interface
Last synced: 12 Feb 2025
https://github.com/opensemanticsearch/open-semantic-visual-graph-explorer
Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visualization of direct and indirect connections between named entities like persons, organizations, locations & concepts from thesarus or ontologies within your documents and knowledgegraph
discovery exploration explorer graph graph-analysis graph-analytics graph-visualisation graph-visualization knowledge-discovery linkeddata named-entity-graph network-analysis network-virtualization network-visualization python semantic semantic-ui ui user-interface visualization
Last synced: 16 Mar 2025
https://github.com/opensemanticsearch/solr-ontology-tagger
Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri
faceted-search ontologies ontology python rdf skos solr tagger tagging thesaurus
Last synced: 21 Mar 2025
https://github.com/opensemanticsearch/solr-php-ui
Solr client and user interface for search
exploratory exploratory-analysis exploratory-search faceted-search filter filtering fulltext-search named-entities php search search-engine solr ui user-interface
Last synced: 21 Mar 2025
https://github.com/opensemanticsearch/solr-relevance-ranking-analysis
Solr Relevance Ranking Analysis and Visualization Tool
analysis apache-solr debugging django django-application information-retrieval information-visualization python ranking relevance relevancy search-engine-optimization search-quality-evaluation search-relevance solr ui visualisation visualization visualize visualizer
Last synced: 23 Mar 2025
https://github.com/opensemanticsearch/open-semantic-search-appliance
Open Semantic Search Appliance (VM)
Last synced: 23 Mar 2025
https://github.com/opensemanticsearch/lexemes
Import lexemes (dictionary including different grammar forms/lexical forms for each lexical entry) from Wikidata to Apache Solr synonyms config
apache-solr grammar grammar-rules grammars lemmatization lemmatizer linkeddata opendata semantic semantics solr solr-dataimporter synonyms wikidata
Last synced: 12 Feb 2025
https://github.com/opensemanticsearch/solr-synonames
Import synonames (multilingual variants of first names from Wikidata) to Solr managed synonyms graph
information-retrieval solr synonym synonym-discovery synonyms synonyms-data
Last synced: 23 Apr 2025
https://github.com/opensemanticsearch/open-semantic-etl-filemonitoring-remote
File monitoring of filesystem by inotify for indexing new/changed files immediately by a remote API on remote search server
elasticsearch inotify monitor-files monitoring python search-engine solr
Last synced: 12 Feb 2025
https://github.com/opensemanticsearch/tika-server.deb
Apache Tika Server as Debian GNU/Linux and Ubuntu Linux package
debian debian-packages debian-packaging tika tika-server tika-server-jar
Last synced: 12 Feb 2025
https://github.com/opensemanticsearch/spacy-services.deb
Debian & Ubuntu package for REST microservices for spaCy natural language processing and machine learning framework for named entity recognition
api debian debian-packages named-entity-recognition natural-language-processing nlp-machine-learning python spacy spacy-nlp
Last synced: 12 Feb 2025
https://github.com/opensemanticsearch/tesseract-ocr-cache
Tesseract OCR wrapper for Apache Tika and/or Open Semantic ETL caching the OCR results, so Tika-Server or Open Semantic ETL has not to reprocess slow and expensive OCR on same images again
cache caching ocr python tesseract tesseract-ocr tika tika-server
Last synced: 17 Jun 2025
https://github.com/opensemanticsearch/tika-python.deb
tika-python as Debian GNU/Linux and Ubuntu Linux package
debian debian-packaging tika-api tika-python tika-wrapper ubuntu ubuntu-packages
Last synced: 24 Feb 2025
https://github.com/opensemanticsearch/neo4j.deb
Debian package of Neo4j graph database preconfigured for Open Semantic ETL and Open Semantic Search
Last synced: 24 Feb 2025
https://github.com/opensemanticsearch/solr.deb
Apache Solr as Debian package with preconfigured schema for Open Semantic ETL and Open Semantic Search
deb debian debian-packages debian-packaging solr
Last synced: 12 Feb 2025