Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by tokenmill
A curated list of projects in awesome lists by tokenmill .
https://github.com/tokenmill/beagle
Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
clojure java lucene luwak nlp real-time-search stemming stored-query-engine stream-search
Last synced: 10 Nov 2024
https://github.com/tokenmill/clojure-graalvm-aws-lambda-template
Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.
aws-lambda aws-lambda-clojure aws-lambda-custom-runtime clojure docker graalvm lein-template
Last synced: 10 Nov 2024
https://github.com/tokenmill/timewords
Multilingual library to easily parse date strings to java.util.Date objects.
clojure date-parser date-time java natural-language-processing parsing
Last synced: 10 Nov 2024
https://github.com/tokenmill/crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
crawler crawling crawling-framework elasticsearch java scraping storm storm-crawler vaadin
Last synced: 10 Nov 2024
https://github.com/tokenmill/docx-utils
Easily work with .docx files from Clojure (a wrapper on Apache POI library).
Last synced: 12 Oct 2024
https://github.com/tokenmill/fast-url-access-checker
Easily run HTTP GET requests against a list of URLs to check their HTTP status.
clojure http-redirect http-status java url-checker url-cleaning
Last synced: 10 Nov 2024
https://github.com/tokenmill/snowball
Snowball version of the Porter stemmer for the Lithuanian language.
lithuanian-language nlp porter-stemmer snowball stemmer
Last synced: 10 Nov 2024
https://github.com/tokenmill/dictionary-annotator
Fast and configurable UIMA dictionary annotator.
annotators csv dictionary dkpro nlp ruta
Last synced: 10 Nov 2024
https://github.com/tokenmill/common-crawl-utils
Various Common Crawl utilities in Clojure.
cdx-api clojure clojure-library common-crawl warc
Last synced: 10 Nov 2024
https://github.com/tokenmill/docker-images
Docker configurations, images, and examples of Dockerfiles for various TokenMill products and projects.Official source for Docker configurations, images, and examples of Dockerfiles for TokenMill products and projects
clojure docker docker-configurations docker-images dockerfiles graalvm
Last synced: 10 Nov 2024
https://github.com/tokenmill/crawling-framework-example
Demonstration on how to use the Crawling Framework to setup a simple science news crawler and store results in ElasticSearch. Use this configuration to set up your own crawler.
crawler crawling-framework elasticsearch storm-crawler
Last synced: 10 Nov 2024
https://github.com/tokenmill/es-utils
Clojure helper functions for Elasticsearch.
Last synced: 10 Nov 2024
https://github.com/tokenmill/beagle-performance-benchmarks
Performance benchmarks for the Beagle library, and comparisons with other stored-query solutions.
Last synced: 10 Nov 2024