Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by tokenmill

A curated list of projects in awesome lists by tokenmill .

https://github.com/tokenmill/beagle

Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.

clojure java lucene luwak nlp real-time-search stemming stored-query-engine stream-search

Last synced: 10 Nov 2024

https://github.com/tokenmill/clojure-graalvm-aws-lambda-template

Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.

aws-lambda aws-lambda-clojure aws-lambda-custom-runtime clojure docker graalvm lein-template

Last synced: 10 Nov 2024

https://github.com/tokenmill/timewords

Multilingual library to easily parse date strings to java.util.Date objects.

clojure date-parser date-time java natural-language-processing parsing

Last synced: 10 Nov 2024

https://github.com/tokenmill/crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

crawler crawling crawling-framework elasticsearch java scraping storm storm-crawler vaadin

Last synced: 10 Nov 2024

https://github.com/tokenmill/docx-utils

Easily work with .docx files from Clojure (a wrapper on Apache POI library).

clojars clojure docx poi

Last synced: 12 Oct 2024

https://github.com/tokenmill/fast-url-access-checker

Easily run HTTP GET requests against a list of URLs to check their HTTP status.

clojure http-redirect http-status java url-checker url-cleaning

Last synced: 10 Nov 2024

https://github.com/tokenmill/snowball

Snowball version of the Porter stemmer for the Lithuanian language.

lithuanian-language nlp porter-stemmer snowball stemmer

Last synced: 10 Nov 2024

https://github.com/tokenmill/dictionary-annotator

Fast and configurable UIMA dictionary annotator.

annotators csv dictionary dkpro nlp ruta

Last synced: 10 Nov 2024

https://github.com/tokenmill/common-crawl-utils

Various Common Crawl utilities in Clojure.

cdx-api clojure clojure-library common-crawl warc

Last synced: 10 Nov 2024

https://github.com/tokenmill/docker-images

Docker configurations, images, and examples of Dockerfiles for various TokenMill products and projects.Official source for Docker configurations, images, and examples of Dockerfiles for TokenMill products and projects

clojure docker docker-configurations docker-images dockerfiles graalvm

Last synced: 10 Nov 2024

https://github.com/tokenmill/crawling-framework-example

Demonstration on how to use the Crawling Framework to setup a simple science news crawler and store results in ElasticSearch. Use this configuration to set up your own crawler.

crawler crawling-framework elasticsearch storm-crawler

Last synced: 10 Nov 2024

https://github.com/tokenmill/es-utils

Clojure helper functions for Elasticsearch.

clojure elasticsearch

Last synced: 10 Nov 2024

https://github.com/tokenmill/beagle-performance-benchmarks

Performance benchmarks for the Beagle library, and comparisons with other stored-query solutions.

Last synced: 10 Nov 2024