Projects in Awesome Lists by tdebatty
A curated list of projects in awesome lists by tdebatty.
https://github.com/tdebatty/java-string-similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
algorithm cosine-similarity damerau-levenshtein distance distance-measure jaro-winkler java levenshtein-distance shingles similarity-measures string-distance
Last synced: 13 May 2025
https://github.com/tdebatty/java-lsh
A Java implementation of Locality Sensitive Hashing (LSH)
Last synced: 16 May 2025
https://github.com/tdebatty/spark-knn-graphs
Spark algorithms for building k-nn graphs
algorithm knn-graphs lsh-superbit nearest-neighbor-search nn-descent processing-knn-graphs spark spark-knn-graphs
Last synced: 12 Apr 2025
https://github.com/tdebatty/java-graphs
Algorithms that build a k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent, ...
Last synced: 12 Apr 2025
https://github.com/tdebatty/php-language-processing
A PHP library for language processing. Includes string distance functions (Levenshtein, Jaro-Winkler, ...), stemming, etc.
Last synced: 26 Jun 2025
https://github.com/tdebatty/java-spamsum
A Java implementation of SpamSum / SSDeep
Last synced: 12 Apr 2025
https://github.com/tdebatty/java-datasets
Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
Last synced: 04 Mar 2025
https://github.com/tdebatty/php-vector-matrix
A PHP library for vector and matrix algebra
Last synced: 12 Apr 2025
https://github.com/tdebatty/laravel-resource-generator
A complete code generator for Laravel resources (includes fully working controller code, views etc.)
Last synced: 12 Apr 2025
https://github.com/tdebatty/php-odt2html
PHP library to convert OpenOffice files (ODT) to HTML
Last synced: 12 Apr 2025
https://github.com/tdebatty/spark-kmedoids
Spark implementation of the k-medoids clustering algorithm
Last synced: 30 Oct 2025
https://github.com/tdebatty/hadoop-knn-graph
Hadoop implementation of KNN graph building algorithms (Brute force, NNDescent, NNCtph, ...)
Last synced: 04 Mar 2025
https://github.com/tdebatty/hadoop-clustering
Algorithms to perform clustering with Hadoop
Last synced: 02 Dec 2025
https://github.com/tdebatty/java-aggregation
Java implementation of aggregation operators: WA, OWA and WOWA
Last synced: 04 Mar 2025
https://github.com/tdebatty/php-data-structures
Data structures implemented in PHP: KDTree, ...
Last synced: 04 Mar 2025
https://github.com/tdebatty/sparkpackage-maven-plugin
Maven plugin for publishing on spark-packages
Last synced: 04 Mar 2025
https://github.com/tdebatty/jinu
Java algorithm evaluation framework
algorithms evaluation framework java
Last synced: 27 Jul 2025
https://github.com/tdebatty/php-aggregation-operators
PHP implementations of Weighted Ordered Weighted Aggregation (WOWA), Ordered Weighted Averaging (OWA), etc.
Last synced: 04 Mar 2025