Projects in Awesome Lists tagged with string-distance
A curated list of projects in awesome lists tagged with string-distance.
https://github.com/tdebatty/java-string-similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...
algorithm cosine-similarity damerau-levenshtein distance distance-measure jaro-winkler java levenshtein-distance shingles similarity-measures string-distance
Last synced: 13 May 2025
https://github.com/j535d165/recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
data-matching dedupe deduplication entity-resolution machine-learning privacy python python-library record-linkage similarity string-distance utrecht-university
Last synced: 14 May 2025
https://github.com/xdrop/fuzzywuzzy
Java implementation of the well-known Python fuzzywuzzy fuzzy string matching algorithm. Fuzzy search for Java.
fuzzy-matching fuzzy-search fuzzywuzzy java python-levenshtein string-distance
Last synced: 14 Apr 2025
https://github.com/hbollon/go-edlib
📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
algorithms cosine damerau-levenshtein edit-distance edit-distance-algorithms go golang golang-string-comparison hamming jaro-winkler lcs lcs-distance levenshtein levenshtein-distance similarity-measures string-comparison string-distance string-matching unicode
Last synced: 08 Apr 2025
https://github.com/feature23/StringSimilarity.NET
A .NET port of java-string-similarity
algorithms cosine-similarity damerau-levenshtein distance dotnet jaro-winkler lcs-distance levenshtein-distance shingles similarity-measures string string-distance string-metrics strings winkler
Last synced: 04 May 2025
https://github.com/adrg/strutil
Go metrics for calculating string similarity and other string utility functions
dice-coefficient golang hamming-distance jaccard jaccard-index jaccard-similarity jaro jaro-winkler levenshtein n-gram n-gram-intersection overlap-coefficient smith-waterman smith-waterman-gotoh sorensen-dice string string-distance string-matching string-metrics string-similarity
Last synced: 14 May 2025
https://github.com/turnerj/quickenshtein
Making the quickest and most memory-efficient implementation of Levenshtein distance, with SIMD and threading support
edit-distance hardware-intrinsics levenshtein levenshtein-distance simd string-distance threading
Last synced: 07 Apr 2025
https://github.com/cadmiumcr/cadmium
Natural Language Processing (NLP) library for Crystal
crystal crystal-lang crystal-language inflector nlp phonetics readability sentiment-analysis shards stemmer string-distance tf-idf transliterator tries wordnet
Last synced: 10 May 2025
https://github.com/fasiha/mudderjs
Lexicographically-subdivide the “space” between strings, by defining an alternate non-base-ten number system using a pre-defined dictionary of symbol↔︎number mappings. Handy for ordering NoSQL keys.
lexicographical radix string string-distance
Last synced: 16 May 2025
https://github.com/technikhil314/offline-diff-viewer
A privacy-focused, easily shareable, open-source diff viewer with anonymous tracking.
deflate deflation diff diff-checker diff-viewer diffchecker difference enterprise-data gzip nuxt nuxtjs open-source privacy string-compression string-distance text-diff textdiff textdistance vue vuejs
Last synced: 12 Oct 2025
https://github.com/daniel-liu-c0deb0t/triple_accel
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
algorithms avx2 dynamic-programming hamming levenshtein rust simd sse string-distance string-matching string-search string-similarity
Last synced: 05 Apr 2025
https://github.com/agext/levenshtein
Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
awesome-go common-prefix-bonus edit-costs levenshtein levenshtein-distance similarity-metric string-distance string-pairs string-similarity winkler
Last synced: 14 Mar 2025
https://github.com/anirbanmu/str_metrics
Ruby gem (native extension in Rust) providing implementations of various string metrics
damerau-levenshtein jaro jaro-winkler levenshtein native native-extension ruby ruby-gem rubygem rust rust-lang sorensen-dice string-comparison string-distance string-metrics strings utility
Last synced: 21 Aug 2025
https://github.com/dexyk/stringosim
String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), cosine similarity...
comparison cosine-distance distance jaccard jaro-distance jaro-winkler levenshtein string-distance
Last synced: 10 Apr 2025
https://github.com/dedupeio/affinegap
📐 A Cython implementation of the affine gap string distance
cython levenshtein-distance python string-distance
Last synced: 11 Mar 2026
https://github.com/hyperjumptech/beda
Beda is a Go library for detecting how similar two strings are
difference go golang string-distance string-matching string-similarity
Last synced: 14 May 2025
https://github.com/iesl/stance
Learned string similarity for entity names using optimal transport.
aliases entity-resolution optimal-transport record-linkage stance string-distance string-matching string-similarity
Last synced: 09 Jul 2025
https://github.com/lovit/levenshtein_finder
Similar string search in Levenshtein distance
Last synced: 17 Jan 2026
https://github.com/dynom/tysug
A project that helps prevent typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts
algorithm cors docker go golang jaro jaro-winkler keyboard keyboard-layout library spelling-errors string-distance suggestions toml typing typo webservice words
Last synced: 13 Apr 2025
https://github.com/cicirello/javapermutationtools
A Java library for computation on permutations and sequences
edit-distance permutation-distance permutation-distance-metrics permutations sequences string-distance
Last synced: 15 Apr 2025
https://github.com/sumn2u/string-comparisons
A collection of string comparison algorithms
algorithms cosine-similarity damerau-levenshtein distance hamming-distance jaccard-similarity jaro-winkler-distance javascript levenshtein-distance similarity-measures smith-waterman sorensen-dice-distance string-comparison string-distance trigrams
Last synced: 19 Mar 2025
https://github.com/mehrandvd/simila
A project for string similarities.
c-sharp string-distance string-matching string-similarity
Last synced: 14 May 2025
https://github.com/nkkarpov/editdistancek
LMS algorithm for computing edit distance with SIMD optimizations
levenshtein levenshtein-distance string-distance
Last synced: 07 Apr 2026
https://github.com/andrewjsaid/levenshtypo
A fuzzy string dictionary based on Levenshtein automata
dotnet edit-distance fuzzy-string fuzzy-string-matching levenshtein levenshtein-automata levenshtein-string-distance optimal-string-alignment restricted-edit string-distance string-matching
Last synced: 14 Jan 2026
https://github.com/dedupeio/highered
CRF Edit Distance
conditional-random-fields edit-distance python string-distance
Last synced: 12 Dec 2025
https://github.com/simonschoelly/informationdistances.jl
A small Julia library for calculating the normalized compression distance.
compression hacktoberfest information-distance kolmogorov-complexity normalized-compression-distance string-distance
Last synced: 19 Jun 2025
https://github.com/alex-werner/khal
Utils for node project
clone date file geolocation helpers levenshtein-distance math misc nodejs regex sort string-distance utils
Last synced: 14 Sep 2025
https://github.com/sandinmyjoints/equivalency
Declaratively define rules for string equivalence so you can focus on the differences that matter.
comparison javascript string-distance strings
Last synced: 20 Mar 2025
https://github.com/mtingers/hashfuzz
Detects similarities between strings & generates similarity hash
difflib fuzzymatch hash levenshtein-distance python sequencematcher string-distance
Last synced: 15 Mar 2025
https://github.com/elara6331/pak
This repository is a mirror. Do not post issues or PRs here.
golang package-management string-distance
Last synced: 03 Apr 2025
https://github.com/t-ski/string-similarity-algorithms
Common string similarity algorithm implementations.
nlp python string-distance string-similarity
Last synced: 21 Apr 2026
https://github.com/lqdc/pysimstr
Fast(ish) string similarity for one vs many comparisons.
string-distance string-matching string-search string-similarity
Last synced: 25 Jan 2026
https://github.com/agricolamz/2017_andan_course
Course for ANDAN Summer School about strings and texts in R
crawler language-detection r regular-expressions rstats string-distance string-manipulation strings teaching teaching-materials text-analysis tf-idf tidytext
Last synced: 14 Jun 2025
https://github.com/johnny-morrice/anagram
Fun anagram ranking tool for golang
anagrams golang string-distance
Last synced: 13 Oct 2025
https://github.com/dev-ahmadbilal/string-master
A comprehensive JS/TS library with 18 specialized classes for string manipulation, conversion, validation, and more. Streamline your development with powerful, all-in-one solutions.
inflection javascript slugify string-case string-comparison string-compression string-distance string-interpolation string-manipulation string-matching string-methods string-search string-similarity string-transformations string-utilities string-validation typescript
Last synced: 10 Jul 2025
https://github.com/henrik9999/string-similarity
Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.
dice-coefficient php php8 string string-comparison string-distance string-distance-calculation string-similarity strings
Last synced: 20 Oct 2025
https://github.com/ac000/libac
A C library of miscellaneous utility functions
c data-structures freebsd geospatial json linux network-programming string-distance time
Last synced: 21 Sep 2025
https://github.com/sfischer13/python-stringmetric
🐍 Python implementations of common string distance and similarity algorithms
hamming-distance levenshtein-distance library python python-3 string-distance string-metrics
Last synced: 03 Oct 2025
https://github.com/calebwin/quill
A high-level API for computing edit distance
Last synced: 09 Sep 2025
https://github.com/cicirello/jpt-examples
Example programs for the JavaPermutationTools (JPT) library
distance permutation-distance permutations sequences string-distance
Last synced: 07 May 2025
https://github.com/elazzabi/fuzzy-string-matching
Get the degree of resemblance between two strings
resemblance string-distance string-matching
Last synced: 26 Mar 2025
https://github.com/seehuhn/go-levenshtein
Compute the Levenshtein distance between two strings in Go
Last synced: 22 Jan 2026
https://github.com/jhermsmeier/sift.c
SIFT string distance algorithm
sift sift-algorithm string-distance
Last synced: 14 Sep 2025