Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Yomguithereal/clj-fuzzy
A handy collection of algorithms dealing with fuzzy strings and phonetics.
- Host: GitHub
- URL: https://github.com/Yomguithereal/clj-fuzzy
- Owner: Yomguithereal
- License: mit
- Created: 2014-01-09T13:31:52.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2023-06-19T05:27:27.000Z (over 1 year ago)
- Last Synced: 2024-10-12T01:42:40.353Z (about 1 month ago)
- Language: Clojure
- Homepage: http://yomguithereal.github.io/clj-fuzzy/
- Size: 762 KB
- Stars: 261
- Watchers: 10
- Forks: 27
- Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
README
[![Build Status](https://travis-ci.org/Yomguithereal/clj-fuzzy.svg)](https://travis-ci.org/Yomguithereal/clj-fuzzy)
# clj-fuzzy
clj-fuzzy is a native Clojure library providing a collection of famous algorithms dealing with fuzzy strings and phonetics. It can be used in Clojure, ClojureScript, client-side JavaScript and Node.js.
[![Clojars Project](http://clojars.org/clj-fuzzy/latest-version.svg)](http://clojars.org/clj-fuzzy)
## ~Deprecation warning~
Consider this library deprecated for JavaScript.
The [Talisman](http://yomguithereal.github.io/talisman/) library can be seen as an improvement over `clj-fuzzy` and has the added advantage of being written directly in JavaScript.
## Full documentation
The full documentation for this library is available [there](http://yomguithereal.github.io/clj-fuzzy/).
## Available algorithms
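To give a rough sense of what the metrics in this section compute, here is a minimal, self-contained sketch of Levenshtein distance in plain Clojure. It is an independent illustration and not clj-fuzzy's own implementation; the function name `levenshtein-sketch` is hypothetical and is not part of the library's API. Refer to the full documentation for the actual namespaces and function names.

```clojure
;; Illustrative sketch only: NOT clj-fuzzy's implementation.
;; `levenshtein-sketch` is a hypothetical name chosen for this example.
(defn levenshtein-sketch
  "Returns the number of single-character insertions, deletions and
  substitutions needed to turn string `a` into string `b`.
  Builds the classic dynamic-programming table one row at a time."
  [a b]
  (let [initial-row (vec (range (inc (count b))))]   ; distances from "" to each prefix of b
    (peek
      (reduce
        (fn [prev-row [i ca]]
          (reduce
            (fn [row [j cb]]
              (conj row
                    (min (inc (nth prev-row (inc j)))   ; delete ca
                         (inc (peek row))               ; insert cb
                         (+ (nth prev-row j)            ; substitute, or keep if equal
                            (if (= ca cb) 0 1)))))
            [(inc i)]                                   ; distance from prefix of a to ""
            (map-indexed vector b)))
        initial-row
        (map-indexed vector a)))))

(levenshtein-sketch "kitten" "sitting") ; => 3
(levenshtein-sketch "book" "back")      ; => 2
```

Only two rows of the table are alive at any time, so memory use grows with the length of `b` rather than with the product of both lengths.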
### Metrics
* [Sorensen / Dice coefficient](http://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient)
* [Levenshtein distance](http://en.wikipedia.org/wiki/Levenshtein_distance)
* [Hamming distance](http://en.wikipedia.org/wiki/Hamming_distance)
* [Jaccard / Tanimoto distance](http://en.wikipedia.org/wiki/Jaccard_index)
* [Jaro-Winkler distance](http://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance)
* [MRA comparison](http://en.wikipedia.org/wiki/Match_rating_approach)
* [Tversky index](http://en.wikipedia.org/wiki/Tversky_index)
### Stemmers
* [Lancaster stemmer](http://www.comp.lancs.ac.uk/computing/research/stemming/) (en)
* [Lovins stemmer](http://snowball.tartarus.org/algorithms/lovins/stemmer.html) (en)
* [Porter stemmer](http://tartarus.org/martin/PorterStemmer/index-old.html) (en)
* [Schinke stemmer](http://snowball.tartarus.org/otherapps/schinke/intro.html) (latin)
### Phonetics
* [Metaphone](http://en.wikipedia.org/wiki/Metaphone)
* [Double Metaphone](http://en.wikipedia.org/wiki/Metaphone#Double_Metaphone)
* [Soundex](http://en.wikipedia.org/wiki/Soundex)
* [NYSIIS](http://en.wikipedia.org/wiki/New_York_State_Identification_and_Intelligence_System) (original & refined)
* [Caverphone](http://en.wikipedia.org/wiki/Caverphone) (original & revisited)
* [Cologne Phonetic](http://de.wikipedia.org/wiki/K%C3%B6lner_Phonetik)
* [MRA codex](http://en.wikipedia.org/wiki/Match_rating_approach)
## Contribution
Please feel free to contribute by forking this repo. Just be sure to add relevant unit tests and pass them all before submitting any code.
## License
MIT