Projects in Awesome Lists by dedupeio
A curated list of projects in awesome lists by dedupeio .
https://github.com/dedupeio/dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
clustering datamade de-duplicating dedupe dedupe-library entity-resolution python python-library record-linkage
Last synced: 18 Dec 2025
https://github.com/dedupeio/csvdedupe
:id: Command line tool for deduplicating CSV files
cli csv-files dedupe entity-resolution record-linkage
Last synced: 13 Apr 2025
https://github.com/dedupeio/dedupe-examples
:id: Examples for using the dedupe library
dedupe entity-resolution python record-linkage
Last synced: 18 Dec 2025
https://github.com/dedupeio/address-matching
Python script for matching a list of messy addresses against a gazetteer using dedupe.
Last synced: 15 Apr 2025
https://github.com/dedupeio/affinegap
:triangular_ruler: A Cython implementation of the affine gap string distance
cython levenshtein-distance python string-distance
Last synced: 11 Mar 2026
https://github.com/dedupeio/dedupe-geocoder
:round_pushpin: Demonstration of how dedupe might be used as geocoder
Last synced: 15 Apr 2025
https://github.com/dedupeio/doublemetaphone
:sound: Python wrapper for a C++ Double Metaphone
double-metaphone python string-matching
Last synced: 12 Dec 2025
https://github.com/dedupeio/fuzzycategory
:triangular_ruler: Fuzzy Categorical Distances
Last synced: 15 Apr 2025
https://github.com/dedupeio/dedupe-variable-person
Dedupe variable for person names. just people. no companies.
Last synced: 25 Feb 2026
https://github.com/dedupeio/dedupe-variable-address
Address Variable Type for dedupe
Last synced: 15 Apr 2025
https://github.com/dedupeio/dedupe-variable-name
name variable type for dedupe
Last synced: 15 Apr 2025
https://github.com/dedupeio/dedupeio-web-api-docs
Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.
Last synced: 05 Mar 2026
https://github.com/dedupeio/soft-tfidf
Mispelling tolerant tf-idf similarity metric
Last synced: 06 Mar 2025
https://github.com/dedupeio/highered
CRF Edit Distance
conditional-random-fields edit-distance python string-distance
Last synced: 12 Dec 2025
https://github.com/dedupeio/dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupe
Last synced: 07 Jul 2025
https://github.com/dedupeio/dedupe-variable-datetime
DateTime variable for dedupe
Last synced: 09 Oct 2025
https://github.com/dedupeio/categorical-distance
:triangular_ruler: Compare categorical variables
Last synced: 11 Mar 2026
https://github.com/dedupeio/dedupe-variable-fuzzycategory
Dedupe Variable for Fuzzy Categories
Last synced: 27 Aug 2025
https://github.com/dedupeio/simplecosine
:triangular_ruler: simple cosine distance
Last synced: 12 Dec 2025
https://github.com/dedupeio/dedupe-variable-number
Try to cast strings to numbers, then compare
Last synced: 15 Apr 2025
https://github.com/dedupeio/parseratorvariable
Base class for dedupe variables for parsed fields
Last synced: 15 Apr 2025
https://github.com/dedupeio/dedupe-variable-ilcs
Dedupe variable for Illinois Compiled Statute (ILCS) codes
Last synced: 02 Mar 2026