An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by dedupeio

A curated list of projects in awesome lists by dedupeio .

https://github.com/dedupeio/dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

clustering datamade de-duplicating dedupe dedupe-library entity-resolution python python-library record-linkage

Last synced: 18 Dec 2025

https://github.com/dedupeio/csvdedupe

:id: Command line tool for deduplicating CSV files

cli csv-files dedupe entity-resolution record-linkage

Last synced: 13 Apr 2025

https://github.com/dedupeio/dedupe-examples

:id: Examples for using the dedupe library

dedupe entity-resolution python record-linkage

Last synced: 18 Dec 2025

https://github.com/dedupeio/address-matching

Python script for matching a list of messy addresses against a gazetteer using dedupe.

Last synced: 15 Apr 2025

https://github.com/dedupeio/affinegap

:triangular_ruler: A Cython implementation of the affine gap string distance

cython levenshtein-distance python string-distance

Last synced: 11 Mar 2026

https://github.com/dedupeio/dedupe-geocoder

:round_pushpin: Demonstration of how dedupe might be used as geocoder

Last synced: 15 Apr 2025

https://github.com/dedupeio/doublemetaphone

:sound: Python wrapper for a C++ Double Metaphone

double-metaphone python string-matching

Last synced: 12 Dec 2025

https://github.com/dedupeio/fuzzycategory

:triangular_ruler: Fuzzy Categorical Distances

Last synced: 15 Apr 2025

https://github.com/dedupeio/dedupe-variable-person

Dedupe variable for person names. just people. no companies.

Last synced: 25 Feb 2026

https://github.com/dedupeio/dedupe-variable-address

Address Variable Type for dedupe

dedupe dedupe-variable

Last synced: 15 Apr 2025

https://github.com/dedupeio/dedupe-variable-name

name variable type for dedupe

dedupe dedupe-variable

Last synced: 15 Apr 2025

https://github.com/dedupeio/dedupeio-web-api-docs

Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.

Last synced: 05 Mar 2026

https://github.com/dedupeio/soft-tfidf

Mispelling tolerant tf-idf similarity metric

Last synced: 06 Mar 2025

https://github.com/dedupeio/dedupe-vowpal

Vowpal Wabbit Active Labeler for Dedupe

Last synced: 07 Jul 2025

https://github.com/dedupeio/dedupe-variable-datetime

DateTime variable for dedupe

Last synced: 09 Oct 2025

https://github.com/dedupeio/categorical-distance

:triangular_ruler: Compare categorical variables

Last synced: 11 Mar 2026

https://github.com/dedupeio/dedupe-variable-fuzzycategory

Dedupe Variable for Fuzzy Categories

dedupe dedupe-variable

Last synced: 27 Aug 2025

https://github.com/dedupeio/simplecosine

:triangular_ruler: simple cosine distance

python string-similarity

Last synced: 12 Dec 2025

https://github.com/dedupeio/dedupe-variable-number

Try to cast strings to numbers, then compare

Last synced: 15 Apr 2025

https://github.com/dedupeio/parseratorvariable

Base class for dedupe variables for parsed fields

dedupe dedupe-variable

Last synced: 15 Apr 2025

https://github.com/dedupeio/datetime-distance

 📐 Compare dates and times

Last synced: 29 Jul 2025

https://github.com/dedupeio/dedupe-variable-ilcs

Dedupe variable for Illinois Compiled Statute (ILCS) codes

Last synced: 02 Mar 2026