Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
data-matching-software
A list of free data matching and record linkage software.
https://github.com/J535D165/data-matching-software
- AtyImo
- GitHub
- ![GitHub stars
- Dedupe
- csvdedupe
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- dirty-cat
- dirty-cat
- TableVectorizer - cat.github.io/stable/generated/dirty_cat.FeatureAugmenter.html)) are scikit-learn compatible, and easily introduced into ML pipelines.
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- fastLink
- CRAN/METACRAN
- ![CRAN - project.org/web/packages/fastLink/index.html) |
- ![metacran downloads - project.org/package=fastLink) |
- ![GitHub stars
- FEBRL
- FRIL
- [source code
- FuzzyMatcher
- [source code
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- hlink
- [source_code
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- JedAI
- [source code
- GitHub
- ![GitHub stars
- PRIL
- GitHub
- ![GitHub stars - ALPHAnetwork/PIRL_RecordLinkageSoftware) |
- Python Record Linkage Toolkit
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- RecordLinkage (R)
- CRAN/METACRAN
- ![CRAN - project.org/web/packages/RecordLinkage/index.html) |
- ![metacran downloads - project.org/package=RecordLinkage) |
- Reclin2
- CRAN/METACRAN
- ![CRAN - project.org/web/packages/reclin2/index.html) |
- ![metacran downloads - project.org/package=reclin2) |
- ![GitHub stars
- RELAIS
- ReMaDDer
- RLTK
- PyPI - License
- ![PyPI
- PyPI - Downloads
- ![GitHub stars - isi-i2/rltk) |
- Splink
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars - analytical-services/splink) |
- Zingg
- Zingg
- slack community
- PyPI - License
- PyPI - Python Version
- ![PyPI
- PyPI - Downloads
- ![GitHub stars
- The Link King
- CC-BY-SA 3.0
Keywords
entity-resolution
7
record-linkage
6
deduplication
4
dedupe
4
python
3
fuzzy-matching
3
data-matching
3
similarity
2
data-science
2
machine-learning
2
spark
2
python-library
2
csv-files
1
cli
1
probabalistic-matching
1
pypi
1
dedupe-library
1
pyspark
1
blocking
1
de-duplicating
1
entity-matching
1
scalability
1
datamade
1
privacy
1
clustering
1
modern-data-stack
1
ml
1
masterdata
1
identity-resolution
1
identity
1
fuzzymatch
1
etl
1
dataquality
1
datalake
1
dataengineering
1
data-transformations
1
data-transformation
1
analytics-engineering
1
analytics
1
uk-gov-data-science
1
em-algorithm
1
duckdb
1
deduplicate-data
1
string-similarity
1
similarity-metric
1
linkage
1
utrecht-university
1
string-distance
1