https://github.com/gaglia88/ruler

Scalable record-level matching rules
distributed-computing entity-matching entity-resolution similarity-join

# RulER
RulER is a tool for Apache Spark that finds similar records by applying complex join rules on one or more attributes.
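
To make the idea of a record-level matching rule concrete, here is a minimal sketch in plain Python. It is not RulER's API or its algorithm: the rule format, the `jaccard`, `matches`, and `naive_join` helpers, and the toy records are all hypothetical, and the join is a naive all-pairs comparison rather than a scalable Spark implementation. It assumes a rule is a conjunction of per-attribute similarity thresholds and that two records match if any rule in the set is satisfied.

```python
from itertools import combinations


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity between the word sets of two strings."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


# Hypothetical rule format (an assumption, not RulER's syntax):
# each dict is one rule, i.e. a conjunction of "attribute >= threshold"
# predicates; the list as a whole is a disjunction of rules.
RULES = [
    {"title": 0.8},                  # titles are nearly identical
    {"title": 0.5, "authors": 0.5},  # or title and authors both overlap
]


def matches(r1: dict, r2: dict, rules=RULES) -> bool:
    """True if at least one rule has all of its predicates satisfied."""
    return any(
        all(jaccard(r1[attr], r2[attr]) >= thr for attr, thr in rule.items())
        for rule in rules
    )


def naive_join(records, rules=RULES):
    """Compare every pair of records (quadratic; for illustration only)."""
    return [
        (a["id"], b["id"])
        for a, b in combinations(records, 2)
        if matches(a, b, rules)
    ]


# Toy data, invented for this example.
RECORDS = [
    {"id": 1, "title": "fast similarity join on spark", "authors": "rossi bianchi verdi"},
    {"id": 2, "title": "fast similarity join on spark", "authors": "m rossi"},
    {"id": 3, "title": "similarity join on spark at scale", "authors": "rossi bianchi"},
    {"id": 4, "title": "image segmentation with neural networks", "authors": "neri"},
]

if __name__ == "__main__":
    print(naive_join(RECORDS))  # [(1, 2), (1, 3)]
```

Records 1 and 2 match on the first rule (identical titles), while records 1 and 3 match only on the second, which combines weaker evidence from two attributes. Evaluating every pair like this does not scale, which is the problem a distributed tool such as RulER is meant to address.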

---

If you use this library, please cite:

- **Gagliardelli, L., Simonini, G., & Bergamaschi, S. (2020). RulER: Scaling Up Record-level Matching Rules. In EDBT 2020: 23rd International Conference on Extending Database Technology.**

---

A brief presentation of RulER is available by clicking on the image below:
[![RulER presentation video](http://img.youtube.com/vi/ZuIre-WO3lY/0.jpg)](http://www.youtube.com/watch?v=ZuIre-WO3lY)

### Contacts
For any questions about RulER, write to us at [email protected]
* Luca Gagliardelli
* Giovanni Simonini