https://github.com/dedupeio/dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupe
https://github.com/dedupeio/dedupe-vowpal
Last synced: 11 months ago
JSON representation
Vowpal Wabbit Active Labeler for Dedupe
- Host: GitHub
- URL: https://github.com/dedupeio/dedupe-vowpal
- Owner: dedupeio
- License: mit
- Created: 2016-09-23T21:24:43.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2020-03-31T20:48:34.000Z (about 6 years ago)
- Last Synced: 2025-03-28T19:53:40.253Z (about 1 year ago)
- Language: Python
- Size: 4.88 KB
- Stars: 4
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupe
This module provides an class for active learning using Vowpal Wabbit.
Part of the [Dedupe.io](https://dedupe.io/) cloud service and open source toolset for de-duplicating and finding fuzzy matches in your data.
# Prerequsites
* [Vowpal Wabbit](https://github.com/JohnLangford/vowpal_wabbit#getting-the-code)
# Install
```bash
pip install https://github.com/datamade/dedupe-vowpal/zipball/master
```
# To use
Instead of using the normal Dedupe classes, you will create a subclass with the ActiveLearner from this module
```python
import dedupe
from vowpal_labeler import VowpalLearner
class Dedupe(dedupe.Dedupe):
ActiveLearner = VowpalLearner
....
```