Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/j535d165/recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
- Host: GitHub
- URL: https://github.com/j535d165/recordlinkage-annotator
- Owner: J535D165
- Created: 2019-11-02T08:36:13.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2023-06-23T18:07:48.000Z (over 1 year ago)
- Last Synced: 2024-06-11T21:25:42.429Z (8 months ago)
- Topics: annotation-tool, data-matching, deduplication, entity-resolution, labeling-tool, machine-learning, record-linkage
- Language: JavaScript
- Homepage:
- Size: 3.49 MB
- Stars: 41
- Watchers: 3
- Forks: 8
- Open Issues: 19
Metadata Files:
- Readme: README.md
# RecordLinkage ANNOTATOR
*RecordLinkage ANNOTATOR* is a browser-based user interface for
manual labeling of record pairs. Manually labeled or annotated data is useful
for training and validating models. The application provides the annotator with a clean and intuitive interface. The annotator labels each record pair as "Match" or "Distinct". The output can be exported and used for training and validation. This application is part of the
[Python Record Linkage Toolkit](https://github.com/J535D165/recordlinkage), but can be used as a standalone tool.

:arrow_right: :arrow_right: Check out the [hosted version of *RecordLinkage ANNOTATOR*](https://j535d165.github.io/recordlinkage-annotator/) :arrow_left: :arrow_left:

Want to give it a try? Download our [**Prison book**](https://raw.githubusercontent.com/J535D165/recordlinkage-annotator/master/examples/annotation_dishonesty.json) example file :cop: :suspect:

[![Review screen of RecordLinkage ANNOTATOR](images/annotator_review.png)](https://j535d165.github.io/recordlinkage-annotator/)

The hosted version of [*RecordLinkage ANNOTATOR*](https://j535d165.github.io/recordlinkage-annotator/) on [GitHub Pages](https://pages.github.com/) uses client-side JavaScript
only. The tool doesn't upload data. Not convinced about the safety of your data? That's
fine. You can deploy the code yourself locally. Please read the [Development](#development) instructions.

## Create annotation file
*RecordLinkage ANNOTATOR* requires an annotation file as input. This JSON file contains your record pairs and defines a comparison schema. The [Python Record Linkage Toolkit](https://github.com/J535D165/recordlinkage) can be used to render such a file. For more information, see the documentation on [Manual Labeling](https://recordlinkage.readthedocs.io/en/latest/annotation.html).
Version 1 of the schema is open source and can be found [here](/schema). Examples of annotation files can be found in the [examples](/examples) folder.
## Extract results
*RecordLinkage ANNOTATOR* exports the results of the annotation in an annotation file with the same structure as the input annotation file. This makes it simple to review the annotation or continue labeling the data. The [Python Record Linkage Toolkit](https://github.com/J535D165/recordlinkage) can be used to read the annotation file and extract the links and distinct pairs. For more information, see the documentation on [Manual Labeling](https://recordlinkage.readthedocs.io/en/latest/annotation.html).
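
For a quick look at an exported file without installing the toolkit, a dependency-free sketch like the one below can tally the labels. It rests on assumptions this README does not confirm: that the file has a top-level `pairs` list and that each pair carries an optional `label`, with `1` meaning match and `0` meaning distinct. The sample data and the helper `summarize_labels` are hypothetical; check the [schema](/schema) before relying on this layout.

```python
import json
from collections import Counter

# Hypothetical export, trimmed to the keys this sketch reads.
# Assumed layout: a top-level "pairs" list whose items carry an optional
# "label" (1 = match, 0 = distinct); verify against the published schema.
SAMPLE_EXPORT = """
{
  "version": 1,
  "pairs": [
    {"identifiers": {"a": {"record": "rec-0"}, "b": {"record": "rec-1"}}, "label": 1},
    {"identifiers": {"a": {"record": "rec-0"}, "b": {"record": "rec-2"}}, "label": 0},
    {"identifiers": {"a": {"record": "rec-1"}, "b": {"record": "rec-2"}}}
  ]
}
"""


def summarize_labels(annotation: dict) -> Counter:
    """Count match, distinct, and unlabeled pairs in a parsed annotation file."""
    names = {"1": "match", "0": "distinct"}
    counts = Counter()
    for pair in annotation.get("pairs", []):
        label = pair.get("label")
        # Labels are compared as strings so that 1 and "1" are treated alike.
        key = "unlabeled" if label is None else names.get(str(label), "other")
        counts[key] += 1
    return counts


summary = summarize_labels(json.loads(SAMPLE_EXPORT))
print(dict(summary))
```

To process a real export, replace `json.loads(SAMPLE_EXPORT)` with `json.load` on the opened file. For extracting the actual links and distinct pairs, the toolkit's own reader described in the Manual Labeling documentation is the more reliable route.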
## Development
*RecordLinkage ANNOTATOR* is a [React](https://reactjs.org/) application. You can develop or deploy the project locally with `npm start`.
## License
BSD 3-Clause License