https://github.com/davidfoerster/schema-matching
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
https://github.com/davidfoerster/schema-matching
data-integration python schema-matching
Last synced: about 1 year ago
JSON representation
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
- Host: GitHub
- URL: https://github.com/davidfoerster/schema-matching
- Owner: davidfoerster
- License: mit
- Created: 2014-08-01T23:01:21.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2019-10-18T16:50:19.000Z (over 6 years ago)
- Last Synced: 2023-04-12T02:41:59.644Z (about 3 years ago)
- Topics: data-integration, python, schema-matching
- Language: Python
- Homepage:
- Size: 271 KB
- Stars: 20
- Watchers: 3
- Forks: 8
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Schema Matching
Match schema attributes by value similarity.
## Usage
Run `./schema-matching --help` to see a usage description.
### Examples
See the shell scripts in `demo`. I suggest that you start with `1-match.sh` for
something simple. The output will be, in this order,
1. the norms between each column pair, i. e. a measure of how different they
are, between 0 (identical) and 1 (completely different),
2. the norm of the most closely matching column mapping, and
3. the most closely matching column mapping, one pair per line.
1\. and 2. require at least one level of verbosity (using `-v` or `--verbose`).
## Pre-requisites
- **Python 3** (tested with v3.6.8)