https://github.com/kensuke-mitsuzawa/word2vec-wikification-py
Disambiguation of wikipedia article name
disambiguation entity-linking nlp python-3 wikipedia wikipedia-database
- Host: GitHub
- URL: https://github.com/kensuke-mitsuzawa/word2vec-wikification-py
- Owner: Kensuke-Mitsuzawa
- Created: 2016-12-02T08:55:22.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2017-03-15T22:10:34.000Z (over 7 years ago)
- Last Synced: 2024-09-23T07:16:51.175Z (about 2 months ago)
- Topics: disambiguation, entity-linking, nlp, python-3, wikipedia, wikipedia-database
- Language: Python
- Size: 31.3 KB
- Stars: 16
- Watchers: 3
- Forks: 1
- Open Issues: 2
Metadata Files:
- Readme: README.md
README
wiki_node_disambiguation
- - -

# What's this?
- You can run "Wikification" as easily as possible.
- According to Wikipedia, [Wikification](https://en.wikipedia.org/wiki/Wikification) is `in computer science, entity linking with Wikipedia as the target knowledge base`
- You get a disambiguated result together with its score.

Please also visit the [GitHub page](https://github.com/Kensuke-Mitsuzawa/word2vec_wikification_py).
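
To picture what a "disambiguated result with its score" can look like, here is a toy sketch of the general idea. It is not this package's API: the `disambiguate` helper, the bracketed entity keys, and the 3-dimensional vectors are all hypothetical. Each candidate article name is scored by the cosine similarity between its vector and the mean vector of the surrounding context words.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean_vector(vectors):
    """Element-wise mean of a list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def disambiguate(candidates, context_words, embeddings):
    """Rank candidate article names by similarity to the context (hypothetical helper)."""
    context = mean_vector([embeddings[w] for w in context_words if w in embeddings])
    scored = [
        (name, cosine(embeddings[name], context))
        for name in candidates
        if name in embeddings
    ]
    # Highest score first: the top entry is the preferred reading.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up vectors for illustration only; a real model has hundreds of dimensions.
TOY_EMBEDDINGS = {
    "[Apple Inc.]": [0.9, 0.1, 0.0],
    "[Apple (fruit)]": [0.1, 0.9, 0.0],
    "iPhone": [0.8, 0.2, 0.1],
    "company": [0.7, 0.1, 0.3],
}

ranking = disambiguate(
    ["[Apple Inc.]", "[Apple (fruit)]"], ["iPhone", "company"], TOY_EMBEDDINGS
)
for name, score in ranking:
    print(f"{name}\t{score:.3f}")
```

With these toy vectors the company reading ranks above the fruit reading, because the context words sit closer to it in the vector space.
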
If you find any bugs, please report them as a GitHub issue.
Pull requests are welcome.

# Requirement
- Python3.x (checked under )
- I recommend using the "Anaconda" distribution.

# Setup
`python setup.py install`
## Get wikipedia entity vector model
Go to [this page](http://www.cl.ecei.tohoku.ac.jp/~m-suzuki/jawiki_vector/) and download the model file from [here](http://www.cl.ecei.tohoku.ac.jp/~m-suzuki/jawiki_vector/entity_vector.tar.bz2).
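
If you would rather fetch and unpack the archive from Python, the following is a minimal sketch using only the standard library. The `download` and `extract` helpers and the local file names are assumptions for illustration; only the URL comes from the link above.

```python
import tarfile
import urllib.request
from pathlib import Path

# URL taken from the link above; everything else here is illustrative.
MODEL_URL = "http://www.cl.ecei.tohoku.ac.jp/~m-suzuki/jawiki_vector/entity_vector.tar.bz2"


def download(url, dest):
    """Download url to dest unless dest already exists; return dest as a Path."""
    dest = Path(dest)
    if not dest.exists():
        urllib.request.urlretrieve(url, str(dest))
    return dest


def extract(archive, target_dir):
    """Unpack a .tar.bz2 archive into target_dir and return the member names."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    # Use the safer "data" extraction filter where this Python version supports it.
    kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
    with tarfile.open(str(archive), "r:bz2") as tar:
        tar.extractall(str(target), **kwargs)
        return tar.getnames()


# Usage (triggers a large network download, so it is left commented out):
# archive = download(MODEL_URL, "entity_vector.tar.bz2")
# print(extract(archive, "."))
```
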
Or run `download_model.sh`.

## For those who use interface.predict_japanese_wiki_names()
You need access to a running MySQL server.
The steps before you can use it:
1. Start a MySQL server somewhere.
2. Download the latest Wikipedia SQL dump files.
3. Initialize the Wikipedia database with MySQL.

To download the Wikipedia dump files, execute the following commands:
```
wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-redirect.sql.gz
wget https://dumps.wikimedia.org/jawiki/latest/jawiki-latest-page.sql.gz
gunzip jawiki-latest-redirect.sql.gz
gunzip jawiki-latest-page.sql.gz
```

To initialize the Wikipedia database with MySQL, run:
```
% mysql -u [user_name] -p[password] -e "CREATE DATABASE wikipedia;"
% mysql -u [user_name] -p[password] wikipedia < jawiki-latest-redirect.sql
% mysql -u [user_name] -p[password] wikipedia < jawiki-latest-page.sql
```

# Change logs
- version0.1
    - released
    - It supports only Japanese Wikipedia