Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pprzetacznik/nlp_wdsjn
The lecture is a continuation of the lecture "Natural Language Processing." It is an introduction to the issues of describing semantics and shows the importance of the semantic motivation of natural language processing algorithms.
https://github.com/pprzetacznik/nlp_wdsjn
Last synced: 26 days ago
JSON representation
The lecture is a continuation of the lecture "Natural Language Processing." It is an introduction to the issues of describing semantics and shows the importance of the semantic motivation of natural language processing algorithms.
- Host: GitHub
- URL: https://github.com/pprzetacznik/nlp_wdsjn
- Owner: pprzetacznik
- Created: 2015-10-30T10:55:43.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2016-01-20T00:20:25.000Z (almost 9 years ago)
- Last Synced: 2024-10-21T22:52:11.551Z (2 months ago)
- Language: HTML
- Size: 1000 KB
- Stars: 2
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
WdSJN
================The lecture is a continuation of the lecture "Natural Language Processing." It is an introduction to the issues of describing semantics and shows the importance of the semantic motivation of natural language processing algorithms.
Run:
```Bash
$ git clone https://github.com/pprzetacznik/WdSJN.git
$ cd WdSJN
$ mkvirtualenv WdSJN
(WdSJN) $ pip install -r requirements.txt
```## Draw graphs
Run:
```Bash
(WdSJN) $ python draw_graph.py
```## Create snippets
Download dataset from from: http://www.nkjp.pl or use \*.html files from `data` directory.
Run:
```Bash
(WdSJN) $ python parse_notes.py data/bialy.html | uniq | shuf | head -n 100
(WdSJN) $ python parse_notes.py data/bialy.html | uniq | shuf | head -n 100 > data/bialy.snippets.txt
```## Create couple from graph and snippets
```Bash
(WdSJN) $ python -m snippet_coupler data/godlo.snippets.txt > "data/godlo.coupled.txt"
(WdSJN) $ python -m snippet_coupler data/orzel.snippets.txt > "data/orzel.coupled.txt"
(WdSJN) $ python -m snippet_coupler data/ptak.snippets.txt > "data/ptak.coupled.txt"
(WdSJN) $ python -m snippet_coupler data/bialy.snippets.txt > "data/bialy.coupled.txt"
```