https://github.com/sandersn/nuwen
Code for my senior project on simple error-correcting parsers for natural language
- Host: GitHub
- URL: https://github.com/sandersn/nuwen
- Owner: sandersn
- License: other
- Created: 2023-12-03T17:33:19.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2023-12-03T18:13:49.000Z (12 months ago)
- Last Synced: 2024-10-20T07:30:04.426Z (28 days ago)
- Language: Python
- Size: 397 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.txt
README
The principal executable files are
invisiblehand.py -- Grammar builder
diem.py -- POS trainer and parser
waApp1.py -- Bonus grammar builder GUI (requires wxWindows and wxPython)

waApp1.py is in the very early stages of development (as it doesn't even have a name yet).
It may never become any more advanced, because The Invisible Hand can parse
correctly formatted text files in one go, which is really easier.

The actual work is performed in the following files:
cfg.py -- Extended BNF grammar support
earley.py -- Implementation of Earley's parser (and error-correcting modification)
gram.py -- Statistical word-counting classes
lr.py -- Implementation of LR parsing (with error-correcting FMA)
parse.py -- Interface to parsing
parse_state.py -- Just the parent class for holding parse trees.
postag.py -- Part of speech identification
util.py -- Many small utility functions
web.py -- A few cgi specific utilities
----
french.grammar -- Example of small grammar
frgrammar.txt -- Source for said grammar
french.postag -- *extremely* small POS training corpus.

I have not included the web equivalents to diem.py. I might later.
The names of the executable files are ship names
from Vernor Vinge's novel _A Deepness in the Sky_.

No documentation other than what's in the source so far.
I'm investigating the documentation generator capabilities that come with the
Python distro, and will probably modify that.
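
Until proper documentation exists, here is a rough idea of the technique that
earley.py is listed as implementing. This is a minimal sketch of a plain Earley
recognizer and is not code from this repository: the names Item, GRAMMAR and
recognize, and the toy French-like grammar, are all made up for illustration.
It assumes a grammar with no empty productions and does not attempt the
error-correcting modification.

```python
from collections import namedtuple

# An Earley item: a rule (lhs -> rhs), a dot position inside rhs, and the
# input position where recognition of this rule started.
Item = namedtuple("Item", "lhs rhs dot start")

# Hypothetical toy grammar (NOT french.grammar from the repository).
# Keys are nonterminals; any other symbol is treated as a terminal word.
GRAMMAR = {
    "S": [("NP", "VP")],
    "NP": [("Det", "N")],
    "VP": [("V", "NP"), ("V",)],
    "Det": [("le",), ("la",)],
    "N": [("chat",), ("souris",)],
    "V": [("mange",), ("dort",)],
}


def recognize(words, grammar=GRAMMAR, start="S"):
    """Return True if the list `words` can be derived from `start`."""
    # chart[i] holds the items that are valid after reading i words.
    chart = [[] for _ in range(len(words) + 1)]

    def add(i, item):
        if item not in chart[i]:
            chart[i].append(item)

    for rhs in grammar[start]:
        add(0, Item(start, rhs, 0, 0))

    for i in range(len(words) + 1):
        # chart[i] grows while it is processed, so walk it by index.
        j = 0
        while j < len(chart[i]):
            item = chart[i][j]
            j += 1
            if item.dot < len(item.rhs):
                symbol = item.rhs[item.dot]
                if symbol in grammar:
                    # Predict: expand the nonterminal after the dot.
                    for rhs in grammar[symbol]:
                        add(i, Item(symbol, rhs, 0, i))
                elif i < len(words) and words[i] == symbol:
                    # Scan: the next word matches the terminal after the dot.
                    add(i + 1, item._replace(dot=item.dot + 1))
            else:
                # Complete: advance every item that was waiting for item.lhs.
                for waiting in chart[item.start]:
                    if (waiting.dot < len(waiting.rhs)
                            and waiting.rhs[waiting.dot] == item.lhs):
                        add(i, waiting._replace(dot=waiting.dot + 1))

    # Accept if a finished start rule spans the whole input.
    return any(
        item.lhs == start and item.dot == len(item.rhs) and item.start == 0
        for item in chart[len(words)]
    )
```

Calling recognize("le chat mange la souris".split()) walks the chart one word
at a time. An error-correcting variant would presumably add extra steps that
tolerate a missing or unexpected word, but how earley.py does that is only
documented in its source.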