https://github.com/ianozsvald/learning_text_transformer_demo
Demo code for learning_text_transformer
- Host: GitHub
- URL: https://github.com/ianozsvald/learning_text_transformer_demo
- Owner: ianozsvald
- License: mit
- Created: 2015-01-18T18:54:18.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2015-02-22T22:00:15.000Z (over 9 years ago)
- Last Synced: 2024-08-01T15:15:02.136Z (3 months ago)
- Language: Python
- Size: 193 KB
- Stars: 25
- Watchers: 7
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
# learning_text_transformer_demo
Demo code for http://annotate.io/, a "self-learning text transformer". This project learns a mapping from "data you have" to "data you want" from a set of examples you provide. You can then send in new data, which is cleaned according to the rules that were learned.
Status: prototype (the example works and you can post your own simple examples)
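
To make the idea of learning a mapping from examples concrete, here is a minimal local sketch. It is not the annotate.io implementation or its API: the `TRANSFORMS` table, `apply_sequence` and `learn` are hypothetical names invented for illustration, and the brute-force search is just one simple way such a mapping could be found.

```python
from itertools import product

# Hypothetical candidate transforms (an assumption for this sketch;
# the real service's transform set is not described in this README).
TRANSFORMS = {
    "strip": str.strip,
    "lower": str.lower,
    "remove_ltd": lambda s: s.replace("Ltd", ""),
}


def apply_sequence(names, text):
    """Apply the named transforms to text, in order."""
    for name in names:
        text = TRANSFORMS[name](text)
    return text


def learn(examples, max_len=3):
    """Return the shortest sequence of transform names that maps every
    'data you have' string to its 'data you want' string, or None."""
    for length in range(1, max_len + 1):
        for names in product(TRANSFORMS, repeat=length):
            if all(apply_sequence(names, have) == want
                   for have, want in examples):
                return names
    return None


# Pairs of ("data you have", "data you want").
examples = [("  Acme Ltd ", "acme"), ("Widgets Ltd", "widgets")]
rule = learn(examples)

# Clean new data with the learned rule.
cleaned = apply_sequence(rule, " Foo Bar Ltd ")
print(rule, repr(cleaned))
```

The search tries shorter sequences first, so the learned rule is the simplest one consistent with all examples. If no sequence up to `max_len` fits, `learn` returns `None`.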
Getting started
---------------

You need Python 2.7 or 3.4 along with the `requests` module. You can install `requests` using:

    $ pip install requests

To run the demo:

    $ python annotateio_demo.py

This demo is self-contained but needs an internet connection. It posts only the examples contained in the file; no other data (nothing else from your machine) is sent.

More information
----------------

Write-ups of some of the goals, including slides, are available here:

* http://ianozsvald.com/2015/02/08/lightning-talk-at-pydatalondon-for-annotate/
* http://ianozsvald.com/2015/01/10/a-first-approach-to-automatic-text-data-cleaning/
* http://ianozsvald.com/2015/01/27/annotate-io-self-learning-text-cleaner-demo-online/