https://github.com/ikegami-yukino/pytypo
English spelling correction
- Host: GitHub
- URL: https://github.com/ikegami-yukino/pytypo
- Owner: ikegami-yukino
- License: mit
- Created: 2016-03-19T10:19:01.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2017-10-18T09:08:37.000Z (about 7 years ago)
- Last Synced: 2024-10-04T13:18:51.008Z (about 2 months ago)
- Topics: english-word, nlp, spelling-correction, typo
- Language: Python
- Homepage:
- Size: 5.53 MB
- Stars: 8
- Watchers: 4
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.rst
- Changelog: CHANGES.rst
- License: LICENSE
pytypo
===========

|travis| |coveralls| |landscape| |pyversion| |version| |license|

pytypo corrects English spelling mistakes.
This feature is based on TYPO CORPUS (http://luululu.com/tweet/)
and Wikipedia (https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines).

This module also normalizes lengthened English expressions that contain repeated letters
(e.g., it converts "cooooooooooooooollllllllllllll" to "cool").
This feature is based on the following paper:

  Samuel Brody and Nicholas Diakopoulos.
  Cooooooooooooooollllllllllllll!!!!!!!!!!!!!! using word lengthening to detect sentiment in microblogs.
  In EMNLP2011, pp. 562-570, 2011.
  http://aclweb.org/anthology//D/D11/D11-1052.pdf

Contributions are welcome!

Installation
============

::

    $ pip install pytypo

Usage
=====

Import pytypo
--------------------------------------------

::

    >>> import pytypo

correct sentence
--------------------------------------------

::

    >>> pytypo.correct_sentence('you are coooolll!!!')
    you are cool!

- correct_sentence(str)

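
The sentence-level behaviour shown above can be pictured with a small standalone sketch. It is not pytypo's implementation: ``MISSPELLINGS`` and ``correct_sentence_sketch`` are hypothetical names, and the three-entry table is a stand-in for the much larger data the module derives from TYPO CORPUS and Wikipedia. The sketch assumes that correction amounts to a per-word table lookup followed by collapsing repeated ``!`` or ``?`` marks.

```python
import re

# Hypothetical miniature misspelling table, for illustration only.
MISSPELLINGS = {"teh": "the", "recieve": "receive", "definately": "definitely"}


def correct_sentence_sketch(sentence, table=MISSPELLINGS):
    """Replace known misspellings word by word, then collapse '!!!' or '???'."""

    def fix_word(match):
        word = match.group(0)
        # Look the word up case-insensitively; keep it unchanged if unknown.
        return table.get(word.lower(), word)

    # Visit each alphabetic token and leave spaces and punctuation untouched.
    fixed = re.sub(r"[A-Za-z]+", fix_word, sentence)
    # Collapse a run of the same '!' or '?' into a single mark.
    return re.sub(r"([!?])\1+", r"\1", fixed)


print(correct_sentence_sketch("I will recieve teh book!!!"))
```

A lookup table keeps the sketch simple, but it only covers misspellings that were listed in advance. Lengthened words such as "coooolll" need the separate normalization step described in the introduction.
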
correct word
--------------------------------------------

::

    >>> pytypo.correct('okayyyyy')
    okay

- correct(str)

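
One way to think about dictionary-based normalization of a lengthened word is to shorten each run of repeated letters to one or two letters and accept the first candidate found in a vocabulary. The sketch below follows that idea. It is an assumption about the general technique, not pytypo's actual code: ``VOCAB`` and ``normalize_lengthened`` are hypothetical names, and the tiny vocabulary is for illustration only.

```python
from itertools import groupby, product

# Toy vocabulary for illustration only (hypothetical, not pytypo's data).
VOCAB = {"cool", "okay", "good", "you", "are"}


def normalize_lengthened(word, vocab=VOCAB):
    """Return a vocabulary word reachable by shortening letter runs, else the input."""
    # Split the word into runs of identical letters: "coool" -> c, ooo, l.
    runs = [(ch, len(list(group))) for ch, group in groupby(word)]
    # A repeated run may shrink to one or two letters; a single letter stays as is.
    options = [[ch] if count == 1 else [ch, ch * 2] for ch, count in runs]
    for parts in product(*options):
        candidate = "".join(parts)
        if candidate in vocab:
            return candidate
    # No candidate is a known word, so leave the input unchanged.
    return word


print(normalize_lengthened("coooolll"))
print(normalize_lengthened("okayyyyy"))
```

The number of candidates doubles with each repeated run, which is acceptable for single words but is one reason a precomputed normalization table can be preferable.
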
Shorten repeated substrings to a threshold without a dictionary
-------------------------------------------------------------------

::

    >>> pytypo.cut_repeat('mamisaaaaaan', 1)
    mamisan
    >>> pytypo.cut_repeat('okayyyyy', 2)
    okayy

- cut_repeat(str, threshould)

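
The two examples above are consistent with a simple regular-expression approach. The following is a minimal sketch under that assumption, not pytypo's source: ``cut_repeat_sketch`` is a hypothetical name, and it only handles runs of a single repeated character.

```python
import re


def cut_repeat_sketch(text, threshold):
    """Shorten every run of one repeated character to `threshold` characters."""
    if threshold < 1:
        raise ValueError("threshold must be at least 1")
    # (.) captures one character and \1{threshold,} requires at least
    # `threshold` more copies, so only runs longer than the threshold match.
    pattern = re.compile(r"(.)\1{%d,}" % threshold)
    return pattern.sub(lambda m: m.group(1) * threshold, text)


print(cut_repeat_sketch("mamisaaaaaan", 1))
print(cut_repeat_sketch("okayyyyy", 2))
```

Because no dictionary is consulted, a word such as "cooll" passes through unchanged at a threshold of 2.
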
* Note that `cut_repeat()` does not use the lengthened-expression normalization table (e.g., cooll -> cool).
  To normalize such expressions, use `correct()` or `correct_sentence()`.

License
=========

- This module is licensed under the MIT License.


.. |travis| image:: https://travis-ci.org/ikegami-yukino/pytypo.svg?branch=master
    :target: https://travis-ci.org/ikegami-yukino/pytypo
    :alt: travis-ci.org

.. |coveralls| image:: https://coveralls.io/repos/ikegami-yukino/pytypo/badge.svg?branch=master&service=github
    :target: https://coveralls.io/github/ikegami-yukino/pytypo?branch=master
    :alt: coveralls.io

.. |landscape| image:: https://landscape.io/github/ikegami-yukino/pytypo/master/landscape.svg?style=flat
    :target: https://landscape.io/github/ikegami-yukino/pytypo/master
    :alt: Code Health

.. |pyversion| image:: https://img.shields.io/pypi/pyversions/pytypo.svg

.. |version| image:: https://img.shields.io/pypi/v/pytypo.svg
    :target: http://pypi.python.org/pypi/pytypo/
    :alt: latest version

.. |license| image:: https://img.shields.io/pypi/l/pytypo.svg
    :target: http://pypi.python.org/pypi/pytypo/
    :alt: license