Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/deanmalmgren/textract
extract text from any document. no muss. no fuss.
https://github.com/deanmalmgren/textract
data-mining natural-language-processing python text-mining
Last synced: 3 days ago
JSON representation
extract text from any document. no muss. no fuss.
- Host: GitHub
- URL: https://github.com/deanmalmgren/textract
- Owner: deanmalmgren
- License: mit
- Created: 2014-07-03T20:36:59.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2024-10-21T14:04:15.000Z (about 2 months ago)
- Last Synced: 2024-10-29T11:20:36.187Z (about 1 month ago)
- Topics: data-mining, natural-language-processing, python, text-mining
- Language: HTML
- Homepage: http://textract.readthedocs.io
- Size: 4.32 MB
- Stars: 3,896
- Watchers: 83
- Forks: 601
- Open Issues: 143
-
Metadata Files:
- Readme: README.rst
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-projects - textract - extract text from any document. no muss. no fuss. (HTML)
- best-of-python - GitHub - 50% open · ⏱️ 10.03.2024): (Data Loading & Extraction)
- project-awesome - deanmalmgren/textract - extract text from any document. no muss. no fuss. (HTML)
- awesome-python-resources - GitHub - 39% open · ⏱️ 10.03.2022): (网络)
- best-of-awesome - textract
- starred-awesome - textract - extract text from any document. no muss. no fuss. (HTML)
- awesome-python-machine-learning-resources - GitHub - 39% open · ⏱️ 10.03.2022): (数据读写与提取)
README
.. NOTES FOR CREATING A RELEASE:
..
.. * bumpversion {major|minor|patch}
.. * git push && git push --tags
.. * twine upload -r textract dist/*
.. * convert into release https://github.com/deanmalmgren/textract/releasestextract
========Extract text from any document. No muss. No fuss.
`Full documentation `__.
Originally written by @deanmalmgren. Maintained by the good people at
@jazzband |Jazz Band||Build Status| |Version| |Downloads| |Test Coverage| |Documentation Status|
|Updates| |Stars| |Forks|.. |Jazz Band| image:: https://jazzband.co/static/img/badge.svg
:target: https://jazzband.co/
:alt: Jazzband.. |Build Status| image:: https://travis-ci.org/deanmalmgren/textract.svg?branch=master
:target: https://travis-ci.org/deanmalmgren/textract.. |Version| image:: https://img.shields.io/pypi/v/textract.svg
:target: https://warehouse.python.org/project/textract/.. |Downloads| image:: https://img.shields.io/pypi/dm/textract.svg
:target: https://warehouse.python.org/project/textract/.. |Test Coverage| image:: https://coveralls.io/repos/github/deanmalmgren/textract/badge.svg?branch=master
:target: https://coveralls.io/github/deanmalmgren/textract?branch=master.. |Documentation Status| image:: https://readthedocs.org/projects/textract/badge/?version=latest
:target: https://readthedocs.org/projects/textract/?badge=latest.. |Updates| image:: https://pyup.io/repos/github/deanmalmgren/textract/shield.svg
:target: https://pyup.io/repos/github/deanmalmgren/textract/.. |Stars| image:: https://img.shields.io/github/stars/deanmalmgren/textract.svg
:target: https://github.com/deanmalmgren/textract/stargazers.. |Forks| image:: https://img.shields.io/github/forks/deanmalmgren/textract.svg
:target: https://github.com/deanmalmgren/textract/network