Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/proprietary/msoffice-document-tokenizer
Create wordlist dictionary from words/tokens in DOCX and PPTX Microsoft Office OpenXML documents
https://github.com/proprietary/msoffice-document-tokenizer
microsoft-office nlp nlp-parsing python tokenizer
Last synced: 16 days ago
JSON representation
Create wordlist dictionary from words/tokens in DOCX and PPTX Microsoft Office OpenXML documents
- Host: GitHub
- URL: https://github.com/proprietary/msoffice-document-tokenizer
- Owner: proprietary
- License: other
- Created: 2019-12-16T05:09:03.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2019-12-16T05:09:26.000Z (about 5 years ago)
- Last Synced: 2023-03-08T20:38:35.507Z (almost 2 years ago)
- Topics: microsoft-office, nlp, nlp-parsing, python, tokenizer
- Language: Python
- Size: 1000 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0