https://github.com/walkr/texttractor
A lightweight library for extracting main text from webpages
https://github.com/walkr/texttractor
Last synced: 2 months ago
JSON representation
A lightweight library for extracting main text from webpages
- Host: GitHub
- URL: https://github.com/walkr/texttractor
- Owner: walkr
- License: mit
- Created: 2016-02-18T19:51:38.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2016-02-18T20:26:43.000Z (over 9 years ago)
- Last Synced: 2023-08-03T21:23:17.081Z (almost 2 years ago)
- Language: HTML
- Homepage:
- Size: 36.1 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
TextTractor [[](https://travis-ci.org/walkr/texttractor)]
============
Extract main text from webpages.
> work in progress ...