Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/antoncoding/html_content_crawler
A smart html crawler to automatically detect main content of a HTML page.
https://github.com/antoncoding/html_content_crawler
beautifulsoup content crawlers html nlp-parsing parser
Last synced: 20 days ago
JSON representation
A smart html crawler to automatically detect main content of a HTML page.
- Host: GitHub
- URL: https://github.com/antoncoding/html_content_crawler
- Owner: antoncoding
- Created: 2018-07-11T12:54:59.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-07-19T16:32:39.000Z (over 6 years ago)
- Last Synced: 2024-10-07T06:06:58.426Z (about 1 month ago)
- Topics: beautifulsoup, content, crawlers, html, nlp-parsing, parser
- Language: Python
- Homepage:
- Size: 12.7 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0