https://github.com/GeneralNewsExtractor/GeneralNewsExtractor
A general-purpose extractor for the body text of news web pages, Beta version.
- Host: GitHub
- URL: https://github.com/GeneralNewsExtractor/GeneralNewsExtractor
- Owner: GeneralNewsExtractor
- License: gpl-3.0
- Created: 2019-09-08T13:47:38.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2024-06-25T04:13:11.000Z (12 months ago)
- Last Synced: 2024-10-29T15:34:44.261Z (7 months ago)
- Topics: python3, webcrawler, webspider
- Language: Python
- Homepage: https://generalnewsextractor.readthedocs.io/
- Size: 17.4 MB
- Stars: 3,615
- Watchers: 86
- Forks: 528
- Open Issues: 28
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
- my-awesome - GeneralNewsExtractor/GeneralNewsExtractor - 06 star:3.7k fork:0.5k A general-purpose extractor for the body text of news web pages, Beta version. (Python)