https://github.com/stanleylsx/app_comments_spider
Crawls game comments from Baidu Tieba, TapTap, the App Store, and official Weibo accounts (built on redis_scrapy); the filter uses a Bloom filter.
- Host: GitHub
- URL: https://github.com/stanleylsx/app_comments_spider
- Owner: StanleyLsx
- Created: 2018-11-07T11:37:45.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-11-15T06:18:01.000Z (almost 7 years ago)
- Last Synced: 2023-03-04T05:33:44.257Z (over 2 years ago)
- Topics: bloom-filter, comments, redis-scrapy, scrapy, spider
- Language: Python
- Homepage:
- Size: 51.8 KB
- Stars: 47
- Watchers: 2
- Forks: 14
- Open Issues: 1
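
The project description says the crawler's filter uses a Bloom filter. As a rough illustration of that idea only, here is a minimal in-memory sketch of Bloom-filter URL deduplication. It is not taken from the repository: the names `BloomFilter` and `should_crawl`, the bit-array size, the hash count, and the use of SHA-256 are all assumptions made for this example, and a Redis-based crawler would more likely keep the bits in a Redis bitmap than in a local `bytearray`.

```python
import hashlib


class BloomFilter:
    """In-memory Bloom filter (hypothetical stand-in for a Redis-backed one)."""

    def __init__(self, size_bits=1 << 20, num_hashes=5):
        self.size_bits = size_bits          # total number of bits (assumed value)
        self.num_hashes = num_hashes        # hash functions per item (assumed value)
        self.bits = bytearray((size_bits + 7) // 8)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with a seed byte.
        data = item.encode("utf-8")
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(bytes([seed]) + data).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item):
        # Set every bit that the item hashes to.
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True if all bits are set: "probably seen"; False means "definitely new".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def should_crawl(seen, url):
    """Return True the first time a URL is offered, False on repeats."""
    if url in seen:
        return False
    seen.add(url)
    return True
```

A Bloom filter can report a false positive (a new URL treated as already seen) but never a false negative, so a crawler using one may skip a small fraction of pages in exchange for a fixed, small memory footprint.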