Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/litreily/capturer
capture pictures from website like sina, lofter, huaban and so on
https://github.com/litreily/capturer
lofter python3 scrapy sina spider telegram
Last synced: about 2 hours ago
JSON representation
capture pictures from website like sina, lofter, huaban and so on
- Host: GitHub
- URL: https://github.com/litreily/capturer
- Owner: Litreily
- License: mit
- Created: 2018-02-05T12:23:56.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2023-05-22T21:39:31.000Z (over 1 year ago)
- Last Synced: 2023-11-07T15:24:01.634Z (about 1 year ago)
- Topics: lofter, python3, scrapy, sina, spider, telegram
- Language: Python
- Homepage:
- Size: 101 KB
- Stars: 106
- Watchers: 4
- Forks: 30
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# What's Capturer
A capture tool used to capture pictures from web like Sina, LOFTER, huaban and so on.
> **If you have any suggestions or awesome websites of pictures want to capture, please let me know!!!**
## Support Websites
- [Sina](https://weibo.com/)
- [Lofter](http://www.lofter.com/)
- [Toutiao](https://www.toutiao.com)
- ~~[QQZone](https://qzone.qq.com/)~~: Need verify Captcha
- [Huaban](https://huaban.com/)
- ~~[Vmgirls](https://www.vmgirls.com/)~~: Website upgraded
- [Fabiaoqing](https://www.fabiaoqing.com/)
- `telegram`
- [NetBian](http://pic.netbian.com/)## How to use
- install `python3` and libs
- update your [Parameters](#parameters) of each kind of web
- run `./capturer` or run `main.py` or `***_spider.py` to capture images from
- `sina`
- `lofter`
- `toutiao`
- `qqzone`
- `telegram`
- `netbian`
- run `huaban/run.py` to capture images from `huaban`
- run `vmgirls/run.py` to capture images from `vmgirls`
- run `fabiaoqing/fabiaoqing_spider.py key1 [key2] [key3] ...`## Notices
Almost all of the file path based on `~/Pictures/python`, `~` means home dir.
## Parameters
### huaban
- `USERNAME`: username of huaban which you want to capture
- `ROOT_DIR`: directories where to store the images### Sina
- `uid`: user-id(10 numbers) of sina weibo that you want to capture
- `cookies`: your cookies after login the sina weibo
- `path`: directory to save the pictures### Lofter
- `username`: username of lofter that you want to capture
- `path`: directory to save the pictures, see the function `_get_path` in `lofter_spider.py`
- `query_number`: number of blogs in each query packet, default value is 40### Telegram
- `api_id`: you can get from
- `api_hash`: you can get from
- `socks proxy`: set proxy ip and port, default is `localhost:1080`## Blogs
You can find all the relate blogs in .
- Lofter - [爬取网易LOFTER图片](https://www.litreily.top/2018/03/17/lofter/)
- Sina - [爬取新浪微博用户图片](https://www.litreily.top/2018/04/10/sina/)
- qqzone - [爬取QQ空间相册](https://www.litreily.top/2019/03/03/qqzone/)
- Vmgirls - [Scrapy爬取vmgirls](https://www.litreily.top/2019/08/09/vmgirls/)
- Netbian - [爬取彼岸图网美图](https://www.litreily.top/2020/08/09/netbian/)