
# What's Capturer

A tool for capturing pictures from websites such as Sina, LOFTER, Huaban, and so on.

> **If you have any suggestions, or know of other awesome picture websites worth capturing, please let me know!**

## Supported Websites

- [Sina](https://weibo.com/)
- [Lofter](http://www.lofter.com/)
- [Toutiao](https://www.toutiao.com)
- ~~[QQZone](https://qzone.qq.com/)~~: requires captcha verification
- [Huaban](https://huaban.com/)
- ~~[Vmgirls](https://www.vmgirls.com/)~~: the website has been upgraded
- [Fabiaoqing](https://www.fabiaoqing.com/)
- `telegram`
- [NetBian](http://pic.netbian.com/)

## How to use

- Install `python3` and the required libraries
- Update the [Parameters](#parameters) for each website you want to capture
- Run `./capturer`, `main.py`, or the corresponding `***_spider.py` to capture images from
  - `sina`
  - `lofter`
  - `toutiao`
  - `qqzone`
  - `telegram`
  - `netbian`
- Run `huaban/run.py` to capture images from `huaban`
- Run `vmgirls/run.py` to capture images from `vmgirls`
- Run `fabiaoqing/fabiaoqing_spider.py key1 [key2] [key3] ...` to capture images by keyword (see the sketch below)
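As an illustration of the keyword-driven invocation above, here is a minimal sketch of what such an entry point might look like; the function name and the download logic are assumptions for illustration, not the project's actual code.

```python
#!/usr/bin/env python3
"""Hypothetical sketch of a keyword-driven spider entry point (not the real fabiaoqing_spider.py)."""
import sys


def capture_keyword(keyword: str) -> None:
    """Placeholder: search the site for `keyword` and download every matching image."""
    # A real spider would request the search pages here, parse out the image
    # URLs, and save each file under the project's picture directory.
    print(f'capturing images for keyword: {keyword}')


if __name__ == '__main__':
    if len(sys.argv) < 2:
        sys.exit('usage: fabiaoqing_spider.py key1 [key2] [key3] ...')
    for key in sys.argv[1:]:
        capture_keyword(key)
```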

## Notices

Almost all file paths are based on `~/Pictures/python`, where `~` is your home directory.
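For reference, this is how such a path resolves in Python; the per-site subdirectory name below is only an assumption:

```python
import os

# '~' expands to the current user's home directory
base_dir = os.path.expanduser('~/Pictures/python')  # e.g. /home/<user>/Pictures/python
sina_dir = os.path.join(base_dir, 'sina')           # hypothetical per-site subdirectory
```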

## Parameters

### huaban

- `USERNAME`: username of the Huaban account you want to capture
- `ROOT_DIR`: directory where the captured images are stored
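A hedged example of these settings (presumably set near the top of `huaban/run.py`; the values are placeholders):

```python
# Example Huaban settings -- values are placeholders, not real accounts or paths
USERNAME = 'some_huaban_user'          # the Huaban user whose images you want to capture
ROOT_DIR = '~/Pictures/python/huaban'  # where the downloaded images are stored
```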

### Sina

- `uid`: user ID (10 digits) of the Sina Weibo account you want to capture
- `cookies`: your cookies after logging in to Sina Weibo
- `path`: directory where the pictures are saved
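A hedged example of what these values might look like (all placeholders):

```python
# Example Sina settings -- values are placeholders
uid = '1234567890'                # 10-digit Sina Weibo user ID
cookies = 'SUB=...; SUBP=...'     # copied from your browser after logging in
path = '~/Pictures/python/sina'   # where the pictures are saved
```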

### Lofter

- `username`: username of the LOFTER blog you want to capture
- `path`: directory where the pictures are saved; see the `_get_path` function in `lofter_spider.py`
- `query_number`: number of blog posts requested per query packet; the default is 40
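A hedged example of these settings (placeholder values; the real save path is built by `_get_path`):

```python
# Example LOFTER settings -- values are placeholders
username = 'some_lofter_blog'      # the blog to capture, i.e. <username>.lofter.com
path = '~/Pictures/python/lofter'  # base directory; see _get_path() for the final layout
query_number = 40                  # blog posts fetched per query packet (default)
```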

### Telegram

- `api_id`: your Telegram API ID
- `api_hash`: your Telegram API hash
- `socks proxy`: proxy IP and port; the default is `localhost:1080`
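For illustration only: if the spider were built on the Telethon library (an assumption, along with PySocks for the proxy), the three settings could be wired up roughly like this:

```python
# Hypothetical wiring of the Telegram settings -- assumes Telethon and PySocks
import socks
from telethon import TelegramClient

api_id = 123456                                 # placeholder API ID
api_hash = '0123456789abcdef0123456789abcdef'   # placeholder API hash
proxy = (socks.SOCKS5, 'localhost', 1080)       # default SOCKS proxy from the README

client = TelegramClient('capturer', api_id, api_hash, proxy=proxy)
```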

## Blogs

All of the related blog posts are listed below:

- Lofter - [Scraping NetEase LOFTER images](https://www.litreily.top/2018/03/17/lofter/)
- Sina - [Scraping Sina Weibo user pictures](https://www.litreily.top/2018/04/10/sina/)
- qqzone - [Scraping QQzone photo albums](https://www.litreily.top/2019/03/03/qqzone/)
- Vmgirls - [Scraping vmgirls with Scrapy](https://www.litreily.top/2019/08/09/vmgirls/)
- Netbian - [Scraping pictures from NetBian](https://www.litreily.top/2020/08/09/netbian/)