# Awesome

A list of some interesting repositories and tools.

https://github.com/lin-zone/awesome

## Scrapy Distributed

* [crawlab](https://github.com/crawlab-team/crawlab) - Distributed crawler management platform based on Golang
* [Gerapy](https://github.com/Gerapy/Gerapy) - Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
* [scrapydweb](https://github.com/my8100/scrapydweb) - ScrapydWeb: Web app for Scrapyd cluster management
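
Gerapy and scrapydweb are front ends for Scrapyd, so the work they automate ultimately goes through Scrapyd's JSON API. Below is a minimal sketch of scheduling a run on a local Scrapyd instance, assuming a project has already been deployed; the host, project name and spider name are placeholders:

```python
import requests

# Ask a Scrapyd server (default port 6800) to run a spider from a deployed project.
# "myproject" and "example" are placeholder names.
resp = requests.post(
    "http://localhost:6800/schedule.json",
    data={"project": "myproject", "spider": "example"},
)
print(resp.json())  # e.g. {"status": "ok", "jobid": "..."}
```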

## Scrapy Middleware

* [scrapy-mongodb](https://github.com/sebdah/scrapy-mongodb) - MongoDB pipeline for Scrapy
* [scrapy-splitvariants](https://github.com/scrapy-plugins/scrapy-splitvariants) - Scrapy spider middleware to split an item into multiple items using a multi-valued key
* [scrapy-proxies](https://github.com/aivarsk/scrapy-proxies) - Random proxy middleware for Scrapy
* [scrapy-fake-useragent](https://github.com/alecxe/scrapy-fake-useragent) - Random User-Agent middleware based on fake-useragent
* [scrapy-selenium](https://github.com/clemfromspace/scrapy-selenium) - Scrapy middleware to handle JavaScript pages using Selenium
* [scrapy-crawlera](https://github.com/scrapy-plugins/scrapy-crawlera) - Crawlera middleware for Scrapy
* [crawlera](https://scrapinghub.com/crawlera) - The World's Smartest Proxy Network
* [scrapy-deltafetch](https://github.com/scrapy-plugins/scrapy-deltafetch) - Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls
* [scrapy-random-useragent](https://github.com/cnu/scrapy-random-useragent) - Scrapy Middleware to set a random User-Agent for every Request.
* [scrapy-crawl-once](https://github.com/TeamHG-Memex/scrapy-crawl-once) - Scrapy middleware that allows crawling only new content
* [scrapy-magicfields](https://github.com/scrapy-plugins/scrapy-magicfields) - Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.
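
These plugins are all wired in the same way: through the middleware dictionaries in a project's `settings.py`. A minimal sketch assuming scrapy-fake-useragent and scrapy-deltafetch are installed; the class paths, priorities and option names follow each project's README and may differ between versions:

```python
# settings.py -- enable third-party middlewares by class path and priority.

DOWNLOADER_MIDDLEWARES = {
    # Disable Scrapy's built-in user-agent middleware so the plugin can rotate agents.
    "scrapy.downloadermiddlewares.useragent.UserAgentMiddleware": None,
    "scrapy_fake_useragent.middleware.RandomUserAgentMiddleware": 400,
}

SPIDER_MIDDLEWARES = {
    # Skip requests for pages whose items were already seen in a previous crawl.
    "scrapy_deltafetch.DeltaFetch": 100,
}

DELTAFETCH_ENABLED = True
```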

## Command Line

* [cleo](https://github.com/sdispater/cleo) - Cleo allows you to create beautiful and testable command-line interfaces.
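
A minimal sketch of a cleo command, based on the docstring-style command definition used in older cleo releases (newer releases changed the API, so check the project's README for the current form):

```python
from cleo import Application, Command


class GreetCommand(Command):
    """
    Greets someone

    greet
        {name? : Who do you want to greet?}
        {--y|yell : If set, yell the greeting in uppercase}
    """

    def handle(self):
        # Arguments and options are declared in the docstring above.
        name = self.argument("name")
        text = "Hello {}".format(name) if name else "Hello"
        if self.option("yell"):
            text = text.upper()
        self.line(text)


app = Application()
app.add(GreetCommand())

if __name__ == "__main__":
    app.run()
```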

## HTML Parser

* [scrapely](https://github.com/scrapy/scrapely) - A pure-Python HTML screen-scraping library
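
scrapely learns an extraction template from an annotated example page instead of hand-written CSS/XPath selectors. A minimal sketch following the usage in its README; the URLs and field values below are placeholders for a real training page:

```python
from scrapely import Scraper

scraper = Scraper()

# Train on one example page by listing values that appear on it.
scraper.train("http://example.com/products/1",          # placeholder URL
              {"name": "Example product", "price": "9.99"})

# Scrape a similar page; scrapely applies the learned template.
print(scraper.scrape("http://example.com/products/2"))  # placeholder URL
```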

## Crawler

* [Douyin-Bot](https://github.com/wangshub/Douyin-Bot) - A Python Douyin (TikTok) bot; on how to find pretty girls on Douyin
* [SinaSpider](https://github.com/LiuXingMing/SinaSpider) - Sina Weibo crawler (Scrapy, Redis)
* [ECommerceCrawlers](https://github.com/DropsDevopsOrg/ECommerceCrawlers) - Hands-on data crawlers for a variety of websites and e-commerce platforms
* [examples-of-web-crawlers](https://github.com/shengqiangzhang/examples-of-web-crawlers) - Interesting, beginner-friendly Python crawler examples, mainly targeting Taobao, Tmall, WeChat, Douban, QQ and other sites
* [PythonCrawler](https://github.com/yhangf/PythonCrawler) - A collection of crawler projects written in Python
* [amemv-crawler](https://github.com/loadchange/amemv-crawler) - Download videos from specified Douyin accounts; a Douyin crawler
* [course-crawler](https://github.com/Foair/course-crawler) - MOOC course downloader for China University MOOC, XuetangX, NetEase Cloud Classroom, CNMOOC and iCourse
* [zhihu_crawler](https://github.com/SmileXie/zhihu_crawler) - Crawler of zhihu.com
* [awesome-spider](https://github.com/facert/awesome-spider) - A collection of web crawlers
* [python-spider](https://github.com/Jack-Cherish/python-spider) - Python 3 web crawlers in practice
* [Anti-Anti-Spider](https://github.com/luyishisi/Anti-Anti-Spider) - Techniques for dealing with anti-crawling measures
* [FunpySpiderSearchEngine](https://github.com/mtianyan/FunpySpiderSearchEngine) - Search engine combining Scrapy 1.6.0 for crawling, Elasticsearch 6.8.0 and Django 2.2
* [spider163](https://github.com/chengyumeng/spider163) - Scrapes hot comments from NetEase Cloud Music
* [Python-Spider](https://github.com/lb2281075105/Python-Spider) - Python crawlers
* [ScrapyProject](https://github.com/cuanboy/ScrapyProject) - A collection of hands-on Scrapy projects

## Tools

* [qrcode](https://github.com/sylnsfar/qrcode) - Artistic QR code generator written in Python
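
A minimal usage sketch, assuming the generator is installed as the MyQR package described in the project's README (the encoded text and output file name are placeholders):

```python
from MyQR import myqr

# Generate a QR code for the given text; run() also accepts a background picture
# and colorized/contrast/brightness options for the artistic variants.
version, level, qr_name = myqr.run(
    words="https://github.com/lin-zone/awesome",
    save_name="awesome_qr.png",  # placeholder output file name
)
print(qr_name)  # path of the generated image
```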

## Utils

* [queuelib](https://github.com/scrapy/queuelib) - Collection of persistent (disk-based) queues
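
A minimal sketch of queuelib's disk-backed FIFO queue, following its README; the queue file name is a placeholder and items are raw bytes:

```python
from queuelib import FifoDiskQueue

# Push and pop raw bytes through a queue persisted on disk.
queue = FifoDiskQueue("queuefile")  # placeholder path for the on-disk queue
queue.push(b"first")
queue.push(b"second")
print(queue.pop())  # b"first"
queue.close()
```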