An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with spiders

A curated list of projects in awesome lists tagged with spiders.

https://github.com/sjdirect/abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

abot abot-nuget c-sharp crawler cross-platform csharp csharp-library javascript-renderer netcore netcore2 netcore3 netsta netstandard20 netstandard21 parsing pluggable spider spiders unit-testing web-crawler

Last synced: 13 May 2025

https://github.com/yhangf/pythoncrawler

:heartpulse: A collection of crawler projects written in Python

python3 scripts spiders

Last synced: 15 May 2025

https://github.com/yhangf/PythonCrawler

:heartpulse: A collection of crawler projects written in Python

python3 scripts spiders

Last synced: 14 Mar 2025

https://github.com/FioraLove/Net-Spider

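Beginner project, continuously updated. Feel free to fork and star, and experts are welcome to open issues. (Current updates: 1. Watermark-free short-video parsing (supports more than 30 platforms, e.g., Douyin, Kuaishou, Weibo...) 2. bilibili audio/video parsing 🐱‍👓 3. Zhiyin Manke image encryption 🔥 4. Jiayuan profile information 👨‍👩‍👧‍👦 5. Xiaoji Dictionary 🐥🐥 ...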

pixiv pornhub python-script python3 spiders

Last synced: 28 Mar 2025

https://github.com/lixi5338619/lxbook

Code repository for the book 《爬虫逆向进阶实战》 (roughly, "Advanced Crawler Reverse Engineering in Practice")

android-resever crawler frida java javascript python smali spiders unidbg xposed

Last synced: 13 Apr 2025

https://github.com/scrapinghub/spidermon

Scrapy extension for monitoring spider execution.

crawling hacktoberfest monitoring monitoring-tool scraping scrapinghub spiders testing

Last synced: 14 May 2025

https://github.com/TRHX/Python3-Spider-Practice

Hands-on Python 3 crawler exercises: JS reverse engineering, anti-anti-crawling, CAPTCHA handling, login/check-in/lottery automation, and data visualization.

jsreverse python python3-spider-practice spider spiders

Last synced: 12 May 2025

https://github.com/MatrixSeven/ZhihuSpider

Zhihu crawler: a crawler that can extract follow relationships

java spiders zhihu

Last synced: 27 Mar 2025

https://github.com/hoochanlon/scripts

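Platforms: Windows/Mac/Linux. Scripting languages: various, no restrictions; written in my own style, as needed. Covers: desktop baseline checks, software activation cracking, antivirus evasion and privileged execution, firmware identification and read/write for penetration support, detection of empty host account passwords, Wi-Fi password scanning, cloud host endpoint security hardening, host system log analysis, natural language processing, and data analysis in the humanities and social sciences.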

automation ban-ip clawer linux macos security spiders wi-fi windows-update

Last synced: 04 Apr 2025

https://github.com/zhangyingwei/cockroach

Yet another Java crawler tool/framework that claims high performance

cockroach java java-8 java-spider spider spiders

Last synced: 10 Apr 2025

https://github.com/viveckh/lilhomie

A Machine Learning Project implemented from scratch which involves web scraping, data engineering, exploratory data analysis and machine learning to predict housing prices in New York Tri-State Area.

data-engineering eda housing-price-analysis housing-price-prediction machine-learning machine-learning-projects predictions random-forest-regressor scrapy-crawler spiders trulia web-crawler

Last synced: 09 Sep 2025

https://github.com/yaleimeng/free_proxy_pool

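Crawls free proxy IP websites and aggregates the results into your own proxy pool. The key steps are verifying each proxy's availability and anonymity, and removing duplicates.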

proxy proxypool spiders

Last synced: 11 May 2025

https://github.com/andsonder/space.keter.top

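This is sonder's notebook, somewhat useful but not very. “Only by writing constantly can a person avoid being drowned in the sea of people.” You can read the web version at https://space.keter.top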

algorithms deep-learning math spiders

Last synced: 30 Oct 2025

https://github.com/python-world/joble

This platform searches thousands of job boards across different technologies from all over the world.

hacktoberfest hacktoberfest2020 jobsearch jobseeker scraping spiders

Last synced: 24 Jul 2025

https://github.com/budali/articalproject

Some small crawler projects. Stars are welcome.

python scrapy spiders

Last synced: 13 Apr 2025

https://github.com/fooock/robots.txt

:robot: robots.txt as a service. Crawls robots.txt files, downloads and parses them to check rules through an API

antlr4 api crawler crawler-engine docker docker-compose gradle java kotlin makefile postgresql redis redis-stream redis-streams robots-parser robots-txt spiders spring-boot

Last synced: 18 Mar 2025

https://github.com/huangcongqing/spider

Python 3 crawlers (using modules such as request, BeautifulSoup, xpath, re, Selenium, and wordcloud)

bf4 charles lxml python3 python3x re request requests selenium spider spiders xpath

Last synced: 06 May 2025

https://github.com/chenjinyi/php-images-spider

Yet another PHP image crawler; supports crawling Bilibili albums, Artstation, Pixabay, and Bing's daily wallpaper

image php php-cli php-curl php7 php72 spider spiders

Last synced: 09 Apr 2025

https://github.com/harhao/dushecommunity

Uses Fiddler packet capture to analyze the app API of the Dushe (毒舌) film review community. A single-machine Scrapy crawler based on scrapy-redis.

python scrapy scrapy-redis spiders

Last synced: 14 Aug 2025

https://github.com/snowplow/iab-spiders-and-robots-java-client

Java 8+ client library for the IAB and ABC International Spiders and Robots list

abc abc-international-spiders bots iab iab-spiders java-library robots spiders

Last synced: 21 Apr 2025

https://github.com/chanran/distributedcrawling

Distributed crawler system

distributed python scrapy spiders

Last synced: 05 Jan 2026

https://github.com/excalibra/scripts

Platform: Windows/Mac/Linux. Scripting: Various, no restrictions; customized as needed. Tasks: Desktop checks, software cracking, AV evasion, privilege execution, firmware read/write, password checks, Wi-Fi scans, cloud security, log analysis, NLP, and data analysis in humanities/social sciences.

automation ban-ip clawer linux macos security spiders update wi-fi windows

Last synced: 13 Apr 2025

https://github.com/croqaz/scrapy-quotes

Simple spiders for scraping popular "quotes"

scrapy spiders

Last synced: 25 Feb 2025

https://github.com/minicloudsky/pythoncode

my python daily study code

django pyecharts python scrapy spiders

Last synced: 14 Mar 2025

https://github.com/excalibra/script-draft

Platform: Windows/Mac/Linux. Scripting: Various, no restrictions; customized as needed. Tasks: Desktop checks, software cracking, AV evasion, privilege execution, firmware read/write, password checks, Wi-Fi scans, cloud security, log analysis, NLP, and data analysis in humanities/social sciences.

automation ban-ip clawer linux macos security spiders update wi-fi windows

Last synced: 26 Mar 2025

https://github.com/croqaz/scrapy-cameras

Toy project for Scrapy

camera scrapy spiders

Last synced: 25 Feb 2025

https://github.com/stormwind99/webslinger

Minecraft mod: Spiders shoot webbing at you - slinging webs from a distance, or when they hit you in melee!

forge forge-mod minecraft minecraft-forge-mod minecraft-mod spiders

Last synced: 30 Mar 2025

https://github.com/arkaniightt/scraper_vagas_bot

This repository contains a script that scans a specific website for job openings. The script collects and organizes information about the opportunities available on that site, making the job search easier for interested candidates. Users can customize the search according to their specific preferences and qualifications.

scraper scrapers scraping-python scraping-websites scrapy selenium selenium-python selenium-webdriver spiders

Last synced: 03 Nov 2025

https://github.com/devliusir/python

Some notes and example cases from learning Python 3.

examples python python-library python3 pythonapplication spiders turtle

Last synced: 12 Mar 2025

https://github.com/asanka-j/post-scraper

Scraper of the blog https://blog.scrapinghub.com/, implemented in Python with Scrapy

python scrapy spiders

Last synced: 24 Aug 2025

https://github.com/niconoe/arabel

🕷️ Source code for the future Atlas of spiders of Belgium based on the ARABEL database

biodiversity biodiversity-data biodiversity-informatics spiders

Last synced: 03 Oct 2025

https://github.com/gyumeijie/quotesspider

A simpler spider which scrapes http://quotes.toscrape.com, a website that lists quotes from famous authors.

python scrapy scrapy-crawler scrapy-demo spiders

Last synced: 28 Mar 2025

https://github.com/tiagodongo/kriolspider

A guide to arachnids in Cape Verde

cape-verde catalog guide landing-page spiders

Last synced: 09 Apr 2025

https://github.com/thomased/ms_spiderpattern

Colour lure geometry paper

colour science spiders

Last synced: 25 Aug 2025

https://github.com/manu-sh/http_normalizer_parts

http url normalization utilities for web crawlers

http-url library normalization spiders web-crawling web-scraping

Last synced: 05 Aug 2025

https://github.com/excalibra/script

Platform: Windows/Mac/Linux. Scripting: Various, no restrictions; customized as needed. Tasks: Desktop checks, software cracking, AV evasion, privilege execution, firmware read/write, password checks, Wi-Fi scans, cloud security, log analysis, NLP, and data analysis in humanities/social sciences.

automation ban-ip clawer linux macos security spiders update wi-fi windows

Last synced: 17 Mar 2025

https://github.com/palwisha-18/quotes-scraper

Scrape each quote, its author, and its tags from quotes.toscrape.com (Scrapy Tutorial)

python scrapy spiders web-scraping

Last synced: 05 Apr 2025

https://github.com/gr1d99/scripts

Random python scripts

python spiders web-crawler

Last synced: 18 Oct 2025