Projects in Awesome Lists tagged with scrapy-spider
A curated list of projects in awesome lists tagged with scrapy-spider.
https://github.com/TheWebScrapingClub/webscraping-from-0-to-hero
The web scraping open project repository aims to share knowledge and experiences about web scraping with Python
playwright python scrapy scrapy-spider scrapysplash webscraping
Last synced: 14 Mar 2025
https://github.com/thewebscrapingclub/webscraping-from-0-to-hero
The web scraping open project repository aims to share knowledge and experiences about web scraping with Python
playwright python scrapy scrapy-spider scrapysplash webscraping
Last synced: 08 Apr 2025
https://github.com/kezhenxu94/house-renting
Possibly the best practice of Scrapy 🕷 and renting a house 🏡
docker python scrapy scrapy-crawler scrapy-spider scrapyd
Last synced: 21 Nov 2024
https://github.com/lkuffo/web-scraping
More than 50 web scraping examples using: Requests | Scrapy | Selenium | LXML | BeautifulSoup
beautifulsoup beautifulsoup4 lxml-etree scraping scraping-python scraping-websites scrapping-python scrapy scrapy-crawler scrapy-spider selenium selenium-python selenium-webdriver web-scraping webscraping
Last synced: 07 Apr 2025
https://github.com/djm/python-scrapyd-api
A Python wrapper for working with Scrapyd's API.
api-wrapper python scrapy-spider
Last synced: 13 Apr 2025
https://github.com/DiegoCaraballo/Email-extractor
The main functionality is to extract all the emails from one or several URLs
email email-extractor email-marketing emails extraction python scraper scrapers scraping scraping-websites scrapper scrapping scrapy scrapy-spider spyder stractor
Last synced: 21 Nov 2024
https://github.com/accordbox/scrapy-spider-example
Scrapy spider example for Scrapy Tutorial Series
python3 scrapy-spider scrapy-tutorial
Last synced: 12 Apr 2025
https://github.com/mahmudahsan/webscraping
Let's learn web scraping and apply it in real applications.
beautiful-soup beautifulsoup beautifulsoup4 python python3 scrapy scrapy-crawler scrapy-demo scrapy-framework scrapy-spider scrapy-tutorial webscraping
Last synced: 25 Apr 2025
https://github.com/dotnetage/scrapy_plus
An essential toolkit of commonly used utilities for crawling with Scrapy
middlewares python scrapy scrapy-extension scrapy-spider spiders tor
Last synced: 25 Jan 2025
https://github.com/chinesehuazhou/scrapyproject
Scrapy project (MySQL + MongoDB, Douban Top 250 movies)
mongodb mysql python scrapy scrapy-spider
Last synced: 11 Apr 2025
https://github.com/zoranpandovski/prodirectscraper
👔 Web scraper for http://www.prodirectselect.com/ 👞
python scraper scrapy scrapy-crawler scrapy-spider spider webscraper webscraping
Last synced: 10 Dec 2024
https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv
The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce,
data python scraping-websites scrapy-spider woocommerce-products wordpress
Last synced: 13 Apr 2025
https://github.com/activatedgeek/winemag-dataset
Dataset of Wine Reviews from Wine Enthusiast Magazine 🍇 🍷 🌏
crawler dataset python3 scrapy scrapy-spider vega-lite visualization wine wine-tasting
Last synced: 12 Apr 2025
https://github.com/dori-dev/quotes-crawler
Quotes crawler using Scrapy and Python.
crawler crawling python scraping-python scraping-websites scrapy scrapy-crawler scrapy-spider web-scraper
Last synced: 21 Apr 2025
https://github.com/ekaputra07/kodepos
Scraper for Indonesian postal codes (kode POS) based on Python Scrapy. Postal code source: http://kodepos.nomor.net
json kodepos python scrapy-spider
Last synced: 03 Jan 2025
https://github.com/rggh/scrapy14
Scraping News Stories From Multiple Sources
mysql scraping-news-stories scrapy scrapy-spider
Last synced: 07 Dec 2024
https://github.com/nguyenda18/portland-jail-data-crawler
Scraper used for recording changes to Portland jail database
dataframe datasette python python3 scrapy scrapy-crawler scrapy-spider
Last synced: 23 Apr 2025
https://github.com/santhoshse7en/imdb-scrapy
A fun project made using Scrapy. The spiders included can extract movies, TV series, and TV movies based on year and title type. A lot more to come.
beautifulsoup4 imdb imdb-dataset imdb-scraper scrapy scrapy-crawler scrapy-framework scrapy-spider
Last synced: 03 Mar 2025
https://github.com/venkatamutyala/wordpress-plugins-crawler-scrapy
Scrapy scripts to crawl all WordPress.org plugins
scrapy scrapy-crawler scrapy-spider webscraper wordpress wordpress-plugin-crawler
Last synced: 03 Apr 2025
https://github.com/ayemunhossain/scrapy-web-crawler-projects
Python crawler built with the Scrapy framework, which may help you learn web scraping.
python-web-scraping python-webscraping-application scrapy scrapy-crawler scrapy-python scrapy-spider webscraping
Last synced: 24 Mar 2025
https://github.com/farukalamai/yelp-scraper-scrapy-python
Yelp restaurant data scraping using Python and a Scrapy spider
ai-bot data-extraction data-mining data-scraper data-scraping python python-scraper scrapy scrapy-crawler scrapy-spider web-scraper web-scraping web-scraping-python web-scraping-software yelp yelp-api yelp-restaurants yelp-resturant-data-scraping yelp-scraper
Last synced: 29 Dec 2024
https://github.com/franck-gaspoz/moviedbassistant
Generates movie documents (HTML, movie database, ...) using web crawlers (IMDb, ...)
cli database gui imdb movie scraper scrapy-crawler scrapy-spider scrawler
Last synced: 10 Apr 2025
https://github.com/noseparte/hero-kills-reptile
❄️ Hero Kill (英雄杀) trilogy: the Hero Kill crawler project, which crawls hero profiles and card information
Last synced: 15 May 2025
https://github.com/adamfisher/scrapyrt.client
A C# client to make calls to a scrapyrt (Scrapy real-time) HTTP endpoint.
crawler scraper scrapy scrapy-crawler scrapy-framework scrapy-spider
Last synced: 21 Mar 2025
https://github.com/aa-nadim/trip-crawler
This project is designed for web scraping and managing data using Docker containers. It includes services for a PostgreSQL database, a web scraper, pgAdmin, and a testing setup.
scrapy scrapy-crawler scrapy-spider
Last synced: 19 Feb 2025
https://github.com/minhtran241/data-scrapping-hugoboss
Data scraping using Scrapy
data-scraping scrapy scrapy-crawler scrapy-spider
Last synced: 28 Mar 2025
https://github.com/aquatiko/craigslist-spider
A Python spider to scrape job listings and details from https://newyork.craigslist.org.
craigslist dynamic jobseeker python3 scrapy-spider
Last synced: 14 Mar 2025
https://github.com/fmgarciaa/scrapy
Notes on Platzi's Scrapy course. I learned how to extract data from the internet in an automated way by creating spiders
platzi-course python scrapy-spider
Last synced: 07 Apr 2025
https://github.com/rggh/scrapy12
cookpad
cookpad infinite-scroll scrapy-spider
Last synced: 28 Mar 2025
https://github.com/fernandotorresl/platzi-intelligence-agency_bak
Platzi Intelligence Agency Website (Project from Scrapy Course - Platzi.com)
python scrapy scrapy-crawler scrapy-spider
Last synced: 26 Mar 2025
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 08 Feb 2025
https://github.com/buihdk/scrapy-books
A demo of scraping book data from the website https://books.toscrape.com using Scrapy
Last synced: 13 Mar 2025
https://github.com/amiriiw/coin_crawler
A cryptocurrency data scraping and web application built with Flask and Scrapy. This project collects cryptocurrency data from CoinMarketCap and serves it through a Flask-based web application, allowing users to view various crypto trends, newly listed coins, top gainers/losers, and specific coin details.
coin-crawler css3 flask-application html5 python3 requests scrapy scrapy-spider scrapy-splash spider
Last synced: 13 Mar 2025
https://github.com/ahmadhuss/scrapy_universal
A universal, plug-and-play Scrapy boilerplate for scalable web scraping projects.
python scrapy scrapy-crawler scrapy-spider
Last synced: 07 May 2025
https://github.com/armanjscript/web-scraping
Web Scraping projects with Python
beautifulsoup4 python scrapy scrapy-crawler scrapy-spider selenium selenium-python selenium-webdriver webscraping
Last synced: 04 Apr 2025
https://github.com/damikaalwis-gif/adscrapex
AdScrapeX is a web scraping project built with Scrapy and Scrapy Playwright to extract data from popular classified ad websites in Sri Lanka, including vehicles, properties, and job listings.
classifieds ikman-lk playwright python scrapy scrapy-spider webscraping
Last synced: 13 Dec 2024
https://github.com/marc7666/prac1-tcvd-web-scraping
PRAC1 of the subject "Data typology and life cycle" of the MSc in Data Science at Universitat Oberta de Catalunya
csv-export csv-files python scrapy scrapy-crawler scrapy-spider webscraping
Last synced: 09 May 2025
https://github.com/intel00000/web_scraper_hasmo
A Python web scraper designed to collect data from competitor websites, news articles, and market research reports.
google-api google-api-python-client gspread-api openai-api scrapy-crawler scrapy-framework scrapy-spider scrapy-splash
Last synced: 13 Mar 2025
https://github.com/sachnaror/scrapy_scraper
Using Scrapy: it scrapes quotes from [quotes.toscrape.com] and saves them to a CSV file
scrapy scrapy-crawler scrapy-spider
Last synced: 20 Feb 2025
https://github.com/loglux/scan_co_uk
This repository contains a Scrapy spider designed to scrape product information from Scan.co.uk based on provided search terms and filters.
filtering-data python scrapy scrapy-spider search webscraping
Last synced: 25 Feb 2025
https://github.com/loglux/amazon_uk
A Scrapy spider to scrape Amazon UK product details based on search terms, categories, and filters. Features deduplication filter for unique listings and optional sponsored link exclusion.
amazon filtering-data python scraper scrapy-spider search webscraping
Last synced: 25 Feb 2025
https://github.com/danielsiepmann/typo3-docs-search
Full Stack to allow searching docs.typo3.org
docs scrapy-spider search typo3
Last synced: 22 Mar 2025
https://github.com/abdullahalzubaer/web-scrapping-scrapy
A simple web scraper
json scrapy scrapy-crawler scrapy-spider webscraping
Last synced: 30 Mar 2025
https://github.com/vaibhavyadav-dev/codeforces-problemset-scrapper
Web scraper that scrapes the whole Codeforces problemset into a CSV or JSON file.
codeforces competative competative-programming crawler problemset programming python scrapy-crawler scrapy-spider
Last synced: 06 Mar 2025
https://github.com/phalski/mystundenplan-crawler
A web crawler for myStundenplan
ingolstadt mystundenplan primuss scrapy scrapy-crawler scrapy-spider th-ingolstadt thi
Last synced: 14 Mar 2025
https://github.com/joannescode/quotestoscrape
Content scraping with the Scrapy library...
python3 scraping-websites scrapy-spider
Last synced: 26 Feb 2025
https://github.com/joannescode/metacritic-crawler
Content scraping with the Scrapy library...
python3 scraping-websites scrapy-spider
Last synced: 26 Feb 2025
https://github.com/tknishh/scrapy-redis-data-pipeline
Building a highly scalable data streaming pipeline in python
datapipeline python redis-client redis-server scrapy-spider
Last synced: 18 Mar 2025
https://github.com/vlada-pv/scrapy-parser-wiki-movies
The Wiki Movies Scraper is a Scrapy project designed to collect information on movies from Wikipedia, including their Title, Genre, Director, Country and Year.
scrapy scrapy-spider wikipedia
Last synced: 15 Mar 2025
https://github.com/steadyxlr8/scrapy_practice
Web scraping using the Scrapy framework
python scrapy-spider web-scraping
Last synced: 31 Mar 2025
https://github.com/team-synergy-resource-mapper/classified-ad-scraper
AdScrapeX is a web scraping project built with Scrapy and Scrapy Playwright to extract data from popular classified ad websites in Sri Lanka, including vehicles, properties, and job listings.
classifieds ikman-lk playwright python scrapy scrapy-spider webscraping
Last synced: 31 Mar 2025
https://github.com/aman-dutta/oscar_winners
Scraping the IMDb website to find the Oscar winners using a web crawler and Python
python3 scrapy-crawler scrapy-spider
Last synced: 05 Mar 2025
https://github.com/chyornyy/funpay-parser
The Funpay Parser is a web scraper built with Scrapy that extracts information about lots on the https://funpay.com/ marketplace.
python3 scrapy scrapy-crawler scrapy-spider
Last synced: 31 Mar 2025
https://github.com/hayatiyrtgl/scrapy_amazon_project
Scrapy bot to crawl the Amazon website
python python-script scraper scraping scraping-websites scrapy scrapy-spider
Last synced: 08 Apr 2025
https://github.com/hayatiyrtgl/pc_prices_scraper
scraper scraping scraping-websites scrapy scrapy-spider
Last synced: 08 Apr 2025
https://github.com/gilzoide/pparker
Spiders that fetch news
scraping scrapy-crawler scrapy-spider web-crawling web-scraping
Last synced: 07 Apr 2025
https://github.com/muneeb1030/webscrapper_altnews
The project utilizes a combination of Python, Scrapy, and Selenium to navigate through the dynamic content of AltNews.in and collect valuable information for analysis and verification.
data-analysis data-collection python3 scrapy scrapy-spider selenium selenium-python
Last synced: 28 Feb 2025
https://github.com/muneeb1030/webscrapper_politifact
This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.
data data-collection dataanalysis python3 scrapy scrapy-spider webscraping
Last synced: 28 Feb 2025
https://github.com/vikram-bhati/quotes_scrapy
Simple spider to scrape quotes using Scrapy
python scraping-websites scrapy scrapy-crawler scrapy-spider
Last synced: 20 Feb 2025
https://github.com/karantan/scrapy_scout
Web crawler for scraping varstvo-konkurence.si
Last synced: 02 Apr 2025
https://github.com/elymsyr/bimproject_mongo
Web Crawling with Scrapy (bimobject.com)
scrapy scrapy-crawler scrapy-spider web-crawler web-crawler-python
Last synced: 21 Feb 2025
https://github.com/fernandotorresl/scraping-conferencias
Word cloud analysis of the daily government conferences of Mexico's president using Scrapy
jupyter-notebook mexico python scrapy scrapy-spider wip
Last synced: 26 Mar 2025