https://github.com/saeed0xf/web-scrapers
a repository for web scrapers built in node.js ⛓ and python 🐍.
https://github.com/saeed0xf/web-scrapers
automation nodejs python python3 web webscraping
Last synced: 11 months ago
JSON representation
a repository for web scrapers built in node.js ⛓ and python 🐍.
- Host: GitHub
- URL: https://github.com/saeed0xf/web-scrapers
- Owner: saeed0xf
- Created: 2022-07-23T15:29:33.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2022-07-23T15:43:59.000Z (almost 4 years ago)
- Last Synced: 2025-03-01T03:49:08.488Z (over 1 year ago)
- Topics: automation, nodejs, python, python3, web, webscraping
- Language: Python
- Homepage:
- Size: 42 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# web-scrapers
Web scrapers for scraping different websites.
- craiglist : It's built in node.js and it scrapes Job title, job descriptions, location, and job urls from craiglist site.
- imdb : It's built in python with scrapy module. It scrapes movie title, url, movie duration and url of top 250 movies from imdb website.
- population : It's also built in python with scrapy module. It scrapes world wide population for each country from worldometers website
- pptr : It's built in node.js with puppeteer headless browser.
- table : It's built in node.js. It only scrape tables.
- tinydeals : It's built in python. It scrapes product name, url, ratings and offers from tinydeals website. (now tinydeal website has been closed so we can't use this scraper.)
```
Note : Since websites are updated all the time some of these scrapers may not work.
```