Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/firaskahlaoui/web-scraping-lvl2
The "web-scraping-lvl2" project is a learning exercise to explore advanced web scraping techniques using Scrapy.
framework learning python scrapy web-scraping
Last synced: about 2 months ago
- Host: GitHub
- URL: https://github.com/firaskahlaoui/web-scraping-lvl2
- Owner: FirasKahlaoui
- Created: 2024-05-21T13:54:30.000Z (8 months ago)
- Default Branch: master
- Last Pushed: 2024-06-23T21:55:49.000Z (7 months ago)
- Last Synced: 2024-06-23T22:45:13.519Z (7 months ago)
- Topics: framework, learning, python, scrapy, web-scraping
- Language: Python
- Homepage:
- Size: 63.5 KB
- Stars: 10
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Web Scraping Level 2
This project demonstrates web scraping using Scrapy to extract data from two different websites: "http://quotes.toscrape.com" and "https://www.cars.com".
## About This Project
The "web-scraping-lvl2" project is a learning exercise to explore advanced web scraping techniques using Scrapy, a powerful web scraping framework in Python. It showcases scraping of quotes from "http://quotes.toscrape.com" and car listings from "https://www.cars.com".
### Features
- **Quote Scraping**: Extracts quotes, authors, and tags from "http://quotes.toscrape.com".
- **Car Scraping**: Retrieves car listings including make, model, year, price, and location from "https://www.cars.com".
## Technologies Used
- Python
- Scrapy
- XPath or CSS selectors for data extraction
- Command line interface for running Scrapy spiders
## Installation
1. Clone the repository:
```bash
git clone https://github.com/FirasKahlaoui/web-scraping-lvl2.git
cd web-scraping-lvl2