Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/amajji/web-scraping-with-scrapy-
This project aims to scrap a US government website using the Scrapy framework
https://github.com/amajji/web-scraping-with-scrapy-
scraper scraping scraping-websites scrapper scrapy webscraper webscraping
Last synced: about 14 hours ago
JSON representation
This project aims to scrap a US government website using the Scrapy framework
- Host: GitHub
- URL: https://github.com/amajji/web-scraping-with-scrapy-
- Owner: amajji
- License: gpl-3.0
- Created: 2022-05-13T12:49:17.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-05-13T13:48:16.000Z (over 2 years ago)
- Last Synced: 2023-03-10T01:56:17.124Z (over 1 year ago)
- Topics: scraper, scraping, scraping-websites, scrapper, scrapy, webscraper, webscraping
- Language: Jupyter Notebook
- Homepage:
- Size: 394 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Web scraping.
Data scientist | [Anass MAJJI](https://www.linkedin.com/in/anass-majji-729773157/)
***## :monocle_face: Description
- This project aims to scrap a US government website using the Scrapy framework.
## :rocket: Repository Structure
The repository contains the following files & directories:
- **Web_scrapping.ipynb:** This notebook contains all details about the webscraping steps using Scrapy framework.
![](scrapy.png)
## :chart_with_upwards_trend: Performance & results
Files are downloaded and stored in downloaded_files folder :![](result_scrapy.png)
---
## :mailbox_closed: Contact
For any information, feedback or questions, please [contact me][anass-email][anass-email]: mailto:[email protected]