Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sanikamal/web-scraping-atoz

Extract data from websites using Python
https://github.com/sanikamal/web-scraping-atoz

beautifulsoup4 data-mining requests scrapy selenium web-scraping

Last synced: about 11 hours ago
JSON representation

Extract data from websites using Python

Awesome Lists containing this project

README

        

# Web Scraping with Python
> ### Web scraping is the process of extracting data from web sites via programmatic means.

### What is Web Scraping
`Web Scraping` (also termed `Screen Scraping`, `Web Data Extraction`, `Web Harvesting` etc.) is a technique employed to extract large amounts of data from websites whereby the data is extracted and saved to a local file in your computer or to a database in table (spreadsheet) format.

### Popular web scraping tools:

- `BeautifulSoup` is a python library for pulling data (parsing) out of HTML and XML files.

- `Scrapy` is a free open source application framework used for crawling web sites and extracting structured data which can be used for a variety of things like data mining,research ,information process or historical archival.

### Contents
- [Scraping Car_Dealer_Website](https://github.com/sanikamal/web-scraping-with-python/blob/main/notebook/Web_Scraping_a_Car_Dealer_Website.ipynb)

### Some Useful Link
- [Scrapy documentation](https://scrapy.org/)
- [Scrapinghub website](https://scrapinghub.com/)
- [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)