Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sarrabenyahia/tuto-webscraping

webscraping course tutorial
https://github.com/sarrabenyahia/tuto-webscraping

api beautifulsoup headers javascript proxy requests scrapy selenium user-agent webscraping

Last synced: 15 days ago
JSON representation

webscraping course tutorial

Awesome Lists containing this project

README

        

# πŸ•ΈοΈ Web Scraping Course

Welcome to the Web Scraping Course repository! πŸš€ This repository contains all the materials needed to follow and understand the course, including source codes, exercises and their solutions, as well as presentation materials.

## πŸ“‚ Repository Structure

- πŸ“ `/bs4` : Contains materials related to BeautifulSoup 4, including code examples and exercises.
- πŸ“ `/scrapy` : Includes resources for learning and working with Scrapy, such as spiders and project setups.
- πŸ“ `/selenium` : Stores materials for web scraping with Selenium, including scripts and browser automation examples.
- πŸ“ `/presentations` : Stores PowerPoint files and other course materials.

## πŸ“š Course Content

This course covers fundamental and advanced aspects of web scraping, including:

- πŸ” Introduction to web scraping and its ethics
- 🐍 Using Python libraries such as BeautifulSoup and Requests
- πŸ›‘οΈ Work around websites protections: user-agents, proxy-rotations, IP address management
- ⚑ Asynchronous scraping with Scrapy
- 🌐 Scraping dynamic websites with Selenium
- πŸ† Best practices and optimizations

## πŸ“‹ Prerequisites

- 🐍 Python 3.9+
- πŸ”Œ Familiarity with API requests

## πŸ› οΈ Installation

1. Clone this repository:
```
git clone https://github.com/sarrabenyahia/web-scraping-course.git
```
2. Install dependencies:
```
pip install -r requirements.txt
```

## πŸš€ Usage

- πŸ“‚ Browse the folders to access different course materials.
- πŸ“ Follow the instructions in each folder to run code examples or complete exercises.

## 🀝 Contributing

Contributions to improve the course content are welcome. Feel free to open an issue or submit a pull request.

## πŸ“ Your feedback

We appreciate your feedback! Please share your impressions and suggestions by filling out the following form:

[Give your feedback here](https://docs.google.com/forms/d/e/1FAIpQLSfTRzgjomMKIsf4NJ7N-FHa94DfpNMwSRyRfKyEyyUPUYWLdg/viewform?usp=sf_link)