Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/memirdogan/web-scraping-with-beautiful-soup-and-selenium

This repository offers a guide to web scraping with Beautiful Soup and Selenium. It covers data extraction, HTML basics, XPath, and CSS selectors. Includes a project for scraping Trendyol. Ideal for beginners wanting to learn web scraping.
https://github.com/memirdogan/web-scraping-with-beautiful-soup-and-selenium

beautifulsoup beautifulsoup4 html python selenium selenium-webdriver webscraping

Last synced: 4 months ago
JSON representation

This repository offers a guide to web scraping with Beautiful Soup and Selenium. It covers data extraction, HTML basics, XPath, and CSS selectors. Includes a project for scraping Trendyol. Ideal for beginners wanting to learn web scraping.

Awesome Lists containing this project

README

        

![Star Badge](https://img.shields.io/static/v1?label=%F0%9F%8C%9F&message=If%20Useful&style=style=flat&color=BC4E99)
![Open Source Love](https://badges.frapsoft.com/os/v1/open-source.svg?v=103)
[![View My Profile](https://img.shields.io/badge/View-My_Profile-green?logo=GitHub)](https://github.com/Memirdogan)
[![View Repositories](https://img.shields.io/badge/View-My_Repositories-blue?logo=GitHub)](https://github.com/Memirdogan?tab=repositories)

# Web Scraping with Beautiful Soup and Selenium

Welcome to the Web Scraping with Beautiful Soup and Selenium repository! This repository provides a comprehensive guide to web scraping, from basic concepts to practical implementations.

## Table of Contents

- [Introduction](#introduction)
- [Prerequisites](#prerequisites)
- [Installation](#installation)
- [Usage](#usage)
- [Project Structure](#project-structure)
- [Topics Covered](#topics-covered)
- [Contributing](#contributing)

## Introduction

This repository is designed to help beginners understand the basics of web scraping using Beautiful Soup and Selenium. You'll learn how to extract data from websites and perform various tasks such as navigating HTML structures, using XPath, and applying CSS selectors. The repository includes a practical project for scraping data from the Trendyol website.

## Prerequisites

Before you begin, make sure you have the following installed:

- Python 3.x
- pip (Python package installer)

## Installation

To install the required libraries

```
beautifulsoup4
selenium
requests
```

## Usage

Here's a brief overview of how to use the scripts in this repository:

1. Clone the repository:
```bash
git clone https://github.com/Memirdogan/Web-Scraping-with-Beautiful-Soup-and-Selenium.git
```
2. Navigate to the project directory:
```bash
cd Web-Scraping-with-Beautiful-Soup-and-Selenium
```

## Project Structure

```plaintext
Web-Scraping-with-Beautiful-Soup-and-Selenium/
├── .idea/
├── Beautiful Soup’a Giriş/
├── Bot Tespiti ve Proxy/
├── HTML Temelleri/
│ └── Uygulama
├── Selenium'a Giriş/
├── TrendyolPhoneData/
│ ├── TrendyolPhoneData.py
│ ├── main.py
│ ├── DataClear.py
│ ├── datasetanalyser.py
│ └── ...
└── XPath ve CSS Seçicilere Giriş/
```

## Topics Covered

- Introduction to Beautiful Soup
- Bot Detection and Proxy
- HTML Basics
- Introduction to Selenium
- TrendyolPhoneData Project
- Introduction to XPath and CSS Selectors

## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.