Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/emmanuel10701/data_scraping
Data-Scraping
https://github.com/emmanuel10701/data_scraping
beautifulsoup csv excel numpy pandas python web-scapping
Last synced: 4 days ago
JSON representation
Data-Scraping
- Host: GitHub
- URL: https://github.com/emmanuel10701/data_scraping
- Owner: Emmanuel10701
- Created: 2025-01-15T09:17:02.000Z (21 days ago)
- Default Branch: main
- Last Pushed: 2025-01-27T17:42:59.000Z (8 days ago)
- Last Synced: 2025-01-27T18:42:37.158Z (8 days ago)
- Topics: beautifulsoup, csv, excel, numpy, pandas, python, web-scapping
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 3
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# E-commerce Reviews Scraping
This Python project scrapes customer reviews from an e-commerce website (or a local HTML file) and saves the extracted data into both **CSV** and **Excel** formats. It uses libraries like **BeautifulSoup**, **Pandas**, **Requests**, and **openpyxl** to achieve this.
## Requirements
Before using this project, ensure that **Python** is installed on your machine, and the necessary libraries are set up:
1. **Python Installation**:
- Make sure you have Python installed on your system. You can download it from [python.org](https://www.python.org/downloads/).
- After installation, verify by running the following command in your terminal:
```bash
python --version
```
or
```bash
python3 --version
```
This should print the Python version (e.g., `Python 3.x.x`).2. **Library Installation**:
The following Python libraries are required to run this project:
- **pandas**: For handling and saving the scraped data.
- **beautifulsoup4**: For parsing and extracting data from HTML.
- **requests**: For sending HTTP requests to scrape data from a live URL.
- **openpyxl**: For saving data to an Excel file.To install the required libraries, open your terminal and run the following command:
```bash
pip install pandas beautifulsoup4 requests openpyxl