https://github.com/arya-io/flipkart-data-scraping

A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.

beautifulsoup data-science data-scraping flipkart pandas python selenium web-scraping

Host: GitHub
URL: https://github.com/arya-io/flipkart-data-scraping
Owner: arya-io
License: mit
Created: 2024-10-15T16:33:31.000Z (9 months ago)
Default Branch: main
Last Pushed: 2024-10-19T08:03:30.000Z (9 months ago)
Last Synced: 2025-02-13T02:47:32.391Z (5 months ago)
Topics: beautifulsoup, data-science, data-scraping, flipkart, pandas, python, selenium, web-scraping
Language: Jupyter Notebook
Homepage:
Size: 22.5 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Flipkart-Scraping

## Overview
This project is a web scraping application that extracts product details from Flipkart's mobile section. The scraper gathers information on various products, including their names, prices, descriptions, and ratings, across multiple pages. It is designed to help users understand how to collect and organize data from e-commerce websites for analysis.

## Features
- Scrapes product names, prices, descriptions, and ratings.
- Navigates through multiple pages of product listings.
- Uses Selenium to load dynamically rendered content and BeautifulSoup to extract data from the page source.
- Saves the collected data into a CSV file for further analysis.
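
The features above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the sample HTML, the CSS class names (`product`, `name`, `price`, `description`, `rating`), the `page` query parameter, and the helper names `page_url`, `fetch_html`, `text_or_none`, and `parse_products` are all assumptions made for the example. Flipkart's real markup uses different class names that change over time.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a listing page. The class names are
# placeholders, not Flipkart's real (and frequently changing) ones.
SAMPLE_HTML = """
<div class="product">
  <div class="name">Phone A (Blue, 128 GB)</div>
  <div class="price">₹12,999</div>
  <ul class="description"><li>6 GB RAM</li><li>5000 mAh Battery</li></ul>
  <div class="rating">4.3</div>
</div>
<div class="product">
  <div class="name">Phone B (Black, 64 GB)</div>
  <div class="price">₹8,499</div>
  <ul class="description"><li>4 GB RAM</li></ul>
</div>
"""


def page_url(base_url, page):
    """Build the URL of one listing page (assumes a `page` query parameter)."""
    separator = "&" if "?" in base_url else "?"
    return f"{base_url}{separator}page={page}"


def fetch_html(url):
    """Load a page in a browser so dynamically rendered content is present."""
    from selenium import webdriver  # imported lazily; needs a local Chrome

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()


def text_or_none(card, selector):
    """Return the text of the first match inside `card`, or None if absent."""
    node = card.select_one(selector)
    return node.get_text("; ", strip=True) if node else None


def parse_products(html):
    """Extract one dict per product card from the page source."""
    soup = BeautifulSoup(html, "html.parser")
    return [
        {
            "name": text_or_none(card, ".name"),
            "price": text_or_none(card, ".price"),
            "description": text_or_none(card, ".description"),
            "rating": text_or_none(card, ".rating"),
        }
        for card in soup.select("div.product")
    ]


# Parse the inline sample; a real run would loop over page_url(...) values
# and pass fetch_html(url) to parse_products instead.
products = parse_products(SAMPLE_HTML)
for product in products:
    print(product)
```

Returning `None` for a missing field, as with the second sample product's rating, keeps every record the same shape, which makes the later conversion to a table simpler.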

## Technologies Used
- Python
- Selenium
- BeautifulSoup
- Pandas

## Installation
To run this project, you need Python installed on your machine. Follow these steps to set up the environment:

1. Clone this repository:
```bash
git clone https://github.com/arya-io/flipkart-data-scraping.git
```

2. Navigate to the project directory:
```bash
cd flipkart-data-scraping
```

3. Install the required libraries:
```bash
pip install -r requirements.txt
```

## Usage
1. Open the Jupyter Notebook file `Flipkart_Scraping.ipynb`.
2. Run each cell step-by-step to execute the web scraping process.
3. The scraped data will be saved in `Flipkart_Scraping.csv` in the project directory.
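
Once the CSV exists, it can be loaded with Pandas for analysis. The sketch below is only an illustration: the column names (`name`, `price`, `rating`) and the sample rows are assumptions, since the notebook's actual output columns are not listed here. To stay self-contained, it writes a small stand-in file to a temporary directory instead of reading the real one.

```python
import os
import tempfile

import pandas as pd

# Illustrative rows only; the column names are assumptions, not
# necessarily the ones the notebook writes.
rows = [
    {"name": "Phone A", "price": "₹12,999", "rating": "4.3"},
    {"name": "Phone B", "price": "₹8,499", "rating": None},
]

# Write a stand-in CSV to a temporary directory, then read it back.
csv_path = os.path.join(tempfile.mkdtemp(), "Flipkart_Scraping.csv")
pd.DataFrame(rows).to_csv(csv_path, index=False)

df = pd.read_csv(csv_path)

# Drop the currency symbol and thousands separators, then convert to numbers.
df["price"] = pd.to_numeric(df["price"].str.replace(r"[^\d]", "", regex=True))
# Missing ratings become NaN rather than raising an error.
df["rating"] = pd.to_numeric(df["rating"], errors="coerce")

print(df.sort_values("price").to_string(index=False))
```

Converting prices to numbers early makes sorting, filtering, and aggregation behave as expected, because the raw scraped values are strings.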

## Note
Please ensure you comply with Flipkart's terms of service when scraping their website.

## Contributing
Contributions are welcome! Please feel free to submit a pull request or open an issue for any enhancements or suggestions.

## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## Acknowledgements
- [Selenium Documentation](https://www.selenium.dev/documentation/)
- [BeautifulSoup Documentation](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)
- [Pandas Documentation](https://pandas.pydata.org/docs/)
