Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/arya-io/flipkart-data-scraping
A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.
https://github.com/arya-io/flipkart-data-scraping
beautifulsoup data-science data-scraping flipkart pandas python selenium web-scraping
Last synced: 7 days ago
JSON representation
A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.
- Host: GitHub
- URL: https://github.com/arya-io/flipkart-data-scraping
- Owner: arya-io
- License: mit
- Created: 2024-10-15T16:33:31.000Z (24 days ago)
- Default Branch: main
- Last Pushed: 2024-10-19T08:03:30.000Z (20 days ago)
- Last Synced: 2024-11-01T21:07:43.865Z (7 days ago)
- Topics: beautifulsoup, data-science, data-scraping, flipkart, pandas, python, selenium, web-scraping
- Language: Jupyter Notebook
- Homepage:
- Size: 22.5 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Flipkart-Scraping
## Overview
This project is a web scraping application that extracts product details from Flipkart's mobile section. The scraper gathers information on various products, including their names, prices, descriptions, and ratings, across multiple pages. It is designed to help users understand how to collect and organize data from e-commerce websites for analysis.## Features
- Scrapes product names, prices, descriptions, and ratings.
- Navigates through multiple pages of product listings.
- Utilizes Selenium for dynamic content handling and BeautifulSoup for data extraction.
- Saves the collected data into a CSV file for further analysis.## Technologies Used
- Python
- Selenium
- BeautifulSoup
- Pandas## Installation
To run this project, you'll need to have Python installed on your machine. Follow these steps to set up the environment:1. Clone this repository:
```bash
git clone https://github.com/arya-io/Flipkart_Scraping.git
```2. Navigate to the project directory:
```bash
cd Flipkart_Scraping
```3. Install the required libraries:
```bash
pip install -r requirements.txt
```## Usage
1. Open the Jupyter Notebook file `Flipkart_Scraping.ipynb`.
2. Run each cell step-by-step to execute the web scraping process.
3. The scraped data will be saved in `Flipkart_Scraping.csv` in the project directory.## Note
Please ensure you comply with Flipkart's terms of service when scraping their website.## Contributing
Contributions are welcome! Please feel free to submit a pull request or open an issue for any enhancements or suggestions.## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.## Acknowledgements
- [Selenium Documentation](https://www.selenium.dev/documentation/)
- [BeautifulSoup Documentation](https://www.crummy.com/software/BeautifulSoup/bs4/doc/)
- [Pandas Documentation](https://pandas.pydata.org/docs/)