Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/navneetguptacse/web-scraping.scr
Python's micro web framework (Flask) - based web scraping tool for `Flipkart's` product data extraction - descriptions, ratings, and reviews using Python and beautiful-soup.
https://github.com/navneetguptacse/web-scraping.scr
beautifulsoup flask flipkart-scraper-python micro-framework python scrapping website
Last synced: 7 days ago
JSON representation
Python's micro web framework (Flask) - based web scraping tool for `Flipkart's` product data extraction - descriptions, ratings, and reviews using Python and beautiful-soup.
- Host: GitHub
- URL: https://github.com/navneetguptacse/web-scraping.scr
- Owner: navneetguptacse
- Created: 2023-02-26T10:20:14.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-09-18T18:31:42.000Z (about 1 year ago)
- Last Synced: 2024-11-13T03:09:38.594Z (7 days ago)
- Topics: beautifulsoup, flask, flipkart-scraper-python, micro-framework, python, scrapping, website
- Language: Python
- Homepage: https://www.kaggle.com/code/anjusukumaran4/web-scraping-using-beautiful-soup-flipkart-mobiles
- Size: 323 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Web Scraping with Flask
### **Description**
Web Scraping with Flask is a versatile web application that leverages the power of web scraping to retrieve and analyze product reviews from the popular e-commerce platform, Flipkart. This project combines the capabilities of Flask, a micro web framework, with BeautifulSoup for parsing HTML, and MongoDB for storing and managing scraped data efficiently.
### **Key Features**
- **User-Friendly Web Interface :** The project offers a user-friendly web interface for users to input their search query, making it easy to initiate the scraping process.
- **Scraping Flipkart :** The application scrapes Flipkart search results for a specified product or query, extracting valuable information such as product names, customer names, ratings, review headings, and comments.
- **Data Storage in MongoDB :** The scraped data is stored in a MongoDB database. This allows for efficient data retrieval, storage, and querying, ensuring that the data is well-organized and easily accessible.
- **Logging for Error Handling :** The project incorporates logging to track any potential errors or exceptions during the scraping process, making it easier to troubleshoot and maintain.
- **Cross-Origin Resource Sharing (CORS) :** To ensure compatibility with web browsers and prevent cross-origin issues, the application utilizes Flask-CORS for handling cross-origin requests.
- **Dynamic CSV Generation :** The application dynamically generates a CSV file containing the scraped data, enabling users to download and analyze the data offline.
### **How It Works**
- Users access the web application through their web browser.
- They input their desired product or query in the search bar and submit the form.
- The application performs web scraping on Flipkart's website to retrieve product reviews related to the query.
- The scraped data is stored in a MongoDB database for future reference and analysis.
- Users are presented with a user-friendly interface displaying the scraped reviews, including product names, customer names, ratings, review headings, and comments.
- Users can also download the scraped data in CSV format for further analysis.