Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ukharecha/cuisine-collector
Cuisine Collector scrapes recipe data from the Ranveer Brar's website, extracting detailed information including titles, ingredients, and cooking instructions. This project automates data collection using Playwright, providing a structured JSON output for easy access and analysis.
https://github.com/ukharecha/cuisine-collector
playwright python3 webscraping
Last synced: 6 days ago
JSON representation
Cuisine Collector scrapes recipe data from the Ranveer Brar's website, extracting detailed information including titles, ingredients, and cooking instructions. This project automates data collection using Playwright, providing a structured JSON output for easy access and analysis.
- Host: GitHub
- URL: https://github.com/ukharecha/cuisine-collector
- Owner: UKharecha
- Created: 2024-08-16T07:14:05.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-08-30T06:49:33.000Z (2 months ago)
- Last Synced: 2024-10-10T11:20:46.752Z (27 days ago)
- Topics: playwright, python3, webscraping
- Language: Python
- Homepage: https://ranveerbrar.com/recipes/
- Size: 18.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 🍽️ Cuisine Collector
**Cuisine Collector** is a Python-based web scraping tool designed to extract and compile detailed recipe information from culinary websites. 🍲 Using the Playwright library, this project automates browser interactions to gather recipe details like titles, ingredients, and cooking instructions. 📜
## Key Features
- **Automated Recipe Extraction**: Scrape recipes from specified websites effortlessly. 🚀
- **Comprehensive Data Collection**: Retrieve titles, ingredients, and cooking processes. 📝
- **Customizable**: Adapt easily to different recipe sites by modifying selectors. 🔧
- **Output in JSON Format**: Save collected recipes in a structured JSON file for easy use and analysis. 📂## Installation
1. Clone the repository:
```bash
git clone https://github.com/UKharecha/Cuisine-Collector.git
```2. Navigate to the project directory:
```bash
cd CuisineCollector
```3. Install the required dependencies:
```bash
pip install playwright
playwright install```
## Usage
1. Update the `scraper.py` file with the target website and appropriate selectors. 🛠️
2. Run the scraper:
```bash
python main.py
```3. Check the `data/processed/recipes.json` file for the extracted recipe data. 📁
## Contribution
We welcome contributions! 🎉 Feel free to submit issues or pull requests. 🤝
---
Happy scraping! 🍴