Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/ukharecha/cuisine-collector

Cuisine Collector scrapes recipe data from Ranveer Brar's website, extracting titles, ingredients, and cooking instructions. It automates data collection with Playwright and writes structured JSON output for easy access and analysis.

Host: GitHub
URL: https://github.com/ukharecha/cuisine-collector
Owner: UKharecha
Created: 2024-08-16T07:14:05.000Z (6 months ago)
Default Branch: main
Last Pushed: 2024-08-30T06:49:33.000Z (6 months ago)
Last Synced: 2025-02-10T22:44:38.238Z (9 days ago)
Topics: playwright, python3, webscraping
Language: Python
Homepage: https://ranveerbrar.com/recipes/
Size: 18.6 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# 🍽️ Cuisine Collector

**Cuisine Collector** is a Python-based web scraping tool designed to extract and compile detailed recipe information from culinary websites. 🍲 Using the Playwright library, this project automates browser interactions to gather recipe details like titles, ingredients, and cooking instructions. 📜

## Key Features

- **Automated Recipe Extraction**: Scrape recipes from specified websites effortlessly. 🚀
- **Comprehensive Data Collection**: Retrieve titles, ingredients, and cooking processes. 📝
- **Customizable**: Adapt easily to different recipe sites by modifying selectors. 🔧
- **Output in JSON Format**: Save collected recipes in a structured JSON file for easy use and analysis. 📂
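
The last two features can be pictured with a small sketch. The repository's own code is not reproduced here, so everything below is an assumption made for illustration: the `SiteSelectors` class, the CSS selector strings, and the `save_recipes` helper are hypothetical names, not the project's API.

```python
import json
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class SiteSelectors:
    """CSS selectors for one recipe site (hypothetical structure)."""
    title: str
    ingredients: str
    instructions: str


# Assumed selector values; real ones depend on the target site's markup.
EXAMPLE_SELECTORS = SiteSelectors(
    title="h1.recipe-title",
    ingredients="ul.ingredients li",
    instructions="ol.instructions li",
)


def save_recipes(recipes: list[dict], path: str) -> None:
    """Write recipes to a JSON file, creating parent directories if needed."""
    out = Path(path)
    out.parent.mkdir(parents=True, exist_ok=True)
    # ensure_ascii=False keeps non-ASCII characters readable in the file.
    with out.open("w", encoding="utf-8") as f:
        json.dump(recipes, f, ensure_ascii=False, indent=2)
```

Keeping the selectors in one small object means that adapting to another site would, under this design, only require a new `SiteSelectors` instance rather than changes to the scraping logic.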

## Installation

1. Clone the repository:
```bash
git clone https://github.com/UKharecha/Cuisine-Collector.git
```

2. Navigate to the project directory:
```bash
cd Cuisine-Collector
```

3. Install the required dependencies:
```bash
pip install playwright
playwright install
```

## Usage

1. Update the `scraper.py` file with the target website and appropriate selectors. 🛠️
2. Run the scraper:
```bash
python main.py
```

3. Check the `data/processed/recipes.json` file for the extracted recipe data. 📁
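
Step 1 comes down to pointing Playwright at a page and reading the text of the elements each selector matches. The following is a minimal sketch of that idea, not the contents of the repository's `scraper.py`: the function names and parameters are hypothetical, and it assumes Playwright's synchronous API.

```python
def clean_lines(texts):
    """Strip whitespace and drop empty entries from scraped text."""
    return [t.strip() for t in texts if t and t.strip()]


def scrape_recipe(url, title_sel, ingredients_sel, instructions_sel):
    """Load one recipe page and return its data as a dict.

    All four arguments are supplied by the caller; the selectors must
    match the markup of the site being scraped.
    """
    # Imported here so clean_lines stays usable without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url)
            # First matching element is treated as the recipe title.
            title = page.locator(title_sel).first.inner_text()
            # Every matching element becomes one list entry.
            ingredients = page.locator(ingredients_sel).all_inner_texts()
            steps = page.locator(instructions_sel).all_inner_texts()
        finally:
            browser.close()

    return {
        "title": title.strip(),
        "ingredients": clean_lines(ingredients),
        "instructions": clean_lines(steps),
    }
```

A call might look like `scrape_recipe(recipe_url, "h1", "ul.ingredients li", "ol.instructions li")`, where `recipe_url` and all three selectors are placeholders to be replaced with values taken from the target site.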

## Contribution

We welcome contributions! 🎉 Feel free to submit issues or pull requests. 🤝

---

Happy scraping! 🍴