https://github.com/arumie/recipe-scraper
Scraper for scraping recipes
https://github.com/arumie/recipe-scraper
deno puppeteer typescript
Last synced: 2 months ago
JSON representation
Scraper for scraping recipes
- Host: GitHub
- URL: https://github.com/arumie/recipe-scraper
- Owner: arumie
- License: mit
- Created: 2024-11-27T20:22:45.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-27T21:43:24.000Z (over 1 year ago)
- Last Synced: 2025-03-22T08:29:53.343Z (over 1 year ago)
- Topics: deno, puppeteer, typescript
- Language: TypeScript
- Homepage: https://dzach.dev
- Size: 11.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# Recipe Scraper
This project is a recipe scraper that extracts recipe information from a given
URL and converts it into Markdown format. It uses Puppeteer for web scraping and
Deno as the runtime environment.
## Features
- Scrapes recipe title, description, tags, ingredients, and steps.
- Converts the scraped recipe into Markdown format.
- Supports headless and non-headless browser modes.
## Installation
1. Install [Deno](https://deno.land/#installation).
2. Clone this repository:
```sh
git clone https://github.com/your-username/recipe-scraper.git
cd recipe-scraper
deno install -A
```
## Usage
To run the scraper, use the following command:
```sh
deno task run [headless]
```
### Arguments
- ``: URL of the recipe to scrape (required).
- `[headless]`: Whether to run the browser in headless mode (default: true).
### Example
```sh
deno task run https://www.example.com/recipe true
```
## Project Structure
- `main.ts`: Entry point of the application.
- `src/scraper.ts`: Contains the main scraping logic.
- `src/utils.ts`: Utility functions for the scraper.
- `deno.json`: Configuration file for Deno tasks.
## License
This project is licensed under the MIT License.