https://github.com/dreamjet31/theinfautation-scraper

The Infautation scraper for me
https://github.com/dreamjet31/theinfautation-scraper

axios cheerio infautation javascript mongodb nodejs puppeteer scraping

Last synced: about 1 month ago
JSON representation

The Infautation scraper for me

Host: GitHub
URL: https://github.com/dreamjet31/theinfautation-scraper
Owner: dreamjet31
Created: 2024-03-22T07:19:36.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-09-16T09:48:30.000Z (8 months ago)
Last Synced: 2025-02-10T08:49:37.780Z (3 months ago)
Topics: axios, cheerio, infautation, javascript, mongodb, nodejs, puppeteer, scraping
Language: JavaScript
Homepage:
Size: 65.4 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# TheInfatuation Scraper

## Project Description

A versatile web scraper built with Node.js that extracts valuable restaurant data from The Infatuation: [https://www.theinfatuation.com](https://www.theinfatuation.com). This scraper can gather data from two distinct categories:

* **Specific-Purpose Restaurant Guides:** Scrapes restaurant listings tailored to specific purposes (e.g., best brunch spots, date night restaurants) along with detailed information on each restaurant.

* **City-Based Restaurant Listings:** Scrapes comprehensive restaurant listings from a given city on The Infatuation.

## Data Collected

For each restaurant, the scraper gathers the following details:

* **Name**
* **Description**
* **Cuisine**
* **Street Address**
* **Neighborhood**
* **Phone Number**
* **Country**
* **Source (The Infatuation)**
* **City**
* **Rating**
* **Latitude**
* **Longitude**
* **URL**
* **Photos**
* **Website**
* **Perfect For (e.g., date night, groups)**
* **Price**

## Technologies Used

* **Node.js:** Core runtime environment for the project.
* **JavaScript:** The primary programming language.
* **Puppeteer:** For browser automation and controlling web page interactions.
* **Axios:** For making HTTP requests to The Infatuation.
* **Cheerio:** For parsing HTML and extracting specific data elements.
* **MongoDB:** To store the scraped data in a structured format.

## How to Use

1. **Prerequisites**
* Node.js and npm (or yarn) installed on your system.
* A running MongoDB instance.

2. **Installation**
```bash
git clone https://github.com/flurryunicorn/theinfautation-scraper
cd theinfautation-scraper
npm install
```

3. **Configuration**
* Create a `.env` file in the project root to store your MongoDB connection string. Example:
```
MONGODB_URI=mongodb://localhost:27017/your_database_name
```

4. **Running the Scraper**
* Modify the `index.js` (or relevant script) file to specify the target URLs and desired scraping behavior.
* Execute the script:
```bash
node index.js
```

## Contributing
Pull requests for feature suggestions/improvements are welcome. For major changes, please open an issue first to discuss what you would like to change.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dreamjet31/theinfautation-scraper

Awesome Lists containing this project

README