Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/dms-codes/scrape-prenadamedia-com

Prenada Media Scraper This Python script is designed to scrape book information from the Prenada Media website. It gathers data such as book titles, authors, prices, publication years, descriptions, availability, ISBNs, weights, dimensions, page counts, cover types, and image URLs. The scraped data is then stored in a CSV file for further analysis
https://github.com/dms-codes/scrape-prenadamedia-com

python scrape webscraping

Last synced: 2 days ago
JSON representation

Host: GitHub
URL: https://github.com/dms-codes/scrape-prenadamedia-com
Owner: dms-codes
Created: 2022-11-24T10:47:30.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-09-30T22:34:17.000Z (over 1 year ago)
Last Synced: 2023-09-30T23:27:25.569Z (over 1 year ago)
Topics: python, scrape, webscraping
Language: Python
Homepage: https://github.com/dms-codes/scrape-prenadamedia-com
Size: 1.95 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Prenada Media Scraper

This Python script is designed to scrape book information from the [Prenada Media website](https://prenadamedia.com/). It gathers data such as book titles, authors, prices, publication years, descriptions, availability, ISBNs, weights, dimensions, page counts, cover types, and image URLs. The scraped data is then stored in a CSV file for further analysis or use.

## Features

- Scrapes book data from the Prenada Media website.
- Retrieves a variety of book information.
- Supports scraping multiple pages within each category.
- Automatically determines if a book is out of stock.

## Prerequisites

Before running the script, make sure you have the following Python libraries installed:

- `requests`: To make HTTP requests.
- `BeautifulSoup`: For web scraping.

You can install these libraries using `pip`:

```bash
pip install requests beautifulsoup4
```

## Usage

1. Modify the script to specify the `HOME_URL` variable with the URL of the Prenada Media category you want to scrape.

2. Run the script by executing the following command in your terminal:

```bash
python your_script_name.py
```

3. The script will scrape book data from the specified category, including details like titles, authors, prices, and more.

4. The scraped data will be stored in a CSV file named 'data.csv' in the same directory as the script.

## Configuration

- `HOME_URL`: The URL of the Prenada Media category you want to scrape.
- `COLUMNS`: The column names for the CSV file where the data will be saved.
- `STOCK`: The default stock value for in-stock items (modify as needed).

## Example

Suppose you want to scrape books from a specific Prenada Media category. After running the script, you will find the scraped data saved in 'data.csv' in the script's directory.

## License

This script is provided under the [MIT License](LICENSE).
```

Replace `"your_script_name.py"` with the actual name of your script. You can also add more details to the README.md file, such as installation instructions and additional usage examples, based on your project's specific needs.