https://github.com/klus3kk/rym_extractor
A Python tool for extracting and organizing music album data from HTML files. This project separates albums with ratings from wishlisted albums on RateYourMusic (RYM).
- Host: GitHub
- URL: https://github.com/klus3kk/rym_extractor
- Owner: Klus3kk
- Created: 2024-08-05T10:22:16.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-08-21T20:34:33.000Z (5 months ago)
- Last Synced: 2024-08-21T22:48:47.524Z (5 months ago)
- Language: HTML
- Size: 94.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# rym_extractor
RYM Album Data Extractor is a Python tool designed to scrape and organize music album data from RateYourMusic (RYM) HTML files. This tool extracts album details and separates albums with ratings from those that are wishlisted.
## Features
- Extracts artist names, album titles, and ratings from an HTML file.
- Separates wishlisted albums (albums without ratings) into a separate file.
- Skips entries with missing artist or album information.
## Getting Started
### Prerequisites
- Python 3.x
- BeautifulSoup4
- pandas
### Installation
1. Clone the repository:
```bash
git clone https://github.com/klus3kk/rym_extractor.git
cd rym_extractor
```
2. Install the required packages:
```bash
pip install pandas beautifulsoup4
```
### How to get HTML RYM album ratings
1. On the RateYourMusic home page, click your profile picture.
2. Scroll down and click the "Music" button, located above your rated albums.
3. Click the "Print this page" button.
4. Right-click the page that opens and choose "Save as...".
5. Save the file as "Webpage, HTML Only" with the recommended name "file".
6. Move the saved HTML file into the folder that contains `rym.py`, **and you're ready to go!**
### Usage
1. Place your `file.html` in the project directory.
2. Run the extraction script:
```bash
python rym.py
```
3. The script will generate two CSV files:
   - `music_data.csv` for albums with ratings.
   - `wishlisted_music_data.csv` for albums without ratings (wishlisted albums).
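
The splitting behavior described under Features can be illustrated with a minimal sketch. This is not the code in `rym.py`: the helper names `split_albums` and `to_csv`, the record keys (`artist`, `album`, `rating`), and the CSV column names are all assumptions made for illustration. It uses only the standard library `csv` module rather than pandas, and it starts from already-parsed records instead of parsing the RYM HTML.

```python
import csv
import io


def split_albums(records):
    """Partition album records into (rated, wishlisted).

    Hypothetical helper: each record is assumed to be a dict with
    'artist', 'album', and 'rating' keys.
    """
    rated, wishlisted = [], []
    for rec in records:
        artist = str(rec.get("artist") or "").strip()
        album = str(rec.get("album") or "").strip()
        if not artist or not album:
            continue  # skip entries with missing artist or album information
        rating = str(rec.get("rating") or "").strip()
        row = {"artist": artist, "album": album, "rating": rating}
        # An empty rating is treated as a wishlisted album.
        (rated if rating else wishlisted).append(row)
    return rated, wishlisted


def to_csv(rows, fieldnames):
    """Serialize rows to CSV text, keeping only the given columns."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=fieldnames, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Made-up sample data, not taken from a real RYM export.
sample = [
    {"artist": "Artist A", "album": "First Album", "rating": "4.5"},
    {"artist": "Artist B", "album": "Second Album", "rating": ""},
    {"artist": "", "album": "Orphan Album", "rating": "3.0"},
]

rated, wishlisted = split_albums(sample)
print(to_csv(rated, ["artist", "album", "rating"]))
print(to_csv(wishlisted, ["artist", "album"]))
```

In this sketch the third sample entry is dropped because it has no artist, the first ends up in the rated output, and the second in the wishlisted output.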