https://github.com/tuvimen/imdb-scraper
A very simple shell script for scraping imdb in json
imdb json reliq scraper shell
- Host: GitHub
- URL: https://github.com/tuvimen/imdb-scraper
- Owner: TUVIMEN
- License: gpl-3.0
- Created: 2023-05-05T04:35:57.000Z (about 3 years ago)
- Default Branch: master
- Last Pushed: 2025-06-04T16:49:42.000Z (about 1 year ago)
- Last Synced: 2025-07-29T20:35:43.651Z (11 months ago)
- Topics: imdb, json, reliq, scraper, shell
- Language: Shell
- Homepage:
- Size: 17.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# imdb-scraper
A very simple shell script that scrapes IMDb and outputs JSON.
## Requirements
- [reliq](https://github.com/TUVIMEN/reliq)
- [jq](https://github.com/stedolan/jq)
- parallel (optional)
## Installation
    install -m 755 imdb-scraper /usr/bin
## JSON format
See [example.json](example.json) for a sample of the output.
## Usage
The script downloads all title pages listed in the sitemaps matching `/sitemap/title-[0-9]*.xml.gz` and writes the results to stdout.
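The sitemap-driven approach can be sketched roughly as follows. This is an illustration only, not the script's actual code (the script relies on reliq). It assumes the standard sitemap layout, in which each page URL sits inside a `<loc>` element. The helper name `sitemap_urls` and the sample file are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print every <loc> URL found in a gzipped sitemap file.
# Assumes the standard sitemap format; not taken from imdb-scraper itself.
sitemap_urls() {
    gzip -dc "$1" | grep -o '<loc>[^<]*</loc>' | sed 's/<[^>]*>//g'
}

# Build a tiny sample sitemap so the sketch runs offline.
tmp=$(mktemp -d)
cat > "$tmp/title-1.xml" <<'EOF'
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url><loc>https://www.imdb.com/title/tt0000001/</loc></url>
<url><loc>https://www.imdb.com/title/tt0000002/</loc></url>
</urlset>
EOF
gzip "$tmp/title-1.xml"

# Print one URL per line.
sitemap_urls "$tmp/title-1.xml.gz"
```

Each printed URL would then be fetched and converted to JSON by the scraper.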
To download in parallel, uncomment the three commented-out lines in the script and comment out the for loop.
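The difference between the two modes can be sketched as below. This is not the script's code: the worker is a hypothetical stand-in for the per-title download, the URLs are sample data, and `xargs -P` is used here in place of `parallel` so the sketch runs without extra dependencies.

```shell
#!/bin/sh
tmp=$(mktemp -d)

# Hypothetical stand-in for the per-title work; the real script
# downloads and parses each page instead of echoing the URL.
cat > "$tmp/worker.sh" <<'EOF'
printf '{"url":"%s"}\n' "$1"
EOF

# Sample input, one URL per line.
urls='https://www.imdb.com/title/tt0000001/
https://www.imdb.com/title/tt0000002/
https://www.imdb.com/title/tt0000003/'

# Sequential variant: a plain for loop, output order matches input order.
run_sequential() {
    for u in $urls; do
        sh "$tmp/worker.sh" "$u"
    done
}

# Parallel variant: up to 4 workers at once, output order is not guaranteed.
run_parallel() {
    printf '%s\n' "$urls" | xargs -n 1 -P 4 sh "$tmp/worker.sh"
}

run_sequential
run_parallel | sort
```

The parallel variant trades ordered output for throughput, so sort or key the results afterwards if order matters.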