https://github.com/tuvimen/imdb-scraper

A very simple shell script for scraping imdb in json
imdb json reliq scraper shell

Host: GitHub
URL: https://github.com/tuvimen/imdb-scraper
Owner: TUVIMEN
License: gpl-3.0
Created: 2023-05-05T04:35:57.000Z (about 3 years ago)
Default Branch: master
Last Pushed: 2025-06-04T16:49:42.000Z (about 1 year ago)
Last Synced: 2025-07-29T20:35:43.651Z (11 months ago)
Topics: imdb, json, reliq, scraper, shell
Language: Shell
Homepage:
Size: 17.6 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# imdb-scraper

A very simple shell script that scrapes IMDb and outputs JSON.

## Requirements

 - [reliq](https://github.com/TUVIMEN/reliq)

 - [jq](https://github.com/stedolan/jq)

 - parallel (optional)
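
A quick pre-flight check along these lines can confirm that the tools listed above are on `PATH` before running the script. This is only a minimal sketch: the helper name `check_deps` is hypothetical and is not part of the repository.

```shell
#!/bin/sh
# check_deps (hypothetical helper): report every command from the
# argument list that is not found on PATH; return 1 if any is missing.
check_deps() {
    missing=0
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            printf 'missing: %s\n' "$cmd" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `check_deps reliq jq || exit 1` stops early when a required tool is absent, while `check_deps parallel` can be used only as a warning since that dependency is optional.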

## Installation

    

    install -m 755 imdb-scraper /usr/bin

## JSON format

See [example.json](example.json) for a sample of the output.

## Usage

The script downloads every article listed in the `/sitemap/title-[0-9]*.xml.gz` sitemap files and writes the results to stdout.
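
To illustrate what a gzipped sitemap file contains, the sketch below pulls the `<loc>` URLs out of one. It is not the script's own method (which relies on reliq); it assumes the standard sitemap layout with one URL per `<loc>` element, and the function name `extract_locs` is hypothetical. `grep -o` is not required by POSIX but is available in GNU, BSD and BusyBox grep.

```shell
#!/bin/sh
# extract_locs (hypothetical helper): read a gzipped sitemap on stdin
# and print the contents of every <loc> element, one URL per line.
extract_locs() {
    gzip -dc \
        | grep -o '<loc>[^<]*</loc>' \
        | sed -e 's/^<loc>//' -e 's/<\/loc>$//'
}
```

With a sitemap saved locally under an assumed name such as `title-0.xml.gz`, running `extract_locs < title-0.xml.gz` lists the URLs it references.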

To download in parallel, uncomment the three commented-out lines in the script and comment out the `for` loop.
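
The difference between the two modes can be sketched generically as follows. This is not the code that is commented out in the script; `run_all` is a hypothetical helper, the job count of 4 is arbitrary, and the parallel branch assumes GNU parallel (other tools named `parallel` use a different syntax, so the sketch falls back to a plain loop when GNU parallel is not detected).

```shell
#!/bin/sh
# run_all (hypothetical helper): read one item per line on stdin and run
# the given command once per item, with the item appended as last argument.
run_all() {
    if parallel --version 2>/dev/null | grep -q 'GNU parallel'; then
        # -j 4: run up to four jobs at once; -k: keep output in input order
        parallel -k -j 4 "$@"
    else
        # Sequential fallback: one item at a time
        while IFS= read -r item; do
            "$@" "$item" </dev/null
        done
    fi
}
```

For example, `printf 'a\nb\n' | run_all echo fetched` runs `echo fetched a` and `echo fetched b`, concurrently when GNU parallel is available.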
