https://github.com/tuvimen/pornhub-scraper
A bash script for scraping pornhub in json
archiving bash json pornhub reliq scraping
Last synced: over 1 year ago
- Host: GitHub
- URL: https://github.com/tuvimen/pornhub-scraper
- Owner: TUVIMEN
- License: gpl-3.0
- Created: 2023-11-20T15:38:15.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2025-02-26T16:10:48.000Z (over 1 year ago)
- Last Synced: 2025-02-26T17:24:14.989Z (over 1 year ago)
- Topics: archiving, bash, json, pornhub, reliq, scraping
- Language: Shell
- Homepage:
- Size: 252 KB
- Stars: 1
- Watchers: 1
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# pornhub-scraper
A bash script for scraping pornhub in json.
## Requirements
- [reliq](https://github.com/TUVIMEN/reliq)
- [jq](https://github.com/stedolan/jq)
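
Before running the script it can help to confirm that both requirements are on `PATH`. The following is a minimal sketch, not part of the repository; the helper name `missing_deps` is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of pornhub-scraper):
# print every command from the argument list that is not found on PATH.
missing_deps() {
    local cmd
    for cmd in "$@"; do
        command -v "$cmd" >/dev/null 2>&1 || printf '%s\n' "$cmd"
    done
}

# Report any missing requirement on stderr; prints nothing if all are present.
missing_deps reliq jq | sed 's/^/missing dependency: /' >&2
```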
## Installation
    install -m 755 pornhub-scraper /usr/bin
## JSON format
Here are examples of a [pornstar](pornstar-example.json), [video](video-example.json), [model](model-example.json), [playlist](playlist-example.json) and [user](user-example.json).
## Usage
    pornhub-scraper [OPTION] [DIR]

The script collects links from the sitemap and downloads the specified pages, 4 in parallel by default. Each output file is named after the sha256sum of its URL.
Using the -S option is recommended to save time.
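
The naming scheme and the parallel download described above can be sketched as follows. This is an illustration under stated assumptions, not the script's actual code: the helpers `url_to_name` and `fetch_all` are hypothetical, the URL is assumed to be hashed without a trailing newline, and `curl` stands in for whatever downloader the script really uses.

```shell
#!/usr/bin/env bash
# Hypothetical helper: map a URL to a file name via its SHA-256 hex digest.
# Assumption: the URL is hashed without a trailing newline.
url_to_name() {
    printf '%s' "$1" | sha256sum | cut -d' ' -f1
}

# Hypothetical helper: read URLs from stdin (one per line) and fetch each
# into directory $1, running at most $2 downloads at once (default 4).
# Assumption: curl is used as the downloader; -r is a GNU xargs option.
fetch_all() {
    local dir=${1:-.} jobs=${2:-4}
    export -f url_to_name
    xargs -r -P "$jobs" -I{} bash -c \
        'curl -fsSL -o "$1/$(url_to_name "$2")" -- "$2"' _ "$dir" {}
}
```

With these helpers, `url_to_name URL` gives the name under which a page fetched from URL would be stored, which makes it possible to look up a saved page later without keeping a separate index.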
Download pornstars into DIR

    pornhub-scraper -p DIR

Download videos into the current directory using 8 threads

    pornhub-scraper -t 8 -v

Download models into the current directory

    pornhub-scraper -m

Download playlists into DIR

    pornhub-scraper -P DIR

Download users from URLs in FILE

    pornhub-scraper -u FILE

Get some help

    pornhub-scraper -h

## Results

2022-09-15

    3
    lzip -dc pornhub-pornstars.json | jq 'if (.info_pieces[] | .key=="Gender" and .value=="Trans Woman") then .name else null end | select(. != null)' -r | wc -l
    131
    lzip -dc pornhub-pornstars.json | jq 'if (.info_pieces[] | .key=="Gender" and .value=="Trans Man") then .name else null end | select(. != null)' -r | wc -l
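
The queries above count records whose `info_pieces` array contains a given `key`/`value` pair. A more compact way to express the same filter is sketched below, assuming jq 1.5 or newer (for `any/2`); the function name `count_by_info` is hypothetical, and the input is assumed to be a stream of JSON objects shaped like the pornstar example.

```shell
#!/usr/bin/env bash
# Hypothetical helper: count records on stdin that have an info_pieces
# entry with key $1 and value $2. Requires jq >= 1.5 for any/2.
count_by_info() {
    jq -r --arg k "$1" --arg v "$2" \
        'select(any(.info_pieces[]?; .key == $k and .value == $v)) | .name' |
        wc -l
}
```

It could be used in the same pipeline as the original commands, for example `lzip -dc pornhub-pornstars.json | count_by_info Gender "Trans Man"`. Unlike the `if ... then ... else null end` form, `select(any(...))` emits each record at most once, even if several `info_pieces` entries match.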