https://github.com/heckj/crawl2pdf

Tool to take a website and crawl it down, rendering each page into its own PDF file

Host: GitHub
URL: https://github.com/heckj/crawl2pdf
Owner: heckj
License: apache-2.0
Created: 2021-03-19T03:54:08.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2022-01-13T22:37:22.000Z (almost 3 years ago)
Last Synced: 2024-10-10T19:41:10.124Z (about 1 month ago)
Language: JavaScript
Size: 48.8 KB
Stars: 2
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# crawl2pdf

A tool with two commands.

`crawl` takes a URL and:

- crawls the site, rendering each page it finds into its own PDF

`render` takes either a URL or a file:

- with a URL, renders that page into a PDF
- with a file, renders each URL listed in the file into a PDF
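
The crawl step described above can be sketched as a breadth-first walk that stays on the starting site and visits each page once. This is a minimal illustration, not the code in this repository: `getLinks` and `renderPdf` are hypothetical callbacks (the first returns the links found on a page, the second writes one PDF for one URL), and dropping URL fragments before comparing pages is an assumption.

```javascript
// Breadth-first crawl restricted to the start URL's origin.
// `getLinks(url)` and `renderPdf(url)` are hypothetical async callbacks
// supplied by the caller; they are not part of crawl2pdf's documented API.
async function crawl(startUrl, getLinks, renderPdf) {
  const origin = new URL(startUrl).origin;
  const start = normalize(startUrl);
  const visited = new Set([start]);
  const queue = [start];

  while (queue.length > 0) {
    const url = queue.shift();
    await renderPdf(url); // one PDF per visited page

    for (const href of await getLinks(url)) {
      let next;
      try {
        next = normalize(new URL(href, url).href); // resolve relative links
      } catch {
        continue; // skip hrefs that are not valid URLs
      }
      // Stay on the same site and never queue a page twice.
      if (new URL(next).origin === origin && !visited.has(next)) {
        visited.add(next);
        queue.push(next);
      }
    }
  }
  return [...visited];
}

// Drop the fragment so "/a" and "/a#top" count as the same page (an assumption).
function normalize(url) {
  const u = new URL(url);
  u.hash = "";
  return u.href;
}
```

In a Playwright-based tool, `renderPdf` would presumably open the URL with `page.goto(url)` and save it with `page.pdf({ path })`, and `getLinks` would collect the `href` values of the page's anchors; how crawl2pdf actually does this is not stated in this README.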

To use:

    npm install
    node crawl2pdf

    crawl2pdf [command]

    Commands:
      crawl2pdf render [-u]|[-f]  render the page into a PDF
      crawl2pdf crawl [-a] [-u]   starting crawling a site and rendering PDFs for
                                  each page

    Options:
          --version   Show version number                              [boolean]
      -u, --url       URL to load
      -a, --articles  Only render articles into PDFs
      -f, --file      file with URLS to load
      -h, --help      Show help                                        [boolean]
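
The `-f` option takes a file of URLs, and each URL ends up as its own PDF. The sketch below shows one way such a file could be read and how an output name could be derived per URL. Both the file format (one URL per line, with blank lines and `#` comment lines skipped) and the naming scheme are assumptions made for illustration; `parseUrlList` and `pdfNameFor` are hypothetical helpers, not functions from this repository.

```javascript
// Assumption: the file passed to -f holds one URL per line.
// Blank lines and lines starting with "#" are skipped (also an assumption).
function parseUrlList(text) {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line !== "" && !line.startsWith("#"));
}

// Hypothetical naming scheme: host plus path, with every run of
// characters outside [A-Za-z0-9.-] replaced by a single "_".
function pdfNameFor(url) {
  const { hostname, pathname } = new URL(url);
  const slug = (hostname + pathname)
    .replace(/\/+$/, "") // drop trailing slashes
    .replace(/[^A-Za-z0-9.-]+/g, "_");
  return `${slug}.pdf`;
}
```

With this scheme, `pdfNameFor("https://example.com/docs/intro")` yields `example.com_docs_intro.pdf`. URLs that differ only in their query string would map to the same name, so a real tool would need a different rule if that case matters.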

## DEBUG MODE

Set `DEBUG=pw:api` before a command to make Playwright print its API debugging output, for example `DEBUG=pw:api node crawl2pdf crawl -u <url>`.