Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/heckj/crawl2pdf

Tool to take a website and crawl it down, rendering each page into its own PDF file
https://github.com/heckj/crawl2pdf

Last synced: 15 days ago
JSON representation

Tool to take a website and crawl it down, rendering each page into its own PDF file

Awesome Lists containing this project

README

        

# crawl2pdf

A tool that either takes a command `crawl` and a URL:

- crawls the site, rendering the URLS into PDF pages

or takes a command `render`, and either a file or a url:

- renders the URL into a PDF
- and renders the URLS in that file into PDF

To use:

npm install
node crawl2pdf

crawl2pdf [command]

Commands:
crawl2pdf render [-u]|[-f] render the page into a PDF
crawl2pdf crawl [-a] [-u] starting crawling a site and rendering PDFs for
each page

Options:
--version Show version number [boolean]
-u, --url URL to load
-a, --articles Only render articles into PDFs
-f, --file file with URLS to load
-h, --help Show help [boolean]

Copyright 2021, thanks Playwright!

## DEBUG MODE

use `DEBUG=pw:api` before a command to generate Playwright debugging output