Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/heckj/crawl2pdf
Tool to take a website and crawl it down, rendering each page into its own PDF file
https://github.com/heckj/crawl2pdf
Last synced: 15 days ago
JSON representation
Tool to take a website and crawl it down, rendering each page into its own PDF file
- Host: GitHub
- URL: https://github.com/heckj/crawl2pdf
- Owner: heckj
- License: apache-2.0
- Created: 2021-03-19T03:54:08.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-01-13T22:37:22.000Z (almost 3 years ago)
- Last Synced: 2024-10-10T19:41:10.124Z (about 1 month ago)
- Language: JavaScript
- Size: 48.8 KB
- Stars: 2
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# crawl2pdf
A tool that either takes a command `crawl` and a URL:
- crawls the site, rendering the URLS into PDF pages
or takes a command `render`, and either a file or a url:
- renders the URL into a PDF
- and renders the URLS in that file into PDFTo use:
npm install
node crawl2pdfcrawl2pdf [command]
Commands:
crawl2pdf render [-u]|[-f] render the page into a PDF
crawl2pdf crawl [-a] [-u] starting crawling a site and rendering PDFs for
each pageOptions:
--version Show version number [boolean]
-u, --url URL to load
-a, --articles Only render articles into PDFs
-f, --file file with URLS to load
-h, --help Show help [boolean]Copyright 2021, thanks Playwright!
## DEBUG MODE
use `DEBUG=pw:api` before a command to generate Playwright debugging output