
https://github.com/binki/rescrape-save-page-we

Post-process files from Save Page WE extension to download links to wp-content/uploads




This takes HTML files which were downloaded with Save Page WE and tries to fix up links to external resources by downloading them.
It is currently hardcoded to support downloading from a WordPress installation by limiting the links to `wp-content/uploads` (this would need to be changed to support general scenarios).
It outputs the modified HTML (which is updated to refer to the downloaded files) to `original-name.offline.html`.
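
The flow described above can be sketched roughly as follows. This is only an illustration in Python, not the project's actual code: the names `offline_name`, `local_name`, `rewrite_links`, and `download`, the `uploads/` output folder, and the URL-matching regular expression are all assumptions made for this example.

```python
import os
import re
import urllib.request
from pathlib import Path
from urllib.parse import urlparse

# Assumption: only absolute URLs under wp-content/uploads are rewritten,
# mirroring the hardcoded WordPress restriction described above.
UPLOADS_URL = re.compile(r'https?://[^\s"\'<>]*?/wp-content/uploads/[^\s"\'<>,]+')


def offline_name(html_path):
    """Map original-name.html to original-name.offline.html."""
    p = Path(html_path)
    return p.with_name(p.stem + ".offline.html")


def local_name(url):
    """Pick a relative local path for a URL (hypothetical layout)."""
    rel = urlparse(url).path.split("/wp-content/uploads/", 1)[1]
    return "uploads/" + rel


def download(url, dest):
    """Fetch url into dest, creating parent folders as needed."""
    os.makedirs(os.path.dirname(dest) or ".", exist_ok=True)
    urllib.request.urlretrieve(url, dest)


def rewrite_links(html, fetch=download):
    """Download each matching URL once and point the HTML at the local copy."""
    seen = {}

    def replace(match):
        url = match.group(0)
        if url not in seen:
            seen[url] = local_name(url)
            fetch(url, seen[url])
        return seen[url]

    return UPLOADS_URL.sub(replace, html)
```

Because the pattern stops at whitespace, it also picks up each URL inside a `srcset` attribute. Writing the result of `rewrite_links` to the path returned by `offline_name` would complete the sketch.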

I was using Save Page WE 9.9 at the time.
It seems like it failed to save `` tags using srcsets or maybe had issues because the DHS Band website I was downloading contained ``.