https://github.com/binki/rescrape-save-page-we
Post-process files from Save Page WE extension to download links to wp-content/uploads
https://github.com/binki/rescrape-save-page-we
Last synced: 3 days ago
JSON representation
Post-process files from Save Page WE extension to download links to wp-content/uploads
- Host: GitHub
- URL: https://github.com/binki/rescrape-save-page-we
- Owner: binki
- License: mit
- Created: 2018-05-21T14:48:40.000Z (about 8 years ago)
- Default Branch: master
- Last Pushed: 2018-05-21T19:27:10.000Z (about 8 years ago)
- Last Synced: 2025-11-23T14:25:42.220Z (7 months ago)
- Language: JavaScript
- Size: 5.86 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
This takes HTML files which were downloaded with Save Page WE and tries to fix up links to external resources by downloading them.
It is currently hardcoded to support downloading from a WordPress installation by limiting the links to `wp-content/uploads` (this would need to be changed to support general scenarios).
It outputs the modified HTML (which is updated to refer to the downloaded files) to `original-name.offline.html`.
I was using Save Page WE 9.9 at the time.
It seems like it failed to save `
` tags using srcsets or maybe had issues because the DHS Band website I was downloading contained ``.