Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/steffenfritz/html2warc
simple script to convert web resources to a single warc file
https://github.com/steffenfritz/html2warc
Last synced: 26 days ago
JSON representation
simple script to convert web resources to a single warc file
- Host: GitHub
- URL: https://github.com/steffenfritz/html2warc
- Owner: steffenfritz
- License: mit
- Fork: true (ampoffcom/html2warc)
- Created: 2015-12-30T14:29:32.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2023-05-11T18:17:59.000Z (over 1 year ago)
- Last Synced: 2024-08-03T14:05:45.070Z (4 months ago)
- Language: Python
- Size: 10.7 KB
- Stars: 18
- Watchers: 4
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: license.txt
Awesome Lists containing this project
- awesome-datahoarding - html2warc
- awesome-datahoarder - html2warc
README
# html2warc
A simple script to convert offline data into a warc file# Usage
python html2warc.py $TARGET_URI $SOURCE_DIR $TARGET_WARC