https://github.com/webrecorder/har2warc
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
https://github.com/webrecorder/har2warc
Last synced: 7 months ago
JSON representation
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
- Host: GitHub
- URL: https://github.com/webrecorder/har2warc
- Owner: webrecorder
- License: apache-2.0
- Created: 2017-03-16T00:14:15.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-10-21T05:01:44.000Z (about 7 years ago)
- Last Synced: 2025-04-05T20:22:13.420Z (8 months ago)
- Language: Python
- Homepage: https://pypi.python.org/pypi/har2warc
- Size: 635 KB
- Stars: 51
- Watchers: 6
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README.rst
- License: LICENSE
Awesome Lists containing this project
- webarchiving-awesome-graph - har2warc - Convert HTTP Archive (HAR) -> Web Archive (WARC) format (Python). 💽 ⭐ 54 👀 6 (Tools & Software / Utilities)
- awesome-web-archiving - har2warc - Convert HTTP Archive (HAR) -> Web Archive (WARC) format (Python). (Tools & Software / Utilities)
- osint_stuff_tool_collection - har2warc - > Web Archive (WARC) format| ([](#table-of-contents) Table of contents / [](#warc)Tools for working with WARC (WebARChive) files)
README
har2warc
========
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
``pip install har2warc``
Command-Line Usage
~~~~~~~~~~~~~~~~~~
``har2warc ``
Libary Usage
~~~~~~~~~~~~
har2warc can be used as a python library.
Simple usage similar to CLI interface:
.. code:: python
from har2warc.har2warc import har2warc
har2warc('input.har', 'output.warc.gz')
Also supports reading and writing from buffers:
.. code:: python
from har2warc.har2warc import har2warc
har = json.loads(...)
with open('output.warc.gz', 'w+b') as warc:
har2warc(har, warc)
# READ WARC
warc.seek(0)
warc.read()