Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with webarchive

A curated list of projects in awesome lists tagged with webarchive .

https://github.com/helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

archivespark internet-archive spark spark-framework warc web-archiving webarchive

Last synced: 01 Aug 2024

https://github.com/karust/gogetcrawl

Extract web archive data using Wayback Machine and Common Crawl

commoncrawl concurrency crawler golang wayback-machine webarchive

Last synced: 01 Aug 2024

https://github.com/N0taN3rd/node-warc

Parse And Create Web ARChive (WARC) files with node.js

chrome-remote-interface pupeteer warc warc-files web-archives web-archiving webarchive webarchiving

Last synced: 17 Aug 2024

https://github.com/ibnesayeed/archival-tests

A set of web archival replay test cases

archival-replay memento replay-tests testing webarchive webarchiving

Last synced: 01 Oct 2024

https://github.com/maxmmueller/404-to-archive-redirector

Greasemonkey script that redirects from a 404 page to the Wayback Machine.

404-redirect greasemonkey javascript tampermonkey webarchive

Last synced: 01 Oct 2024