An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with archivebox

A curated list of projects in awesome lists tagged with archivebox .

https://github.com/archivebox/archivebox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 09 Sep 2025

https://github.com/pirate/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 01 Apr 2025

https://github.com/ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 13 Mar 2025

https://github.com/archivebox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox archivewarrior boinc distributed-computing distributed-storage docker docker-compose foldingathome good-karma i2p internet-archiving ipfs kiwix pywb sia storj tor zimfarm

Last synced: 06 Apr 2025

https://github.com/ArchiveBox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox archivewarrior boinc distributed-computing distributed-storage docker docker-compose foldingathome good-karma i2p internet-archiving ipfs kiwix pywb sia storj tor zimfarm

Last synced: 07 Apr 2025

https://github.com/archivebox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox archiving browser-extension chrome-extension digipres digital-preservation firefox-extension internet-archiving svelte web-archiving

Last synced: 07 Jul 2025

https://github.com/ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox archiving browser-extension chrome-extension digipres digital-preservation firefox-extension internet-archiving svelte web-archiving

Last synced: 03 Apr 2025

https://github.com/archivebox/electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

archivebox desktop desktop-electron digipres docker electron gui internet-archiving linux macos web-archiving windows

Last synced: 05 Aug 2025

https://github.com/ArchiveBox/electron-archivebox

Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)

archivebox desktop desktop-electron digipres docker electron gui internet-archiving linux macos web-archiving windows

Last synced: 14 Mar 2025

https://github.com/archivebox/abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

ai-scraping archivebox chrome cli cli-tool crawling curl downloader gallery-dl headless http-client internet-archiving playwright puppeteer scraping wget youtube-dl yt-dlp

Last synced: 17 Mar 2025

https://github.com/pirate/internet-archiving-talk

🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

archivebox censorship ethics internet-archiving slideshow talks warc web-archiving wget

Last synced: 24 Mar 2025

https://github.com/archivebox/readability-extractor

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

archivebox internet-archiving node readability wrapper

Last synced: 07 Jul 2025

https://github.com/ArchiveBox/readability-extractor

Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

archivebox internet-archiving node readability wrapper

Last synced: 06 Apr 2025

https://github.com/ArchiveBox/pocket-exporter

A service to help export your pocket bookmarks, tags, saved article text, and more...

archivebox archiving bookmarks getpocket html internet-archiving pocket urls web-archiving

Last synced: 19 Aug 2025

https://github.com/archivebox/archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

archivebox digipres digital-preservation https-proxy internet-archiving mitmproxy proxy web-archiving web-proxy

Last synced: 07 Jul 2025

https://github.com/uriel1998/muna

Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from ArchiveBox.

archivebox bash url urls wayback-machine

Last synced: 14 Apr 2025

https://github.com/dbeley/reddit_export_userdata

Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.

archivebox archiving reddit reddit-scraper

Last synced: 29 Apr 2025

https://github.com/yunohost-apps/archivebox_ynh

Self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline, for YunoHost.

archivebox yunohost yunohost-apps

Last synced: 23 Apr 2025

https://github.com/archivebox/docs

Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

archivebox cli community digipres documentation internet-archiving python rest sphinx ui usage web-archiving wiki

Last synced: 07 Jul 2025

https://github.com/dbeley/archiveboxmatic

ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.

archivebox archiving web-archiving

Last synced: 29 Apr 2025

https://github.com/archivebox/pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

archivebox digipres internet-archiving pip pypi python sdist setuptools web-archiving wheel

Last synced: 07 Jul 2025

https://github.com/archivebox/abx-spec-behaviors

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

abx archivebox automation behaviors browser browsertrix-behaviors claude computer-use crawling digipres ecosystem greasemonkey playwright plugins puppeteer rfp scraping specification tampermonkey tool-use

Last synced: 01 Sep 2025

https://github.com/archivebox/community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

archivebox archiving digipres internet preservation webarchiving

Last synced: 08 Sep 2025

https://github.com/thelovinator1/feedvault.se

FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.

archive archivebox atom-feed backup feed-archive hacktoberfest internet-archive internet-archiving rss rss-aggregator rss-archive wayback-machine

Last synced: 02 Sep 2025

https://github.com/archivebox/pocket-exporter

A service to help export your pocket bookmarks, tags, saved article text, and more...

archivebox archiving bookmarks getpocket html internet-archiving pocket urls web-archiving

Last synced: 03 Jul 2025