An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by ArchiveBox

A curated list of projects in awesome lists by ArchiveBox.

https://github.com/archivebox/archivebox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 09 Sep 2025

https://github.com/pirate/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 01 Apr 2025

https://github.com/ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 13 Mar 2025

https://github.com/archivebox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox archivewarrior boinc distributed-computing distributed-storage docker docker-compose foldingathome good-karma i2p internet-archiving ipfs kiwix pywb sia storj tor zimfarm

Last synced: 06 Apr 2025

https://github.com/ArchiveBox/good-karma-kit

😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

archivebox archivewarrior boinc distributed-computing distributed-storage docker docker-compose foldingathome good-karma i2p internet-archiving ipfs kiwix pywb sia storj tor zimfarm

Last synced: 07 Apr 2025

https://github.com/archivebox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox archiving browser-extension chrome-extension digipres digital-preservation firefox-extension internet-archiving svelte web-archiving

Last synced: 07 Jul 2025

https://github.com/ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox archiving browser-extension chrome-extension digipres digital-preservation firefox-extension internet-archiving svelte web-archiving

Last synced: 03 Apr 2025

https://github.com/archivebox/electron-archivebox

Desktop Electron app for the ArchiveBox internet archiver. (ALPHA: not ready for general use)

archivebox desktop desktop-electron digipres docker electron gui internet-archiving linux macos web-archiving windows

Last synced: 05 Aug 2025

https://github.com/ArchiveBox/electron-archivebox

Desktop Electron app for the ArchiveBox internet archiver. (ALPHA: not ready for general use)

archivebox desktop desktop-electron digipres docker electron gui internet-archiving linux macos web-archiving windows

Last synced: 14 Mar 2025

https://github.com/archivebox/abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

ai-scraping archivebox chrome cli cli-tool crawling curl downloader gallery-dl headless http-client internet-archiving playwright puppeteer scraping wget youtube-dl yt-dlp

Last synced: 15 Mar 2026

https://github.com/archivebox/readability-extractor

JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

archivebox internet-archiving node readability wrapper

Last synced: 07 Jul 2025

https://github.com/ArchiveBox/readability-extractor

JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

archivebox internet-archiving node readability wrapper

Last synced: 06 Apr 2025

https://github.com/ArchiveBox/pocket-exporter

A service to help export your Pocket bookmarks, tags, saved article text, and more...

archivebox archiving bookmarks getpocket html internet-archiving pocket urls web-archiving

Last synced: 19 Aug 2025

https://github.com/archivebox/archivebox-proxy

Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

archivebox digipres digital-preservation https-proxy internet-archiving mitmproxy proxy web-archiving web-proxy

Last synced: 07 Jul 2025

https://github.com/archivebox/digestbox

DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

archivebox backups digipres headless-browser internet-archiving warc web-archiving

Last synced: 01 Feb 2026

https://github.com/archivebox/abx-pkg

📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

apt brew dependencies dependencies-checking dependency-manager django dpkg js npm package-managers packages pip pydantic python

Last synced: 01 Apr 2026

https://github.com/ArchiveBox/abx-pkg

📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

apt brew dependencies dependencies-checking dependency-manager django dpkg js npm package-managers packages pip pydantic python

Last synced: 14 Aug 2025

https://github.com/archivebox/docs

Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.

archivebox cli community digipres documentation internet-archiving python rest sphinx ui usage web-archiving wiki

Last synced: 07 Jul 2025

https://github.com/archivebox/pip-archivebox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

archivebox digipres internet-archiving pip pypi python sdist setuptools web-archiving wheel

Last synced: 07 Jul 2025

https://github.com/archivebox/abx-spec-behaviors

🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

abx archivebox automation behaviors browser browsertrix-behaviors claude computer-use crawling digipres ecosystem greasemonkey playwright plugins puppeteer rfp scraping specification tampermonkey tool-use

Last synced: 01 Sep 2025

https://github.com/archivebox/community

A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

archivebox archiving digipres internet preservation webarchiving

Last synced: 14 Feb 2026

https://github.com/archivebox/pocket-exporter

A service to help export your Pocket bookmarks, tags, saved article text, and more...

archivebox archiving bookmarks getpocket html internet-archiving pocket urls web-archiving

Last synced: 03 Jul 2025