Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/VIDA-NYU/reprozip

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

archiving computational-science docker hacktoberfest linux nyu ptrace python reproducibility reproducible-research reproducible-science reprounzip reprozip science scientific-computing vagrant

Last synced: 02 Jul 2024

https://github.com/bandundu/email-archiver

Email archiving tool for IMAP/POP3 accounts (early development)

archiving email flask imap pop react

Last synced: 01 Jul 2024

https://github.com/all-the-data/awesome-data-hoarding

How to save everything online. Tools for for scraping, saving, downloading, hoarding, archiving, etc.

archiving awesome awesome-list data-hoarder hoarding reddit reddit-downloader

Last synced: 24 Jun 2024

https://xwmx.github.io/nb/

CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.

archiving bash bookmark-manager bookmarks cli command-line git knowledge-base markdown note-taking notebook notes notes-app pandoc productivity shell sync vim vscode zettelkasten

Last synced: 18 Jun 2024

https://github.com/plougher/squashfs-tools

tools to create and extract Squashfs filesystems

archiving compression filesystem linux lzo mkfs mksquashfs squashfs squashfs-image tar tarball zstd

Last synced: 17 Jun 2024

https://github.com/palewire/archiveis

A simple Python wrapper for the archive.is capturing service

api archiving news python

Last synced: 16 Jun 2024

https://github.com/dbeley/archiveboxmatic

ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.

archivebox archiving web-archiving

Last synced: 16 Jun 2024

https://github.com/mhx/dwarfs

A fast high compression read-only file system for Linux, Windows and macOS

archiving compression cpp deduplication dwarfs filesystem flac fuse fuse-filesystem gpl-license linux lrzip lzma macfuse macos squashfs windows winfsp zpaq zstd

Last synced: 11 Jun 2024

https://github.com/bottomless-archive-project/library-of-alexandria

Library of Alexandria (LoA in short) is a project that aims to collect and archive documents from the internet.

archiving library-of-alexandria

Last synced: 10 Jun 2024

https://github.com/itext/itext-dotnet

iText for .NET is the .NET version of the iText library, formerly known as iTextSharp, which it replaces. iText represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enha

accessibility acroform archiving ccpa digital-signature documents encryption gdpr itext itextsharp library pades pdf pdf-generation pdfa pdfua sdk security svg xfdf

Last synced: 09 Jun 2024

https://github.com/JosephLai241/URS

Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.

archiving command-line comments csv data-analysis data-science json livestream osint-tool praw pyo3 python reddit reddit-scraper redditor rust scraper subreddit trees wordcloud

Last synced: 08 Jun 2024

https://github.com/itext/itext-java

iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.

accessibility acroform archiving ccpa digital-signature documents encryption gdpr itext library pades pades-standard pdf pdf-generation pdfa pdfua sdk security svg xfdf

Last synced: 06 Jun 2024

https://xwmx.github.io/nb

CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.

archiving bash bookmark-manager bookmarks cli command-line git knowledge-base markdown note-taking notebook notes notes-app pandoc productivity shell sync vim vscode zettelkasten

Last synced: 04 Jun 2024

https://github.com/cleidigh/Message-archive-options-TB

Thunderbird Add-On Adds the ability to customize the date format of the archive folders created by TB Archive Messages

archiving javascript thunderbird thunderbird-addon thunderbird-extension

Last synced: 01 Jun 2024

https://github.com/ede1998/autoarchive

Add an archiving button to thunderbird.

addon archiving mail thunderbird

Last synced: 01 Jun 2024

https://github.com/google/fuse-archive

FUSE file system for archives and compressed files (ZIP, RAR, 7Z, ISO, TGZ, XZ...)

archiving filesystem fuse-filesystem zip

Last synced: 31 May 2024

https://github.com/gwern/gwern.net

Site infrastructure for gwern.net (CSS/JS/HS/images/icons). Custom Hakyll website with unique automatic link archiving, recursive tooltip popup UX, dark mode, and typography (sidenotes+dropcaps+admonitions+inflation-adjuster).

admonitions archiving gwern hakyll inflation pandoc sidenotes tooltips typography wikipedia

Last synced: 30 May 2024

https://github.com/shukriadams/browsemonkey

Takes snapshots of file systems for offline browsing and searching.

archiving datahoarder takes-snapshots util

Last synced: 24 May 2024

https://github.com/ropensci/arkdb

Archive and unarchive databases as flat text files

archiving database dbi peer-reviewed r r-package rstats

Last synced: 20 May 2024

https://github.com/Kovah/LinkAce

LinkAce is a self-hosted archive to collect links of your favorite websites.

archive archiving bookmark-manager bookmark-managers bookmarking bookmarks docker laravel php self-hosted selfhosted

Last synced: 19 May 2024

https://github.com/xwmx/nb

CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.

archiving bash bookmark-manager bookmarks cli command-line git knowledge-base markdown note-taking notebook notes notes-app pandoc productivity shell sync vim vscode zettelkasten

Last synced: 16 May 2024

https://github.com/grawity/irc-docs

Collected IRC protocol documentation

archiving history irc

Last synced: 15 May 2024

https://github.com/ArchiveTeam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

archiveteam archiving crawl crawler crawlers crawling downloader ftp lua scraper scraping spider warc webarchiving wget wget-lua zstd

Last synced: 11 May 2024

https://github.com/alopezrivera/anchorage

Save your bookmark collection in the Internet Archive, or locally.

archiving internet-archive permanence web

Last synced: 10 May 2024

https://github.com/o0101/Blogfolio

:pencil2: Blogfolio of my creative work in open-source software engineering, 2017 to present.

archiving bepis blog bona-fides browser cryptography github-stars portfolio rbi selector-generalization types utilities

Last synced: 10 May 2024

https://github.com/Ouranosinc/miranda

A modern Python utility library for climate data collection and management

archiving climate collection management netcdf

Last synced: 09 May 2024

https://github.com/the-paperless-project/paperless

Scan, index, and archive all of your paper documents

archiving documents ocr paper search

Last synced: 07 May 2024

https://github.com/peterk/warcworker

A dockerized, queued high fidelity web archiver based on Squidwarc

archiving high-fidelity-preservation preservation webarchives webarchiving

Last synced: 07 May 2024

https://github.com/ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

archiving crawl crawler spider warc

Last synced: 06 May 2024

https://github.com/promyloph/crocoite

Web archiving using Google Chrome

archiving chrome-browser devtools warc

Last synced: 06 May 2024

https://github.com/wapmorgan/UnifiedArchive

UnifiedArchive - an archive manager with unified interface for different formats (bundled with cli utility). Supports all formats with basic operations (reading, extracting and creation) and popular formats specific features (compression level, password-protection, comment)

7zip archives archiving bz2 bzip2 cab gzip iso lzma2 manipulate-archives rar tar zip

Last synced: 05 May 2024

https://github.com/ruarxive/ydiskarc

command-line tool to backup public resources from Yandex.disk (disk.yandex.ru / yadi.sk) filestorage service

archiving backup yandex yandex-api yandex-disk

Last synced: 04 May 2024

https://github.com/gdamdam/iagitup

A command line tool to archive a git repository from GitHub to the Internet Archive.

archive archiving cli git github internet-archive internetarchive

Last synced: 04 May 2024

https://github.com/onox/dcf-ada

An Ada 2012 library for document container files

ada archiving dcf dcf-ada zip

Last synced: 02 May 2024

https://github.com/libarchive/libarchive

Multi-format archive and compression library

7zip archiving backup bz2 cab cpio gz iso9660 libarchive lzma posix rar tar xar xz zip zstd

Last synced: 01 May 2024

https://github.com/palewire/savemy.news

Save My News: A personal, permanent clipping service

archiving django journalism news python

Last synced: 26 Apr 2024

https://github.com/palewire/storytracker

Tools for tracking stories on news homepages

archiving journalism python

Last synced: 26 Apr 2024

https://github.com/palewire/savepagenow

A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service

api archiving command-line-interface internetarchive news python

Last synced: 26 Apr 2024

https://github.com/wabarc/rivet

A toolkit makes it easier to archive webpages to IPFS

archiving ipfs webpage

Last synced: 25 Apr 2024

https://github.com/webrecorder/archiveweb.page

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

archiving chromium extension wacz web-archiving webrecorder

Last synced: 25 Apr 2024

https://github.com/paperless-ngx/paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

angular archiving django dms document-management document-management-system machine-learning ocr optical-character-recognition pdf

Last synced: 22 Apr 2024

https://github.com/webrecorder/browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

archiving cloud kubernetes wacz warc web-archive web-archiving webrecorder

Last synced: 21 Apr 2024

https://github.com/chronicle-app/chronicle-etl

📜 A CLI toolkit for extracting and working with your digital history

archiving chronicle chronicle-etl cli csv data-liberation etl json memex personal-archive personal-data quantified-self ruby

Last synced: 20 Apr 2024

https://github.com/palewire/wordpress-memento-plugin

A plugin for Wordpress web sites to enable the Memento framework for time-based access

archiving memento php wordpress

Last synced: 18 Apr 2024

https://github.com/palewire/django-memento-framework

A set for helpers for Django web sites to enable the Memento framework for time-based access

archiving django memento python

Last synced: 18 Apr 2024

https://github.com/jonaswinkler/paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents

angular archiving django dms document-management-system full-text-search machine-learning ocr search

Last synced: 16 Apr 2024

https://github.com/ArchiveBox/archivebox-browser-extension

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox archiving browser-extension chrome-extension digipres digital-preservation firefox-extension internet-archiving svelte web-archiving

Last synced: 15 Apr 2024

https://github.com/mhucka/devonthink-hacks

Scripts and other things for working with DEVONthink, a personal information management system.

applescript archiving automation devonthink information-gathering information-management pdf-generation

Last synced: 09 Apr 2024

https://github.com/pirate/wikipedia-mirror

🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump

archiving datascience docker docker-compose html internet-archiving kiwix kiwix-offline-wikipedia mediawiki mwdumper nginx openzim wiki wikipedia wikipedia-dump wikipedia-mirror xowa zim

Last synced: 31 Mar 2024

https://github.com/ArchiveTeam/ArchiveBot

ArchiveBot, an IRC bot for archiving websites

archiving haxe irc javascript python ruby

Last synced: 30 Mar 2024

https://github.com/janw/podcast-archiver

Archive all your favorite podcasts

archiving feedparser podcast podcasts python rss

Last synced: 30 Mar 2024

https://github.com/postgrespro/pg_probackup

Backup and recovery manager for PostgreSQL

archiving backup incremental-backups postgresql recovery restore wall

Last synced: 25 Mar 2024

https://github.com/jupyterlab-contrib/jupyter-archive

A Jupyter/Jupyterlab extension to make, download and extract archive files.

archiving jupyterlab jupyterlab-extension jupyterlab-extensions

Last synced: 18 Mar 2024

https://github.com/bareos/bareos

Bareos is a cross-network Open Source backup solution (licensed under AGPLv3) which preserves, archives, and recovers data from all major operating systems.

archiving backup backup-solution backup-utility bareos ceph compression cross-platform disaster-recovery encrypt gluster mysql postgresql python recover restore s3 security vmware

Last synced: 17 Mar 2024

https://github.com/vdbsh/backy

tiny multiprocessing utility for file backups

archiving backup bsd bzip2 cli golang linux macos rsync synchronization tar

Last synced: 15 Mar 2024