Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-datahoarding
List of data-hoarding related tools
https://github.com/simon987/awesome-datahoarding
- ArchiveBox - hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
- Browsertrix Crawler - based high-fidelity crawling system, designed to run a complex, customizable browser-based crawl in a single Docker container
- Collect
- grab-site
- Heritrix - scale, archival-quality web crawler
- HTTrack
- wail - Click User Instigated Preservation
- webrecorder - fidelity, ISO-compliant web archives in a user-friendly interface, providing access to archived content, and sharing collections
- wikiteam
- annie - DL alternative written in Golang
- aria2 - protocol & multi-source command-line download utility
- CrowLeer
- curl
- Horahora
- httpie
- news-crawl
- Plowshare - line tool to manage file-sharing site
- Rclone
- rsync
- Suck-It - threaded)
- wget - interactive download of files from the Web
- wget2 - threaded
- wpull - compatible web downloader and crawler
- you-get
- ytdl-sub
- yt-dlp - DLP that behaves better
- BBCSoundDownloader
- ChanThreadWatch - style boards and checks for updates until the thread dies
- comics-downloader - line tool to download comicsand manga in pdf/epub/cbz/cbr from supported sites
- floatplane_ripper
- gallery-dl
- Discord-Channel-Scraper
- dzi-dl
- FanFicFare
- FicSave
- flickr_download
- Google Images Download
- iiif-dl - line tile downloader/assembler for IIIF endpoints/manifests
- imgbrd-grabber
- instaloader
- InstaLooter - less Instagram pictures and videos downloader.
- Instagram Scraper - scraper is a command-line application written in Python that scrapes and downloads an instagram user's photos and videos. Use responsibly.
- PyInstaLive
- RedditDownloader
- Scribd-Downloader
- snscrape
- RipMe
- Tube Archivist - Hosted Docker container for automated/scheduled YouTube downloads of channels, playlists, etc.
- tumblr-utils
- yt-mango
- Youtube-MA
- bazarr
- FlexGet
- Jackett
- Lidarr
- Mylar
- Sick-Beard
- Radarr
- Sonarr
- BorgBackup
- 7-Zip
- KGB Archiver
- peazip
- PIGZ - threaded gzip
- WinRAR
- NetLimiter
- httpdirfs
- mergerfs
- NTFS drivers for MacOS
- AAXtoMP3
- html2warc
- warcat
- Backblaze B2 sync backup script
- flac2mp3_V0.py - threaded python script to convert all flac files to mp3 V0 while keeping the directory structure
- Misc download scripts
- TheFrenchGhosty's Ultimate YouTube-DL Scripts Collection - dl scripts to aid in YouTube channel archival
- rclone_dirsize
- rm_empty_subdir - directories on Windows
- void-cat-uploader
- youtube-dl_soundcloud - dl to download soundcloud playlists
- h5ai
- ipfs - addressable, peer-to-peer method of storing and sharing hypermedia in a distributed file system
- opds
- Syncthing
- baobab
- beets
- browsemonkey
- Calibre
- DataCurator-Filetree
- DeepSort
- diskover
- Everything
- FileBot
- fucking-weeb
- grepWin
- Hydrus
- Kiwix
- jdupes
- MediaElch
- MediaInfo
- Mp3tag - to-use tool to edit metadata of audio files (Windows/Mac)
- phockup
- picard
- TeraCopy
- tree
- WinDirStat
- WizTree
- sist2 - fast file system indexer and search tool
- SyncToy
- VisiPics
- iqdb - service reverse image search
- thetvdb
- CrystalDiskInfo
- GSmartControl - OS S.M.A.R.T. utility with an easy to understand graphical interface
- Hard Drive Sentinel - OS SSD and HDD monitoring and analysis software
- smartmontools
- PhotoRec
- TestDisk
- whipper - DA ripper preferring accuracy over speed. Generates .flac, .cue, and .log by default and automatically fetches metadata from musicbrainz. EAC log plugin is available.
- Exact Audio Copy
- MakeMKV - platform DVD ripper that supports recent blu ray DVDs. It's mostly open source, but the blu ray secret sauce is still hidden
- Handbrake
- CommonCrawl
- Blockyarchive
- par2cmdline
Keywords
python
17
downloader
10
crawler
5
cli
4
linux
4
warc
4
instagram
4
ebook
3
tumblr
3
torrent
3
windows
3
web-archiving
3
radarr
3
sonarr
3
youtube
3
epub
3
backup
3
scraper
3
https
3
http
3
filesystem
3
yt-dlp
3
sorting
2
web-crawler
2
archive
2
couchpotato
2
video-downloader
2
subsonic
2
youtube-dl
2
heritrix
2
java
2
libfuse
2
go
2
classification
2
video
2
command-line-tool
2
reddit
2
terminal
2
libcurl
2
musicbrainz
2
music
2
archiver
2
duplicate-files
2
calibre
2
fanfiction
2
lidarr
2
c
2
instagram-scraper
2
twitter
2
golang
2