Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-datahoarding
List of data-hoarding related tools
https://github.com/simon987/awesome-datahoarding
Last synced: 5 days ago
Data curation
- Everything
- FileBot
- Kiwix
- Mp3tag - Easy-to-use tool to edit metadata of audio files (Windows/Mac)
- TeraCopy
- tree
- WinDirStat
- WizTree
- SyncToy
- VisiPics
- baobab
- beets
- browsemonkey
- Calibre
- DataCurator-Filetree
- diskover
- fucking-weeb
- grepWin
- Hydrus
- jdupes
- MediaElch
- MediaInfo
- phockup
- picard
- DeepSort
- sist2 - Lightning-fast file system indexer and search tool
Download utilities
Web Archiving
- HTTrack
- wail - One-Click User Instigated Preservation
- Browsertrix Crawler - Browser-based high-fidelity crawling system, designed to run a complex, customizable browser-based crawl in a single Docker container
- Collect
- Heritrix - Web-scale, archival-quality web crawler
- wikiteam
- grab-site
- ArchiveBox - Self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
- webrecorder - High-fidelity, ISO-compliant web archives in a user-friendly interface, providing access to archived content, and sharing collections
General
- aria2 - Multi-protocol & multi-source command-line download utility
- rsync
- wget - Non-interactive download of files from the Web
- CrowLeer
- curl
- Horahora
- news-crawl
- Plowshare - Command-line tool to manage file-sharing sites
- Suck-It - multi-threaded
- wget2 - multi-threaded
- wpull - Wget-compatible web downloader and crawler
- you-get
- ytdl-sub
- yt-dlp - youtube-dl fork that behaves better
- annie - youtube-dl alternative written in Golang
- Rclone
Application-specific
- floatplane_ripper
- Scribd-Downloader
- Tube Archivist - Self-hosted Docker container for automated/scheduled YouTube downloads of channels, playlists, etc.
- BBCSoundDownloader
- ChanThreadWatch - Watches threads on 4chan-style boards and checks for updates until the thread dies
- comics-downloader - Command-line tool to download comics and manga in pdf/epub/cbz/cbr from supported sites
- gallery-dl
- Discord-Channel-Scraper
- dzi-dl
- FanFicFare
- FicSave
- flickr_download
- Google Images Download
- iiif-dl - Command-line tile downloader/assembler for IIIF endpoints/manifests
- imgbrd-grabber
- instaloader
- InstaLooter - API-less Instagram pictures and videos downloader.
- Instagram Scraper - instagram-scraper is a command-line application written in Python that scrapes and downloads an Instagram user's photos and videos. Use responsibly.
- PyInstaLive
- RedditDownloader
- snscrape
- RipMe
- tumblr-utils
- yt-mango
- Youtube-MA
Download automation
Backup
Compression
- 7-Zip
- peazip
- PIGZ - Multi-threaded gzip
- WinRAR
- KGB Archiver
Network
File systems
Utility Scripts
- Backblaze B2 sync backup script
- rclone_dirsize
- rm_empty_subdir - Remove empty sub-directories on Windows
- youtube-dl_soundcloud - Use youtube-dl to download SoundCloud playlists
- Misc download scripts
- TheFrenchGhosty's Ultimate YouTube-DL Scripts Collection - youtube-dl scripts to aid in YouTube channel archival
- void-cat-uploader
- flac2mp3_V0.py - Multi-threaded Python script to convert all FLAC files to MP3 V0 while keeping the directory structure
Content sharing
APIs & Online tools
Hardware / Monitoring
- CrystalDiskInfo
- GSmartControl - Multi-OS S.M.A.R.T. utility with an easy-to-understand graphical interface
- Hard Disk Sentinel - Multi-OS SSD and HDD monitoring and analysis software
- smartmontools
Data recovery
Local Media
- MakeMKV - Cross-platform DVD ripper that supports recent Blu-ray discs. It's mostly open source, but the Blu-ray secret sauce is still hidden
- Handbrake
- whipper - CD-DA ripper preferring accuracy over speed. Generates .flac, .cue, and .log by default and automatically fetches metadata from MusicBrainz. EAC log plugin is available.
File conversion
Keywords
python (19)
downloader (12)
crawler (6)
instagram (5)
warc (5)
cli (4)
download (4)
torrent (4)
tumblr (4)
scraper (4)
youtube (4)
video (4)
web-archiving (4)
sonarr (3)
youtube-dl (3)
yt-dlp (3)
rss (3)
golang (3)
epub (3)
radarr (3)
go (3)
ebook (3)
music (3)
linux (3)
flickr (2)
couchpotato (2)
archiver (2)
bilibili (2)
command-line-tool (2)
deviantart (2)
reddit (2)
help-wanted (2)
reactjs (2)
php (2)
movies (2)
calibre (2)
metadata (2)
fanfiction (2)
wget (2)
musicbrainz (2)
kodi (2)
lidarr (2)
movie (2)
plex (2)
twitter (2)
heritrix (2)
crawling (2)
instagram-scraper (2)
web-crawler (2)
self-hosted (2)