Projects in Awesome Lists by ArchiveTeam
A curated list of projects in awesome lists by ArchiveTeam .
https://github.com/ArchiveTeam/wpull
Wget-compatible web downloader and crawler.
Last synced: 06 Apr 2025
https://github.com/archiveteam/wpull
Wget-compatible web downloader and crawler.
Last synced: 27 Jan 2026
https://github.com/ArchiveTeam/ArchiveBot
ArchiveBot, an IRC bot for archiving websites
archiving haxe irc javascript python ruby
Last synced: 17 Jul 2025
https://github.com/archiveteam/archivebot
ArchiveBot, an IRC bot for archiving websites
archiving haxe irc javascript python ruby
Last synced: 09 Apr 2025
https://github.com/archiveteam/warrior-dockerfile
A Dockerfile for the ArchiveTeam Warrior
Last synced: 05 Apr 2025
https://github.com/archiveteam/ubuntu-warrior
Scripts to build and boot warrior virtual machine containing Docker
Last synced: 15 May 2025
https://github.com/archiveteam/wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
archiveteam archiving crawl crawler crawlers crawling downloader ftp lua scraper scraping spider warc webarchiving wget wget-lua zstd
Last synced: 04 Apr 2025
https://github.com/ArchiveTeam/Ubuntu-Warrior
Scripts to build and boot warrior virtual machine containing Docker
Last synced: 03 Apr 2025
https://github.com/ArchiveTeam/wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
archiveteam archiving crawl crawler crawlers crawling downloader ftp lua scraper scraping spider warc webarchiving wget wget-lua zstd
Last synced: 18 Jul 2025
https://github.com/archiveteam/ia.bak
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
Last synced: 11 Mar 2026
https://github.com/ArchiveTeam/IA.BAK
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
Last synced: 24 Mar 2025
https://github.com/ArchiveTeam/terroroftinytown
URLTeam's second generation of URL shortener archiving tools
Last synced: 03 Apr 2025
https://github.com/archiveteam/terroroftinytown
URLTeam's second generation of URL shortener archiving tools
Last synced: 28 Oct 2025
https://github.com/archiveteam/seesaw-kit
Making a reusable toolkit for writing seesaw scripts
Last synced: 17 Jun 2025
https://github.com/archiveteam/reddit-grab
Grabbing everything from reddit.
Last synced: 15 May 2025
https://github.com/ArchiveTeam/reddit-grab
Grabbing everything from reddit.
Last synced: 24 Mar 2025
https://github.com/archiveteam/yahooanswers-grab
Saving all questions and answers from Yahoo! Answers.
Last synced: 15 May 2025
https://github.com/ArchiveTeam/yahooanswers-grab
Saving all questions and answers from Yahoo! Answers.
Last synced: 28 Mar 2025
https://github.com/archiveteam/tumblr-grab
Archiving all to-be-deleted NSFW tumblr blogs.
Last synced: 15 May 2025
https://github.com/archiveteam/universal-tracker
A configurable, reusable tracker with dashboard
Last synced: 15 May 2025
https://github.com/ArchiveTeam/universal-tracker
A configurable, reusable tracker with dashboard
Last synced: 03 Apr 2025
https://github.com/archiveteam/warrior4-vm
Warrior virtual machine appliance (version 4)
Last synced: 15 May 2025
https://github.com/archiveteam/terroroftinytown-client-grab
The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project
Last synced: 30 Oct 2025
https://github.com/archiveteam/ludios_wpull
wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved
Last synced: 13 Mar 2025
https://github.com/archiveteam/youtube-grab
Archiving all metadata from YouTube (everything except videos themselves due to size)
Last synced: 11 Jul 2025
https://github.com/archiveteam/warrior-code2
Boot scripts for the ArchiveTeam Warrior 2
Last synced: 15 May 2025
https://github.com/ArchiveTeam/usgovernment-grab
Archiving parts of the US government.
Last synced: 15 May 2026
https://github.com/archiveteam/urls-grab
Archiving URLs (outlinks) from a variety of sources.
Last synced: 29 Aug 2025
https://github.com/ArchiveTeam/WebArchiver
Decentralized web archiving
archiver archiving crawler decentralized python warc web webarchiving
Last synced: 07 Apr 2025
https://github.com/archiveteam/webarchiver
Decentralized web archiving
archiver archiving crawler decentralized python warc web webarchiving
Last synced: 15 May 2025
https://github.com/archiveteam/gamemaker-sandbox-items
Gamemaker Sandbox Tracker items
Last synced: 28 Jan 2026
https://github.com/archiveteam/500px-grab
Archiving https://500px.com/creativecommons
Last synced: 15 May 2025
https://github.com/ArchiveTeam/youtube-dislikes-grab
Archiving general youtube video metadata through innertube for dislikes removal.
Last synced: 01 Aug 2025
https://github.com/archiveteam/youtube-dislikes-grab
Archiving general youtube video metadata through innertube for dislikes removal.
Last synced: 15 May 2025
https://github.com/archiveteam/telegram-grab
Archiving public telegram messages.
Last synced: 15 May 2025
https://github.com/ArchiveTeam/VideoBot
Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.
Last synced: 17 Jul 2025
https://github.com/archiveteam/videobot
Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.
Last synced: 15 May 2025
https://github.com/archiveteam/urlteam-stuff
Urlteam website, code, ... also, PONIES
Last synced: 15 May 2025
https://github.com/archiveteam/youtube-dislikes-items
Managing items for youtube-dislikes-grab.
Last synced: 30 Dec 2025
https://github.com/archiveteam/google-sites-grab
Archiving Google Sites Classic.
Last synced: 15 May 2025
https://github.com/archiveteam/mediafire-items
Managing items for mediafire-grab.
Last synced: 16 Mar 2026
https://github.com/archiveteam/greader-grab
http://www.archiveteam.org/index.php?title=Google_Reader
Last synced: 09 Oct 2025
https://github.com/archiveteam/wget-lua-forum-scripts
Downloading forums posts with Wget+Lua
Last synced: 08 Sep 2025
https://github.com/archiveteam/youtube-items
Managing items for youtube-grab
Last synced: 02 Jan 2026
https://github.com/archiveteam/citeseerxpdf-grab
Grabbing all sources of CiteSeerX.
Last synced: 15 May 2025
https://github.com/archiveteam/mediafire-grab
Archiving mediafire.com URLs.
Last synced: 15 May 2025
https://github.com/archiveteam/tinyarchive
Software behind tracker.tinyarchive.org - Warning: Very hacky code
Last synced: 07 Mar 2026
https://github.com/archiveteam/archiveteam-megawarc-factory
Some scripts to process ArchiveTeam uploads
Last synced: 15 May 2025
https://github.com/archiveteam/warrior-preseed
Constructing a new warrior VM
Last synced: 12 Mar 2026
https://github.com/archiveteam/justintv-grab
Grabbing as much of justin.tv's archives as possible
Last synced: 15 May 2025
https://github.com/archiveteam/reddit-items
Managing items for reddit-grab.
Last synced: 30 Dec 2025
https://github.com/archiveteam/gamemaker-sandbox-grab
Grabbing sandbox.yoyogames.com
Last synced: 15 May 2025
https://github.com/archiveteam/heroku-buildpack-archiveteam-warrior
Heroku buildpack with the Archive Team Warrior
Last synced: 15 May 2025
https://github.com/archiveteam/grab-base-df
Base Dockerfile for warrior project grab scripts
Last synced: 27 Dec 2025
https://github.com/archiveteam/furaffinity-grab
Grabbing all images and other stuff from Fur Affinity.
Last synced: 07 Jul 2025
https://github.com/archiveteam/google-newspapers-grab
Archiving the Google New Archive at https://news.google.com/newspapers.
Last synced: 12 Apr 2025
https://github.com/archiveteam/blingee-grab
Saving all images and content from Blingee.
Last synced: 29 Apr 2026
https://github.com/archiveteam/universal-tracker-2
A better tracker with more features for ArchiveTeam
Last synced: 15 May 2025
https://github.com/archiveteam/flashdomains-grab
Copy of domains-grab for Flash sites.
Last synced: 10 Mar 2026
https://github.com/archiveteam/tumblr-grab-test
Archiving Tumblr blogs (an ArchiveTeam Warrior testing project)
Last synced: 13 Mar 2025