An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by ArchiveTeam

A curated list of projects in awesome lists by ArchiveTeam .

https://github.com/archiveteam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

archiving crawl crawler spider warc

Last synced: 15 May 2025

https://github.com/ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

archiving crawl crawler spider warc

Last synced: 07 Apr 2025

https://github.com/ArchiveTeam/wpull

Wget-compatible web downloader and crawler.

Last synced: 06 Apr 2025

https://github.com/archiveteam/wpull

Wget-compatible web downloader and crawler.

Last synced: 27 Jan 2026

https://github.com/ArchiveTeam/ArchiveBot

ArchiveBot, an IRC bot for archiving websites

archiving haxe irc javascript python ruby

Last synced: 17 Jul 2025

https://github.com/archiveteam/archivebot

ArchiveBot, an IRC bot for archiving websites

archiving haxe irc javascript python ruby

Last synced: 09 Apr 2025

https://github.com/archiveteam/warrior-dockerfile

A Dockerfile for the ArchiveTeam Warrior

Last synced: 05 Apr 2025

https://github.com/ArchiveTeam/parler-grab

Archiving Parler.

Last synced: 12 Oct 2025

https://github.com/archiveteam/parler-grab

Archiving Parler.

Last synced: 08 May 2025

https://github.com/archiveteam/ubuntu-warrior

Scripts to build and boot warrior virtual machine containing Docker

Last synced: 15 May 2025

https://github.com/archiveteam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

archiveteam archiving crawl crawler crawlers crawling downloader ftp lua scraper scraping spider warc webarchiving wget wget-lua zstd

Last synced: 04 Apr 2025

https://github.com/ArchiveTeam/Ubuntu-Warrior

Scripts to build and boot warrior virtual machine containing Docker

Last synced: 03 Apr 2025

https://github.com/ArchiveTeam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

archiveteam archiving crawl crawler crawlers crawling downloader ftp lua scraper scraping spider warc webarchiving wget wget-lua zstd

Last synced: 18 Jul 2025

https://github.com/archiveteam/ia.bak

We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.

Last synced: 11 Mar 2026

https://github.com/ArchiveTeam/IA.BAK

We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.

Last synced: 24 Mar 2025

https://github.com/ArchiveTeam/terroroftinytown

URLTeam's second generation of URL shortener archiving tools

Last synced: 03 Apr 2025

https://github.com/archiveteam/terroroftinytown

URLTeam's second generation of URL shortener archiving tools

Last synced: 28 Oct 2025

https://github.com/archiveteam/seesaw-kit

Making a reusable toolkit for writing seesaw scripts

Last synced: 17 Jun 2025

https://github.com/archiveteam/imgur-grab

Archiving imgur.

Last synced: 10 Apr 2025

https://github.com/ArchiveTeam/NewsGrabber

Grabbing all news.

Last synced: 17 Jul 2025

https://github.com/archiveteam/reddit-grab

Grabbing everything from reddit.

Last synced: 15 May 2025

https://github.com/archiveteam/newsgrabber

Grabbing all news.

Last synced: 06 Jul 2025

https://github.com/ArchiveTeam/reddit-grab

Grabbing everything from reddit.

Last synced: 24 Mar 2025

https://github.com/archiveteam/yahooanswers-grab

Saving all questions and answers from Yahoo! Answers.

Last synced: 15 May 2025

https://github.com/ArchiveTeam/yahooanswers-grab

Saving all questions and answers from Yahoo! Answers.

Last synced: 28 Mar 2025

https://github.com/archiveteam/tumblr-grab

Archiving all to-be-deleted NSFW tumblr blogs.

Last synced: 15 May 2025

https://github.com/archiveteam/universal-tracker

A configurable, reusable tracker with dashboard

Last synced: 15 May 2025

https://github.com/ArchiveTeam/universal-tracker

A configurable, reusable tracker with dashboard

Last synced: 03 Apr 2025

https://github.com/archiveteam/warrior4-vm

Warrior virtual machine appliance (version 4)

Last synced: 15 May 2025

https://github.com/archiveteam/terroroftinytown-client-grab

The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project

Last synced: 30 Oct 2025

https://github.com/archiveteam/ludios_wpull

wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved

Last synced: 13 Mar 2025

https://github.com/archiveteam/youtube-grab

Archiving all metadata from YouTube (everything except videos themselves due to size)

Last synced: 11 Jul 2025

https://github.com/archiveteam/warrior-code2

Boot scripts for the ArchiveTeam Warrior 2

Last synced: 15 May 2025

https://github.com/archiveteam/googleplus-grab

Archiving Google+.

Last synced: 15 May 2025

https://github.com/ArchiveTeam/usgovernment-grab

Archiving parts of the US government.

Last synced: 15 May 2026

https://github.com/archiveteam/ftp-gov-grab

Archiving government FTPs.

Last synced: 15 May 2025

https://github.com/archiveteam/urls-grab

Archiving URLs (outlinks) from a variety of sources.

Last synced: 29 Aug 2025

https://github.com/archiveteam/gamemaker-sandbox-items

Gamemaker Sandbox Tracker items

Last synced: 28 Jan 2026

https://github.com/archiveteam/500px-grab

Archiving https://500px.com/creativecommons

Last synced: 15 May 2025

https://github.com/archiveteam/tinyback

A tiny web scraper

Last synced: 15 May 2025

https://github.com/ArchiveTeam/youtube-dislikes-grab

Archiving general youtube video metadata through innertube for dislikes removal.

Last synced: 01 Aug 2025

https://github.com/archiveteam/youtube-dislikes-grab

Archiving general youtube video metadata through innertube for dislikes removal.

Last synced: 15 May 2025

https://github.com/archiveteam/telegram-grab

Archiving public telegram messages.

Last synced: 15 May 2025

https://github.com/archiveteam/subscene-grab

Archiving Subscene.

Last synced: 15 May 2025

https://github.com/ArchiveTeam/VideoBot

Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.

Last synced: 17 Jul 2025

https://github.com/archiveteam/videobot

Specialised bot for periodical grabs and video/audio/etc. webpage scrapes.

Last synced: 15 May 2025

https://github.com/archiveteam/urlteam-stuff

Urlteam website, code, ... also, PONIES

Last synced: 15 May 2025

https://github.com/archiveteam/youtube-dislikes-items

Managing items for youtube-dislikes-grab.

Last synced: 30 Dec 2025

https://github.com/archiveteam/pastebin-grab

Archiving pastebin

Last synced: 26 Jul 2025

https://github.com/archiveteam/github-grab

Archiving GitHub

Last synced: 05 Mar 2026

https://github.com/archiveteam/google-sites-grab

Archiving Google Sites Classic.

Last synced: 15 May 2025

https://github.com/archiveteam/blogger-grab

Archiving Blogger/Blogspot.

Last synced: 15 May 2025

https://github.com/archiveteam/ftp-grab

Save all FTP sites!

Last synced: 15 May 2025

https://github.com/archiveteam/flickr-grab

Grabbing Flickr images.

Last synced: 15 May 2025

https://github.com/archiveteam/urls-sources

Sources for urls-grab.

Last synced: 24 Jun 2025

https://github.com/ArchiveTeam/flickr-grab

Grabbing Flickr images.

Last synced: 03 Apr 2025

https://github.com/archiveteam/ftp-nab

Thinger to download FTP sites

Last synced: 24 Jun 2025

https://github.com/archiveteam/mediafire-items

Managing items for mediafire-grab.

Last synced: 16 Mar 2026

https://github.com/archiveteam/greader-grab

http://www.archiveteam.org/index.php?title=Google_Reader

Last synced: 09 Oct 2025

https://github.com/archiveteam/twitchtv-grab

Grabbing twitch.tv videos

Last synced: 15 May 2025

https://github.com/archiveteam/wget-lua-forum-scripts

Downloading forums posts with Wget+Lua

Last synced: 08 Sep 2025

https://github.com/archiveteam/youtube-items

Managing items for youtube-grab

Last synced: 02 Jan 2026

https://github.com/archiveteam/mobileme-grab

Downloading MobileMe

Last synced: 12 Jun 2025

https://github.com/archiveteam/ftp-queue

Create queue items for ftp-grab.

Last synced: 13 Feb 2026

https://github.com/archiveteam/citeseerxpdf-grab

Grabbing all sources of CiteSeerX.

Last synced: 15 May 2025

https://github.com/archiveteam/mediafire-grab

Archiving mediafire.com URLs.

Last synced: 15 May 2025

https://github.com/archiveteam/tinyarchive

Software behind tracker.tinyarchive.org - Warning: Very hacky code

Last synced: 07 Mar 2026

https://github.com/archiveteam/yahoomessages-grab

Archiving Yahoo Messages

Last synced: 27 Oct 2025

https://github.com/archiveteam/archiveteam-megawarc-factory

Some scripts to process ArchiveTeam uploads

Last synced: 15 May 2025

https://github.com/archiveteam/ffnet-grab

Fanfictioning

Last synced: 21 Aug 2025

https://github.com/archiveteam/warrior-preseed

Constructing a new warrior VM

Last synced: 12 Mar 2026

https://github.com/archiveteam/justintv-grab

Grabbing as much of justin.tv's archives as possible

Last synced: 15 May 2025

https://github.com/archiveteam/coursera-grab

Saving courses from Coursera.

Last synced: 21 Feb 2026

https://github.com/archiveteam/reddit-items

Managing items for reddit-grab.

Last synced: 30 Dec 2025

https://github.com/archiveteam/formspring-grab

Downloading Formspring

Last synced: 20 Jun 2025

https://github.com/archiveteam/gamemaker-sandbox-grab

Grabbing sandbox.yoyogames.com

Last synced: 15 May 2025

https://github.com/archiveteam/heroku-buildpack-archiveteam-warrior

Heroku buildpack with the Archive Team Warrior

Last synced: 15 May 2025

https://github.com/ArchiveTeam/wikis-grab

Grabbing all wikis.

Last synced: 03 Apr 2025

https://github.com/archiveteam/grab-base-df

Base Dockerfile for warrior project grab scripts

Last synced: 27 Dec 2025

https://github.com/archiveteam/gfycat-grab

Archiving gfycat.com.

Last synced: 23 Jun 2025

https://github.com/archiveteam/furaffinity-grab

Grabbing all images and other stuff from Fur Affinity.

Last synced: 07 Jul 2025

https://github.com/archiveteam/google-newspapers-grab

Archiving the Google New Archive at https://news.google.com/newspapers.

Last synced: 12 Apr 2025

https://github.com/archiveteam/imdb-grab

Archiving IMDb.

Last synced: 19 Oct 2025

https://github.com/archiveteam/liveleak-grab

Archiving liveleak.com

Last synced: 10 Jan 2026

https://github.com/archiveteam/blingee-grab

Saving all images and content from Blingee.

Last synced: 29 Apr 2026

https://github.com/archiveteam/wikis-grab

Grabbing all wikis.

Last synced: 13 Mar 2025

https://github.com/archiveteam/sourceforge-grab

Archiving SourceForge.

Last synced: 15 May 2025

https://github.com/archiveteam/wikidot-grab

Archiving wikidot.

Last synced: 06 Oct 2025

https://github.com/archiveteam/universal-tracker-2

A better tracker with more features for ArchiveTeam

Last synced: 15 May 2025

https://github.com/archiveteam/flashdomains-grab

Copy of domains-grab for Flash sites.

Last synced: 10 Mar 2026

https://github.com/archiveteam/tumblr-grab-test

Archiving Tumblr blogs (an ArchiveTeam Warrior testing project)

Last synced: 13 Mar 2025

https://github.com/archiveteam/roblox-grab

Archiving roblox forums.

Last synced: 27 Jul 2025