Projects in Awesome Lists by archivesunleashed
A curated list of projects in awesome lists by archivesunleashed.
https://github.com/archivesunleashed/aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
analysis apache-spark big-data big-data-analytics dataframe digital-humanities hadoop network-graphing pyspark python3 scala spark text-extraction webarchives
Last synced: 13 Apr 2025
https://github.com/archivesunleashed/warclight
A Rails engine supporting the discovery of web archives.
blacklight discovery rails rails-engine ruby solr warc webarchive-discovery webarchives
Last synced: 27 Jul 2025
https://github.com/archivesunleashed/notebooks
Example notebooks for working with web archives using the Archives Unleashed Toolkit, and with the derivatives the toolkit generates.
juypter-notebook notebooks pyspark-notebook python3 spark web-archives
Last synced: 14 Oct 2025
https://github.com/archivesunleashed/graphpass
GraphPass is a utility to filter networks and provide a default visualization output for Gephi or SigmaJS.
c gephi gexf gexf-graph-files igraph sigmajs web-archive-analysis
Last synced: 09 Sep 2025
https://github.com/archivesunleashed/docker-aut
Docker image for the Archives Unleashed Toolkit.
archives-unleashed aut docker docker-image spark webarchives
Last synced: 27 Apr 2025
https://github.com/archivesunleashed/auk
Rails application for the Archives Unleashed Cloud.
apache-spark archives-unleashed archives-unleashed-toolkit rails rails-application webarchives
Last synced: 29 Sep 2025
https://github.com/archivesunleashed/auk-notebooks
Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives.
Last synced: 27 Jul 2025
https://github.com/archivesunleashed/twut
An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.
apache-spark spark spark-packages tweets twitter-data twitter-json
Last synced: 29 Oct 2025
https://github.com/archivesunleashed/archivesunleashed.org
This repository powers the project website.
Last synced: 27 Jan 2026
https://github.com/archivesunleashed/aut-resources
Resources for the Archives Unleashed Toolkit.
Last synced: 03 Feb 2026
https://github.com/archivesunleashed/juxta-collage
Instructions for building an image collage using the Juxta shell script and an image dataset generated through ARCH.
Last synced: 19 Mar 2026