Projects in Awesome Lists by aamend
A curated list of projects in awesome lists by aamend .
https://github.com/aamend/spark-gdelt
Binding the GDELT universe in a Spark environment
analytics gdelt news parser spark structured-streaming
Last synced: 14 Apr 2025
https://github.com/aamend/pathogen
The rooster crows immediately before sunrise, the rooster causes the sun to rise
big-data bigdata causation contagion correlation datascience fcm graph graphx machine-learning spark
Last synced: 09 Sep 2025
https://github.com/aamend/hadoop-primitive-clustering
Hadoop implementation of Canopy Clustering using Levenshtein distance
Last synced: 29 Jul 2025
https://github.com/aamend/ml-registry
Enabling continuous delivery and improvement of Spark pipeline models through devops methodology and ML governance
datascience devops machinelearning maven ml nexus spark
Last synced: 14 Apr 2025
https://github.com/aamend/fire-spark
Mapping fire datamodel to spark execution
Last synced: 30 Jan 2026
https://github.com/aamend/spark-archetype
Maven archetype is a convenient way to create fully fledged SPARK libraries at minimal cost
Last synced: 17 Apr 2026
https://github.com/aamend/canopy-clustering
Automatically exported from code.google.com/p/canopy-clustering
Last synced: 30 Jan 2026
https://github.com/aamend/hadoop-recordreader
Custom RecordReader to allow custom delimiter (used for legacy 1.2.1 version)
Last synced: 30 Jan 2026
https://github.com/aamend/vagrant
Vagrant scripts to spin up hadoop instance
Last synced: 30 Jan 2026