Projects in Awesome Lists by newfront
A curated list of projects in awesome lists by newfront .
https://github.com/newfront/spark-moderndataengineering
The source code for the book Modern Data Engineering with Apache Spark
Last synced: 18 Oct 2025
https://github.com/newfront/hitchhikers_guide_to_deltalake_streaming
Don't Panic. This guide will help you when it feels like the end of the world.
apache apache-spark deltalake hitchhikers-guide
Last synced: 26 Jul 2025
https://github.com/newfront/spark-intro-to-ml
A Gentle introduction to Machine Learning with Apache Spark
Last synced: 26 Jul 2025
https://github.com/newfront/odsc-west-streaming-trends
All Data, Relevant Information, Scripts, and Applications for the Open Data Science Conference (2018)
Last synced: 12 Aug 2025
https://github.com/newfront/spark-summit-2018
Spark Application : Spark Summit 2018 : Streaming Trend Discovery
Last synced: 26 Jul 2025
https://github.com/newfront/odsc-east-2020-decision-intelligence
This is the home of the 2020 Open Data Science Conference workshop (Creating Streaming Predictive Analytics and Decision Intelligence Systems with Apache Spark)
decision-intelligence-systems odsc odsc-east-2020 spark
Last synced: 01 Aug 2025
https://github.com/newfront/spark-inception
This project is available free of charge as a companion to my Data+AI Summit (2022) talk.
Last synced: 10 Jul 2025
https://github.com/newfront/odsc-east-realish-predictions
Material for the 2019 ODSC East Workshop (Realish Time Predictive Analytics with Spark Structured Streaming)
Last synced: 26 Jul 2025
https://github.com/newfront/hacker-dojo-rails3-code-base
This is the sample code repository for the November 2010 Hacker Dojo Rails 3 with Ruby Hands On Programming Class
Last synced: 26 Jul 2025
https://github.com/newfront/svcc-2019-realish-spark
This is the material for the 2019 Silicon Valley Code Camp Session "Realish Time Predictive Analytics with Spark Structured Streaming"
apache-spark silicon-valley-code-camp workshop-materials
Last synced: 26 Jul 2025
https://github.com/newfront/programming_objective-c_book
Source files from Programming Objective-C 2.0 (3rd Edition)
Last synced: 24 Mar 2025
https://github.com/newfront/svcc_application
Source Code and Files for the Web Socket I/O presentation by Scott Haines
Last synced: 24 Mar 2025
https://github.com/newfront/unitycatalog-playground
This project makes use of the open source Unity Catalog project and introduces a full notebook environment for simplifying how you work with UC OSS.
Last synced: 09 Mar 2026
https://github.com/newfront/docker-spark-base
Creates a customizable base image for working with Apache Spark
Last synced: 09 Feb 2026
https://github.com/newfront/learning-spark-sdp
This is an entire environment created to master the craft of Spark Declarative Pipelines.
Last synced: 09 Mar 2026
https://github.com/newfront/em-aim
Event Machine Bindings to the OpenAIM Platform (WebAIM)
Last synced: 11 Jun 2025
https://github.com/newfront/pyspark-datagen
Need data? Need data that feels real? What about fake real data? This is the project for you. Sales pitch complete.
Last synced: 09 Mar 2026
https://github.com/newfront/cocoa_conductor_agent
Handles all of the Messaging, and Polling of sockets
Last synced: 24 Mar 2025
https://github.com/newfront/odsc-east2019-warmup
Warmup Presentation for The 2019 Open Data Science Conference in Boston
Last synced: 12 Feb 2026
https://github.com/newfront/em-ffmpeg
Ruby bindings to Ffmpeg processes via EventMachine
Last synced: 19 Feb 2026
https://github.com/newfront/dojophp
Class Files from 2010 PHP Course - written for Hacker Dojo in Mountain View, CA
Last synced: 24 Mar 2025
https://github.com/newfront/odsc-west-2019-realtime-analytics
Workshop Material for Near RealTime Predictive Analytics with Apache Spark Structured Streaming Workshop at the Open Data Science Conference WEST 2019
apache-spark odsc-west-2019 odsc2019 realtime-predictive-analytics workshop-material
Last synced: 20 Apr 2026
https://github.com/newfront/zerobus-flow
This is a project that simplifies how we work with Databrick's Zerobus. Zerobus is a service for ingesting unbounded rows of data (like Kafka) but without the need for Kafka. It will write directly to Delta Lake tables within Unity Catalog.
Last synced: 09 Mar 2026
https://github.com/newfront/dataviz
Repo of Experiments for Visualizing Data and experimenting
Last synced: 05 Jan 2026
https://github.com/newfront/learn-spark-elasticsearch
This is a Docker environment for running ElasticSearch, Kibana, Spark and Zeppelin
Last synced: 24 Mar 2025
https://github.com/newfront/gameengine
Here is a take on building a game engine with html5
Last synced: 12 Sep 2025