An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by newfront

A curated list of projects in awesome lists by newfront .

https://github.com/newfront/spark-moderndataengineering

The source code for the book Modern Data Engineering with Apache Spark

Last synced: 18 Oct 2025

https://github.com/newfront/hitchhikers_guide_to_deltalake_streaming

Don't Panic. This guide will help you when it feels like the end of the world.

apache apache-spark deltalake hitchhikers-guide

Last synced: 26 Jul 2025

https://github.com/newfront/spark-intro-to-ml

A Gentle introduction to Machine Learning with Apache Spark

Last synced: 26 Jul 2025

https://github.com/newfront/odsc-west-streaming-trends

All Data, Relevant Information, Scripts, and Applications for the Open Data Science Conference (2018)

ml spark spark-streaming

Last synced: 12 Aug 2025

https://github.com/newfront/spark-summit-2018

Spark Application : Spark Summit 2018 : Streaming Trend Discovery

Last synced: 26 Jul 2025

https://github.com/newfront/odsc-east-2020-decision-intelligence

This is the home of the 2020 Open Data Science Conference workshop (Creating Streaming Predictive Analytics and Decision Intelligence Systems with Apache Spark)

decision-intelligence-systems odsc odsc-east-2020 spark

Last synced: 01 Aug 2025

https://github.com/newfront/spark-inception

This project is available free of charge as a companion to my Data+AI Summit (2022) talk.

Last synced: 10 Jul 2025

https://github.com/newfront/odsc-east-realish-predictions

Material for the 2019 ODSC East Workshop (Realish Time Predictive Analytics with Spark Structured Streaming)

Last synced: 26 Jul 2025

https://github.com/newfront/hacker-dojo-rails3-code-base

This is the sample code repository for the November 2010 Hacker Dojo Rails 3 with Ruby Hands On Programming Class

Last synced: 26 Jul 2025

https://github.com/newfront/svcc-2019-realish-spark

This is the material for the 2019 Silicon Valley Code Camp Session "Realish Time Predictive Analytics with Spark Structured Streaming"

apache-spark silicon-valley-code-camp workshop-materials

Last synced: 26 Jul 2025

https://github.com/newfront/programming_objective-c_book

Source files from Programming Objective-C 2.0 (3rd Edition)

Last synced: 24 Mar 2025

https://github.com/newfront/svcc_application

Source Code and Files for the Web Socket I/O presentation by Scott Haines

Last synced: 24 Mar 2025

https://github.com/newfront/dailyhacking

source files

Last synced: 24 Mar 2025

https://github.com/newfront/unitycatalog-playground

This project makes use of the open source Unity Catalog project and introduces a full notebook environment for simplifying how you work with UC OSS.

Last synced: 09 Mar 2026

https://github.com/newfront/docker-spark-base

Creates a customizable base image for working with Apache Spark

Last synced: 09 Feb 2026

https://github.com/newfront/learning-spark-sdp

This is an entire environment created to master the craft of Spark Declarative Pipelines.

Last synced: 09 Mar 2026

https://github.com/newfront/em-aim

Event Machine Bindings to the OpenAIM Platform (WebAIM)

Last synced: 11 Jun 2025

https://github.com/newfront/pyspark-datagen

Need data? Need data that feels real? What about fake real data? This is the project for you. Sales pitch complete.

Last synced: 09 Mar 2026

https://github.com/newfront/cocoa_conductor_agent

Handles all of the Messaging, and Polling of sockets

Last synced: 24 Mar 2025

https://github.com/newfront/odsc-east2019-warmup

Warmup Presentation for The 2019 Open Data Science Conference in Boston

Last synced: 12 Feb 2026

https://github.com/newfront/em-ffmpeg

Ruby bindings to Ffmpeg processes via EventMachine

Last synced: 19 Feb 2026

https://github.com/newfront/dojophp

Class Files from 2010 PHP Course - written for Hacker Dojo in Mountain View, CA

Last synced: 24 Mar 2025

https://github.com/newfront/odsc-west-2019-realtime-analytics

Workshop Material for Near RealTime Predictive Analytics with Apache Spark Structured Streaming Workshop at the Open Data Science Conference WEST 2019

apache-spark odsc-west-2019 odsc2019 realtime-predictive-analytics workshop-material

Last synced: 20 Apr 2026

https://github.com/newfront/zerobus-flow

This is a project that simplifies how we work with Databrick's Zerobus. Zerobus is a service for ingesting unbounded rows of data (like Kafka) but without the need for Kafka. It will write directly to Delta Lake tables within Unity Catalog.

Last synced: 09 Mar 2026

https://github.com/newfront/dataviz

Repo of Experiments for Visualizing Data and experimenting

Last synced: 05 Jan 2026

https://github.com/newfront/webworkers-js

Javascript Webworkers Playground

Last synced: 05 Jan 2026

https://github.com/newfront/html5

HTML5 research and development

Last synced: 24 Mar 2025

https://github.com/newfront/learn-spark-elasticsearch

This is a Docker environment for running ElasticSearch, Kibana, Spark and Zeppelin

Last synced: 24 Mar 2025

https://github.com/newfront/gameengine

Here is a take on building a game engine with html5

Last synced: 12 Sep 2025

https://github.com/newfront/datariders

Meetup Presentations

Last synced: 26 Feb 2026