Projects in Awesome Lists by iht
A curated list of projects in awesome lists by iht .
https://github.com/iht/fpinscala
Solution to the exercises of Functional Programming in Scala, from Mannig Publications
Last synced: 20 Sep 2025
https://github.com/iht/python-profiling-beam-summit-2021
This repository contains a streaming Dataflow pipeline written in Python with Apache Beam, reading data from PubSub.
Last synced: 20 Sep 2025
https://github.com/iht/vertex-tfx-pipeline
An example of TFX intended to work with Vertex AI in Google Cloud
google-cloud mlops tfx vertex-ai
Last synced: 20 Sep 2025
https://github.com/iht/dataflow-scala-streaming
Example with Scala and Dataflow for the Madrid Scala Meetup
Last synced: 20 Sep 2025
https://github.com/iht/scio-scala-beam-summit
This repository contains the template and the solution for the Beam Summit 2021 workshop
Last synced: 20 Sep 2025
https://github.com/iht/ml-in-prod
Template for Python apps that implement training and inference of Machine Learning models with Tensorflow
Last synced: 20 Sep 2025
https://github.com/iht/stitchy-studio
Design your own cross stitch patterns and drawings
Last synced: 22 Aug 2025
https://github.com/iht/splittable-dofns-python
This repository contains the code samples used for the workshop "Splittable DoFns in Python" of the Beam Summit 2022
Last synced: 11 Aug 2025
https://github.com/iht/beam-late-data
A unit test for Apache Beam streaming, to check if your window would drop data, and how many times would the window be triggered
Last synced: 09 Jun 2026
https://github.com/iht/sample-tfx-pipeline
A sample TFX pipeline intended to run in local
Last synced: 20 Sep 2025
https://github.com/iht/debian-measurer
Tool to gather the sources of Debian and measure them
Last synced: 04 Jun 2026
https://github.com/iht/beam-cloud-build-terraform
The scripts in this repo will build the Apache Beam Java SDK packages, using Cloud Build and Artifact Registry, for a personal Beam fork.
apache-beam artifact-registry cloud-build google-cloud
Last synced: 06 Mar 2026
https://github.com/iht/bigquery-dataflow-cdc-example
A Dataflow streaming pipeline written in Java, reading data from Pubsub and recovering the sessions from potentially unordered data, and upserting the session data into BigQuery with no duplicates
apache-beam bigquery cdc dataflow google-cloud pubsub
Last synced: 04 Jan 2026
https://github.com/iht/scio-quickstart
This repository contains a sample pipeline for starting with Scio, the Scala framework to develop Apache Beam pipelines. Fork this repository so you can commit your changes in your own repository.
Last synced: 07 Jun 2026
https://github.com/iht/elastic2bq
A Beam pipeline that takes a ElasticSearch index and creates a BigQuery table with the same contents.
Last synced: 08 Jun 2026