Projects in Awesome Lists by getindata
A curated list of projects in awesome lists by getindata .
https://github.com/getindata/flink-http-connector
Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.
data-streaming flink flink-sql flink-stream-processing java
Last synced: 12 Jan 2026
https://github.com/getindata/dbt-flink-adapter
Adapter for dbt that executes dbt pipelines on Apache Flink
apache-flink data-streaming dbt streaming-analytics
Last synced: 07 May 2025
https://github.com/getindata/kedro-kubeflow
Kedro Plugin to support running workflows on Kubeflow Pipelines
ai-pipelines kedro kedro-kubeflow kedro-plugin kubeflow kubeflow-pipelines machinelearning mlops
Last synced: 30 Apr 2025
https://github.com/getindata/dbt-airflow-factory
Library to convert DBT manifest metadata to Airflow tasks
Last synced: 09 Apr 2025
https://github.com/getindata/kedro-azureml
Kedro plugin to support running workflows on Microsoft Azure ML Pipelines
azure-ml kedro kedro-plugin machinelearning mlops
Last synced: 27 Jul 2025
https://github.com/getindata/kedro-vertexai
Kedro Plugin to support running workflows on GCP Vertex AI Pipelines
googlecloudplatform kedro kedro-plugin machinelearning mlops vertexai
Last synced: 06 Apr 2025
https://github.com/getindata/kedro-airflow-k8s
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
airflow k8s kedro kedro-airflow kedro-plugin kuberentes machinelearning mlops
Last synced: 09 Apr 2025
https://github.com/getindata/kedro-sagemaker
Kedro Plugin to support running pipelines on AWS SageMaker.
kedro kedro-plugin machinelearning mlops sagemaker
Last synced: 27 Jul 2025
https://github.com/getindata/quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
Last synced: 09 Apr 2025
https://github.com/getindata/docker-atlantis
Custom Atlantis docker image developed by GetInData
Last synced: 08 Oct 2025
https://github.com/getindata/kedro-snowflake
Kedro Snowflake / Snowpark plugin
kedro machine-learning mlops snowflake snowpark
Last synced: 09 Apr 2025
https://github.com/getindata/flink-spring
A library that allows using Spring dependency injection framework in Flink Jobs
Last synced: 24 Apr 2025
https://github.com/getindata/terraform-module-template
Terraform module template - boilerplate used to simplify creation of new Terraform modules
Last synced: 08 Jul 2025
https://github.com/getindata/helm-charts
GetInData Helm Charts repository
helm helm-charts k8s repository
Last synced: 26 Jul 2025
https://github.com/getindata/terraform-snowflake-role
Terraform module for managing Snowflake role and grants
Last synced: 09 Apr 2025
https://github.com/getindata/terraform-azurerm-atlantis
Terraform module for deploying Atlantis in Azure Container Group
atlantis azure terraform terraform-module terragrunt
Last synced: 09 Apr 2025
https://github.com/getindata/flink-sql-runner
Framework for scheduling Flink SQL jobs on AWS Elastic MapReduce or a standalone Flink cluster.
Last synced: 09 Apr 2025
https://github.com/getindata/jupyter-images
Receipes of publicly-available Jupyter images
jupyter jupyter-notebook jupyterlab mlops mlops-environment
Last synced: 07 Jul 2025
https://github.com/getindata/terraform-aws-organization
Terraform module for AWS Organization management
Last synced: 11 Jul 2025
https://github.com/getindata/gitlab_cicd_templates
The project contains templates for CICD processes
Last synced: 03 Jan 2026
https://github.com/getindata/data-pipelines-template-example
The project contains an example of a template to create pipeline project with GetInData Framework based on DBT
Last synced: 23 Oct 2025
https://github.com/getindata/terraform-azurerm-storage-account
Terraform Module for Azure Storage Account
Last synced: 10 Sep 2025
https://github.com/getindata/terraform-null-atlantis-repo-config
Module for generating Atlantis repo config file. It contains set of custom workflows
Last synced: 04 Jan 2026
https://github.com/getindata/streaming-cli
Python CLI for streaming platform
Last synced: 09 Sep 2025
https://github.com/getindata/quickstart-ml-starter
Kedro starterts to quickly set up new projects according to QuickStart ML Blueprints practice.
Last synced: 30 Oct 2025
https://github.com/getindata/feast-kafka-postgres-demo
feast-framework feature-store flink streaming
Last synced: 08 Oct 2025
https://github.com/getindata/terraform-snowflake-database
Terraform module for managing Snowflake database
database schema snowflake snowflake-db terraform terraform-module
Last synced: 06 Mar 2026
https://github.com/getindata/flink-elastic-catalog
Flink Catalog for Elasticsearch.
Last synced: 02 Aug 2025
https://github.com/getindata/training-infra
Scripts setting up infrastructure for trainings
Last synced: 09 Apr 2025
https://github.com/getindata/streaming-ml-with-flink
Demo of running SciKit model on Flink, using Mleap serialization
Last synced: 17 Mar 2025
https://github.com/getindata/py-pre-commit-hooks
This is small repository that adds python based hooks for pre-commit
Last synced: 03 Jan 2026
https://github.com/getindata/terraform-azurerm-mlflow
Module for deploying serverless MLflow instance on Azure, using Serverless SQL Server, Container App Service and Azure Blob Storage.
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-snowflake-user
Terraform module for creating snowflake users
Last synced: 26 Feb 2026
https://github.com/getindata/docker-image-template
Docker image template - boilerplate used to simplify creation of new docker images
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-snowflake-warehouse
Terraform module for Snowflake Warehouse management
module snowflake snowflakedb terraform
Last synced: 14 Feb 2026
https://github.com/getindata/terraform-snowflake-privatelink-aws
Terraform module for Snowflake AWS PrivateLink management
aws module privatelink snowflake terraform
Last synced: 20 May 2026
https://github.com/getindata/terraform-azurerm-public-ip
Terraform module for managing Azure Public IP
azure public-ip terraform terraform-module
Last synced: 17 May 2026
https://github.com/getindata/example-kedro-azureml-pytorch-distributed
This repository contains an example project showing how to run distributed PyTorch training on Azure ML pipelines with Kedro. See the related blogpost.
Last synced: 17 Mar 2025
https://github.com/getindata/looker-pre-commit
A set of pre-commit hooks for Looker
Last synced: 19 Mar 2026
https://github.com/getindata/github-workflows
Collection of reusable GitHub Actions workflows
Last synced: 03 Jan 2026
https://github.com/getindata/terraform-aws-budget
Terraform module to manage AWS Budgets
Last synced: 03 Jan 2026
https://github.com/getindata/terraform-snowflake-schema
Terraform module for managing Snowflake schemas
schema snowflake terraform terraform-module
Last synced: 08 Feb 2026
https://github.com/getindata/terraform-azurerm-container-group
Terraform Module for creating Azure Container Group
Last synced: 19 Mar 2026
https://github.com/getindata/mlflow-appengine-terraform
Terraform module for deploying MLflow on Google Cloud AppEngine Flexible
Last synced: 19 Mar 2026
https://github.com/getindata/gid-mdp-workshop
Repository with lab exercises and their solutions
Last synced: 03 Jan 2026
https://github.com/getindata/terraform-snowflake-stage
Terraform module for managing Snowflake stages
Last synced: 03 Jan 2026
https://github.com/getindata/test-spark-app
Skeleton for Spark Application with HiveContext and tests
Last synced: 29 Jul 2025
https://github.com/getindata/terraform-azurerm-module-template
Terraform module template for creating Azure based Terraform modulees
Last synced: 03 Jan 2026
https://github.com/getindata/dbt-common-macros
It contains macros shared between projects.
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-snowflake-resource-monitor
Terraform module that can create and manage Snowflake Resource Monitors.
Last synced: 19 Mar 2026
https://github.com/getindata/flink-ververica-catalog-proxy
Proxy to the internal Ververica Catalog via Ververica REST Api
flink flink-sql sql ververica-platform
Last synced: 22 Jun 2025
https://github.com/getindata/streaming-ml-with-ksql
Demo of running Spark MLLib model on Kafka with KSQL, using Mleap serialization
Last synced: 17 Mar 2025
https://github.com/getindata/terraform-azurerm-tenant-management
Terraform module for configuring Azure Tenant: Management Groups, Azure Policy, Subscription.
Last synced: 19 Mar 2026
https://github.com/getindata/kafka-streams-avro
Example application for Kafka Streams training
Last synced: 17 Mar 2025
https://github.com/getindata/ververica-platform-flink-workshop
All tools for local Ververica Platform setup with Flink SQL
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-gke-helm-release
GKE Helm release module
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-snowflake-shared-database
Terraform module for managing Shared Databases in Snowflake
Last synced: 19 Mar 2026
https://github.com/getindata/data-pipelines-cli-init-example
The example for init template for Data Pipelines CLI tool
Last synced: 19 Mar 2026
https://github.com/getindata/mlops-gcp-vertex-snowflake-dbt
The repository contains code samples from "MLOPs for Pro's - Technical perspective. Build a Feature Store Faster - an Introduction to Vertex AI, Snowflake and dbt Cloud" ebook.
Last synced: 19 Mar 2026
https://github.com/getindata/dbt-intro
Introductory repository to dbt with the use of data-pipelines-cli Topics Resources
Last synced: 19 Mar 2026
https://github.com/getindata/terraform-azurerm-subscription
Terraform Module for Azure Subscription
Last synced: 19 Mar 2026
https://github.com/getindata/dbt-workflows-factory
Creates dbt based GCP workflows.
Last synced: 17 Mar 2025
https://github.com/getindata/terraform-azuread-group
Terraform module for managing Azure Active Directory Groups
Last synced: 23 Feb 2026
https://github.com/getindata/terraform-azurerm-keyvault-secret-data-source
Terraform module for reading secrets from Azure KeyVault
azure keyvault terraform terraform-module
Last synced: 13 May 2026
https://github.com/getindata/terraform-google-group-users-data-source
Module returns a map of users, where keys in the maps are the group names and the values is list of users that belong to the group
Last synced: 07 Feb 2026
https://github.com/getindata/terraform-snowflake-database-role
Terraform Database role module for Snowflake
Last synced: 03 Jan 2026