An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by getindata

A curated list of projects in awesome lists by getindata.

https://github.com/getindata/flink-http-connector

HTTP connector for Apache Flink. Provides sources and sinks for the DataStream, Table, and SQL APIs.

data-streaming flink flink-sql flink-stream-processing java

Last synced: 12 Jan 2026

https://github.com/getindata/dbt-flink-adapter

Adapter for dbt that executes dbt pipelines on Apache Flink

apache-flink data-streaming dbt streaming-analytics

Last synced: 07 May 2025

https://github.com/getindata/kedro-kubeflow

Kedro Plugin to support running workflows on Kubeflow Pipelines

ai-pipelines kedro kedro-kubeflow kedro-plugin kubeflow kubeflow-pipelines machinelearning mlops

Last synced: 30 Apr 2025

https://github.com/getindata/dbt-airflow-factory

Library to convert DBT manifest metadata to Airflow tasks

Last synced: 09 Apr 2025

https://github.com/getindata/kedro-azureml

Kedro plugin to support running workflows on Microsoft Azure ML Pipelines

azure-ml kedro kedro-plugin machinelearning mlops

Last synced: 27 Jul 2025

https://github.com/getindata/kedro-vertexai

Kedro Plugin to support running workflows on GCP Vertex AI Pipelines

googlecloudplatform kedro kedro-plugin machinelearning mlops vertexai

Last synced: 06 Apr 2025

https://github.com/getindata/kedro-airflow-k8s

Kedro Plugin to support running pipelines on Kubernetes using Airflow.

airflow k8s kedro kedro-airflow kedro-plugin kuberentes machinelearning mlops

Last synced: 09 Apr 2025

https://github.com/getindata/datapill

Big Data Newsletter

Last synced: 09 Feb 2026

https://github.com/getindata/kedro-sagemaker

Kedro Plugin to support running pipelines on AWS SageMaker.

kedro kedro-plugin machinelearning mlops sagemaker

Last synced: 27 Jul 2025

https://github.com/getindata/data-pipelines-cli

CLI for data platform

Last synced: 09 Apr 2025

https://github.com/getindata/quickstart-ml-blueprints

Data science project development best practices and state-of-the-art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.

data-science machine-learning

Last synced: 09 Apr 2025

https://github.com/getindata/flink-dynamic-cep-demo

Flink dynamic CEP demo

Last synced: 07 Sep 2025

https://github.com/getindata/docker-atlantis

Custom Atlantis docker image developed by GetInData

Last synced: 08 Oct 2025

https://github.com/getindata/kedro-snowflake

Kedro Snowflake / Snowpark plugin

kedro machine-learning mlops snowflake snowpark

Last synced: 09 Apr 2025

https://github.com/getindata/flink-spring

A library that allows using the Spring dependency injection framework in Flink jobs

Last synced: 24 Apr 2025

https://github.com/getindata/terraform-module-template

Terraform module template - boilerplate used to simplify creation of new Terraform modules

terraform

Last synced: 08 Jul 2025

https://github.com/getindata/helm-charts

GetInData Helm Charts repository

helm helm-charts k8s repository

Last synced: 26 Jul 2025

https://github.com/getindata/terraform-snowflake-role

Terraform module for managing Snowflake role and grants

module snowflake terraform

Last synced: 09 Apr 2025

https://github.com/getindata/terraform-azurerm-atlantis

Terraform module for deploying Atlantis in Azure Container Group

atlantis azure terraform terraform-module terragrunt

Last synced: 09 Apr 2025

https://github.com/getindata/flink-sql-runner

Framework for scheduling Flink SQL jobs on AWS Elastic MapReduce or a standalone Flink cluster.

Last synced: 09 Apr 2025

https://github.com/getindata/jupyter-images

Recipes for publicly available Jupyter images

jupyter jupyter-notebook jupyterlab mlops mlops-environment

Last synced: 07 Jul 2025

https://github.com/getindata/terraform-aws-organization

Terraform module for AWS Organization management

Last synced: 11 Jul 2025

https://github.com/getindata/hive-metastore

Hive-metastore helm chart

Last synced: 04 Jan 2026

https://github.com/getindata/gitlab_cicd_templates

The project contains templates for CICD processes

Last synced: 03 Jan 2026

https://github.com/getindata/mlflow-docker

Docker image for MLflow.

docker-image mlflow mlops

Last synced: 09 Apr 2025

https://github.com/getindata/data-pipelines-template-example

The project contains an example template for creating a pipeline project with the GetInData Framework based on dbt

Last synced: 23 Oct 2025

https://github.com/getindata/terraform-azurerm-storage-account

Terraform Module for Azure Storage Account

Last synced: 10 Sep 2025

https://github.com/getindata/terraform-null-atlantis-repo-config

Module for generating an Atlantis repo config file. It contains a set of custom workflows.

Last synced: 04 Jan 2026

https://github.com/getindata/streaming-cli

Python CLI for streaming platform

Last synced: 09 Sep 2025

https://github.com/getindata/quickstart-ml-starter

Kedro starters to quickly set up new projects according to QuickStart ML Blueprints practice.

data-science machine-learning

Last synced: 30 Oct 2025

https://github.com/getindata/terraform-snowflake-database

Terraform module for managing Snowflake database

database schema snowflake snowflake-db terraform terraform-module

Last synced: 06 Mar 2026

https://github.com/getindata/flink-elastic-catalog

Flink Catalog for Elasticsearch.

Last synced: 02 Aug 2025

https://github.com/getindata/training-infra

Scripts setting up infrastructure for trainings

Last synced: 09 Apr 2025

https://github.com/getindata/streaming-ml-with-flink

Demo of running SciKit model on Flink, using Mleap serialization

Last synced: 17 Mar 2025

https://github.com/getindata/py-pre-commit-hooks

A small repository that adds Python-based hooks for pre-commit

Last synced: 03 Jan 2026

https://github.com/getindata/terraform-azurerm-mlflow

Module for deploying serverless MLflow instance on Azure, using Serverless SQL Server, Container App Service and Azure Blob Storage.

Last synced: 19 Mar 2026

https://github.com/getindata/kedro-starters

Kedro starters by GetInData

Last synced: 09 Apr 2025

https://github.com/getindata/terraform-snowflake-user

Terraform module for creating Snowflake users

Last synced: 26 Feb 2026

https://github.com/getindata/docker-image-template

Docker image template - boilerplate used to simplify creation of new docker images

Last synced: 19 Mar 2026

https://github.com/getindata/terraform-snowflake-warehouse

Terraform module for Snowflake Warehouse management

module snowflake snowflakedb terraform

Last synced: 14 Feb 2026

https://github.com/getindata/terraform-snowflake-privatelink-aws

Terraform module for Snowflake AWS PrivateLink management

aws module privatelink snowflake terraform

Last synced: 20 May 2026

https://github.com/getindata/terraform-azurerm-public-ip

Terraform module for managing Azure Public IP

azure public-ip terraform terraform-module

Last synced: 17 May 2026

https://github.com/getindata/example-kedro-azureml-pytorch-distributed

This repository contains an example project showing how to run distributed PyTorch training on Azure ML pipelines with Kedro. See the related blogpost.

Last synced: 17 Mar 2025

https://github.com/getindata/looker-pre-commit

A set of pre-commit hooks for Looker

Last synced: 19 Mar 2026

https://github.com/getindata/github-workflows

Collection of reusable GitHub Actions workflows

Last synced: 03 Jan 2026

https://github.com/getindata/terraform-aws-budget

Terraform module to manage AWS Budgets

Last synced: 03 Jan 2026

https://github.com/getindata/terraform-snowflake-schema

Terraform module for managing Snowflake schemas

schema snowflake terraform terraform-module

Last synced: 08 Feb 2026

https://github.com/getindata/terraform-azurerm-container-group

Terraform Module for creating Azure Container Group

Last synced: 19 Mar 2026

https://github.com/getindata/mlflow-appengine-terraform

Terraform module for deploying MLflow on Google Cloud AppEngine Flexible

Last synced: 19 Mar 2026

https://github.com/getindata/gid-mdp-workshop

Repository with lab exercises and their solutions

Last synced: 03 Jan 2026

https://github.com/getindata/terraform-snowflake-stage

Terraform module for managing Snowflake stages

Last synced: 03 Jan 2026

https://github.com/getindata/test-spark-app

Skeleton for Spark Application with HiveContext and tests

Last synced: 29 Jul 2025

https://github.com/getindata/terraform-azurerm-module-template

Terraform module template for creating Azure-based Terraform modules

Last synced: 03 Jan 2026

https://github.com/getindata/dbt-common-macros

It contains macros shared between projects.

Last synced: 19 Mar 2026

https://github.com/getindata/terraform-snowflake-resource-monitor

Terraform module that can create and manage Snowflake Resource Monitors.

Last synced: 19 Mar 2026

https://github.com/getindata/flink-ververica-catalog-proxy

Proxy to the internal Ververica Catalog via the Ververica REST API

flink flink-sql sql ververica-platform

Last synced: 22 Jun 2025

https://github.com/getindata/streaming-ml-with-ksql

Demo of running Spark MLLib model on Kafka with KSQL, using Mleap serialization

Last synced: 17 Mar 2025

https://github.com/getindata/terraform-azurerm-tenant-management

Terraform module for configuring Azure Tenant: Management Groups, Azure Policy, Subscription.

Last synced: 19 Mar 2026

https://github.com/getindata/kafka-streams-avro

Example application for Kafka Streams training

Last synced: 17 Mar 2025

https://github.com/getindata/ververica-platform-flink-workshop

All tools for local Ververica Platform setup with Flink SQL

Last synced: 19 Mar 2026

https://github.com/getindata/terraform-gke-helm-release

GKE Helm release module

Last synced: 19 Mar 2026

https://github.com/getindata/terraform-snowflake-shared-database

Terraform module for managing Shared Databases in Snowflake

Last synced: 19 Mar 2026

https://github.com/getindata/data-pipelines-cli-init-example

An example init template for the Data Pipelines CLI tool

Last synced: 19 Mar 2026

https://github.com/getindata/mlops-gcp-vertex-snowflake-dbt

The repository contains code samples from "MLOPs for Pro's - Technical perspective. Build a Feature Store Faster - an Introduction to Vertex AI, Snowflake and dbt Cloud" ebook.

Last synced: 19 Mar 2026

https://github.com/getindata/dbt-intro

Introductory repository to dbt with the use of data-pipelines-cli

Last synced: 19 Mar 2026

https://github.com/getindata/terraform-azurerm-subscription

Terraform Module for Azure Subscription

Last synced: 19 Mar 2026

https://github.com/getindata/mlflow-demo

mlflow-demo

Last synced: 17 Mar 2025

https://github.com/getindata/dbt-workflows-factory

Creates dbt based GCP workflows.

Last synced: 17 Mar 2025

https://github.com/getindata/terraform-azuread-group

Terraform module for managing Azure Active Directory Groups

Last synced: 23 Feb 2026

https://github.com/getindata/terraform-azurerm-keyvault-secret-data-source

Terraform module for reading secrets from Azure KeyVault

azure keyvault terraform terraform-module

Last synced: 13 May 2026

https://github.com/getindata/terraform-google-group-users-data-source

Module that returns a map of users, where the keys are group names and each value is the list of users that belong to that group

Last synced: 07 Feb 2026

https://github.com/getindata/terraform-snowflake-database-role

Terraform Database role module for Snowflake

Last synced: 03 Jan 2026