An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-modeling

A curated list of projects in awesome lists tagged with data-modeling.

https://github.com/dbt-labs/dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

analytics business-intelligence data-modeling dbt-viewpoint elt pypa slack

Last synced: 09 Apr 2026

https://github.com/quarylabs/quary

Open-source BI for engineers

analytics big-data business-intelligence data-modeling elt

Last synced: 24 Jan 2026

https://github.com/data-engineering-community/data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

data data-engineer data-engineering data-modeling data-pipelines database etl sql

Last synced: 14 May 2025

https://github.com/bruin-data/bruin

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

analytics bigquery data-analysis data-ingestion data-modeling data-pipelines data-platform data-transformation python snowflake sql

Last synced: 23 Apr 2026

https://github.com/dbt-labs/metricflow

MetricFlow allows you to define, build, and maintain metrics in code.

analytics business-intelligence data data-modeling metrics pypi semantic-layer

Last synced: 13 May 2025

https://github.com/mswjs/data

Data modeling and relation library for testing JavaScript applications.

api data-modeling fixtures mocking orm testing testing-tool

Last synced: 16 May 2025

https://github.com/fal-ai/dbt-fal

Do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies, and build machine learning models.

analytics data-modeling dbt machine-learning machinelearning pandas python

Last synced: 18 Jul 2025

https://github.com/chaoss/augur

Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/

chaoss data-collection data-modeling data-visualization defined-metrics facade git github hacktoberfest hacktoberfest2020 health linux linux-foundation metrics open-source opensource python-library research sustainability unix

Last synced: 21 Jan 2026

https://github.com/ananas-analytics/ananas-desktop

A hackable data integration & analysis tool to enable non-technical users to edit data processing jobs and visualise data on demand.

analytics business-intelligence data-modeling etl hackable-data visualization

Last synced: 03 Apr 2025

https://github.com/hofstadter-io/hof

Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.

code-generator cue cuelang data-modeling declarative-programming hacktoberfest hofstadter llm migrations-generator tui workflow-engine

Last synced: 03 Jan 2026

https://github.com/3rd/tsdiagram

Create diagrams and plan your code with TypeScript.

data-modeling diagram diagrams typescript

Last synced: 16 May 2025

https://github.com/tellery/tellery

Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.

analytics bigquery business-intelligence collaboration dashboard data-analytics data-modeling data-science data-visualization database dbt notebook self-hosted sql

Last synced: 16 May 2025

https://github.com/aws-samples/amazon-dynamodb-design-patterns

This repo contains sample data models to demonstrate design patterns for Amazon DynamoDB.

data-modeling dynamodb

Last synced: 31 Mar 2025

https://github.com/p2p-ld/numpydantic

Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)

arrays dask data-modeling hdf5 numpy pydantic pydantic-numpy serialization validation zarr

Last synced: 06 Oct 2025

https://github.com/wittline/uber-expenses-tracking

The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.

airflow-docker apache-airflow aws aws-redshift data-engineering data-modeling etl-pipeline expenses-dashboard expenses-tracker power-bi python uber uber-data uber-eats

Last synced: 13 Apr 2025

https://github.com/mattiasthalen/adventure-works

Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.

analytical-data-storage-system data-architecture data-engineering data-modeling data-warehouse dimensional-modeling duckdb hook-methodology iceberg lakehouse serverless sqlmesh unified-star-schema

Last synced: 11 Feb 2026

https://github.com/oslabs-beta/graphql-blueprint

GraphQL Blueprint: a software developer tool for engineers that want to quickly generate React/Express, Apollo and GraphQL boilerplate code using a data modeling interface. Watch your queries, mutations, and schema update in realtime with our code preview feature and finally, export it when you're ready to begin building the rest of your app!

apollo boilerplate create-react-app data-modeling graphql mongodb node post postgresql react sql

Last synced: 30 Apr 2025

https://github.com/mara/mara-schema

Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables

data-governance data-modeling datawarehousing metadata python

Last synced: 30 Apr 2025

https://github.com/malloydata/malloy-composer

Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model

business-analytics business-intelligence data data-modeling data-visualization malloy semantic-modeling

Last synced: 08 May 2025

https://github.com/thefabric-io/eventsourcing

An efficient and robust Event Sourcing library for Go, designed for scalability and ease of use. Tailored for PostgreSQL, this library provides essential functionalities for storing and retrieving a sequence of events as the source of truth for the state of your application's aggregates. 🚀

aggregate data-modeling ddd event-sourcing events eventstore go-generics golang postgresql state-management

Last synced: 26 Jan 2026

https://github.com/mundipagg/amora-data-build-tool

Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.

analytics analytics-dashboard analytics-engineering bigquery business-intelligence data-engineering data-modeling datacleaning dataquality elt machine-learning python transformation

Last synced: 08 Sep 2025

https://github.com/nicosuave/awesome-sqlmesh

A curated list of awesome SQLMesh resources

data-modeling data-transformation sqlmesh

Last synced: 22 Jan 2026

https://github.com/harshcasper/helpinghand

Leveraging Intelligent Processing Tools and Algorithms to help the Visually Impaired see and navigate 💥✨

blind-people data-modeling deep-learning flutter flutter-apps helpinghand mobile-app

Last synced: 23 Mar 2025

https://github.com/chenqingspring/rules-based-modeling-engine

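A rule-based visual model-building engine. Supports metric definition, rule definition, integration of multiple data sources, and RESTful API queries.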

big-data data-modeling datawarehouse restful-api rule-engine sql-generator

Last synced: 31 Jan 2026

https://github.com/hakanensari/structure

Turn hashes into data objects

data-modeling ruby value-object

Last synced: 12 Oct 2025

https://github.com/rnd-forests/skyline-query

Simple implementation of spatial skyline query algorithms

algorithm branch-and-bound data-modeling nearest-neighbors r-tree skyline-query

Last synced: 10 Mar 2026

https://github.com/cleverage/eav-manager

Blazing fast data modeling and enrichment

admin data-management data-modeling symfony

Last synced: 12 Apr 2025

https://github.com/archiewood/analytics_monorepo

A monorepo combining data modelling in dbt with data viz using Evidence

data-modeling dbt evidence-dev visualization

Last synced: 20 Sep 2025

https://github.com/malloydata/malloy-samples

Malloy model examples and associated datasets

data data-modeling malloy semantic-modeling sql

Last synced: 16 Oct 2025

https://github.com/malloydata/malloy-vscode-extension

The Malloy Visual Studio Code extension facilitates building Malloy data models, querying and transforming data, and creating simple visualizations and dashboards

data data-modeling malloy semantic-modeling sql

Last synced: 26 Apr 2025

https://github.com/nichtich/tkz-orm

Object-Role Modeling diagrams in TeX

data-modeling diagrams tex tikz

Last synced: 10 Oct 2025

https://github.com/virajbhutada/us-healthcare-analysis-powerbi

Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.

data-analytics data-exploration data-modeling data-visualization datascience dax-expression decision-making healthcare-analysis healthcare-datasets insights interactive-visualizations microsoftpowerbi power-query powerbi powerbi-dashboards powerbi-desktop strategic-planning

Last synced: 04 Mar 2026

https://github.com/covesa/s2dm

A Simplified Semantic Data Modeling (S2DM) approach that offers a pragmatic balance between semantic rigor and usability for subject matter experts.

data-modeling graphql-sdl rdf semantics skos

Last synced: 26 Feb 2026

https://github.com/snowplow/dbt-snowplow-mobile

A fully incremental model that transforms raw mobile event data generated by the Snowplow mobile trackers into a series of derived tables of varying levels of aggregation.

data-modeling dbt mobile snowplow

Last synced: 21 Apr 2025

https://github.com/cefriel/competence-kg

A tutorial on Knowledge Graphs discussing how to model the employee competences within a company

competences cypher data-modeling knowledge-graph langchain llm sparql sql tutorial

Last synced: 03 Feb 2026

https://github.com/snowplow/dbt-snowplow-media-player

A fully incremental model that transforms media player event data generated by the Snowplow JavaScript tracker into derived tables for easier querying

data-modeling dbt snowplow snowplow-javascript-tracker

Last synced: 20 Apr 2025

https://github.com/zizmorcore/github-actions-models

Unofficial Rust data models for GitHub Actions

ci data-modeling github-actions

Last synced: 20 Jun 2025

https://github.com/woodruffw/github-actions-models

Unofficial Rust data models for GitHub Actions

ci data-modeling github-actions

Last synced: 16 Apr 2025

https://github.com/malloydata/malloy-cli

A command-line interface for executing Malloy and SQL

data data-modeling transformation

Last synced: 13 Jul 2025

https://github.com/memgonzales/pisa-2018-analysis

Jupyter notebook presenting the process of data preparation, research question formulation, data analysis, and data modeling with the goal of extracting insights from the 2018 PISA Dataset

data-cleaning data-modeling data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy oecd-data pandas pisa scipy statistical-inference

Last synced: 13 Jun 2025

https://github.com/virajbhutada/US-healthcare-analysis-powerBI

Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.

data-analytics data-exploration data-modeling data-visualization datascience dax-expression decision-making healthcare-analysis healthcare-datasets insights interactive-visualizations microsoftpowerbi power-query powerbi powerbi-dashboards powerbi-desktop strategic-planning

Last synced: 09 Oct 2025

https://github.com/thehyve/tmtk

tranSMART Arborist ETL toolkit

data-curation data-modeling jupyter-notebook transmart

Last synced: 02 Oct 2025

https://github.com/prajwalchapke055/accenture-data-analytics-and-visualization-forage

NAVIGATING NUMBERS - Apply your data analytics & visualization skills to advise a social media client on their content creation strategy as a Data Analyst at Accenture

accenture communication data-analysis data-modeling data-understanding data-visualization forage internship internship-task job-simulation presentation project-planning public-speaking storytelling strategy teamwork virtual-internship

Last synced: 24 Feb 2026

https://github.com/cvitter/riak-ts-data-modeling

An introduction to data modeling with Riak TS with hands-on examples.

data-modeling riak-ts

Last synced: 06 Mar 2026

https://github.com/vishnu-t-r/data-analytics-portfolio-projects

This repository contains data analyst portfolio projects developed using various data analytics tools including SQL, Python, Tableau, Looker, etc.

data data-analysis data-cleaning data-modeling data-visualization looker looker-studio python sql ssms tableau

Last synced: 23 Apr 2025

https://github.com/cognitedata/pygen

The Cognite Python Data Modeling SDK Generator

data-modeling sdk-python

Last synced: 16 Apr 2025

https://github.com/kathleenwest/wcfstockservicesingletonwithclientchannelfactory

This project presents a WCF Stock Service Library (StockServiceLib) that mimics a stock exchange. The service is implemented as a “singleton” and maintains persistent data between client calls and can handle multiple client sessions. The service is hosted via a console application (StockServiceHost). The client and service participate in a bi-directional/callback relationship. The client (StockClient) uses the ChannelFactory pattern as opposed to “Add Service Reference” with SVCUTIL. The client and service share a common assembly (SharedLib) that contains the key contract and data model information. Furthermore, a Utilities project is used by the client console application to facilitate user data entry and the complicated details of building and managing the WCF ChannelFactory connection implementation. The ProxyGen class inside the Utilities project abstracts the details of implementing and managing a generic ChannelFactory connection to a generic service for a client. Note: The Utilities project library was included as base code for my lab project to facilitate speedy completion; we were not expected to code this Utilities project ourselves due to complexity and time constraints. The remaining projects in the solution (SharedLib, StockClient, StockServiceHost, and StockServiceLib), I completed individually per requirements for the lab project.

callback callback-api channelfactory client-server client-server-example concurrency concurrent-programming contracts data-modeling data-models service service-library stock-market stock-market-simulator wcf wcf-bindings wcf-client wcf-proxy wcf-service wcf-service-client-demo

Last synced: 17 Mar 2025

https://github.com/hackolade/deltalake

Hackolade (https://hackolade.com) plugin for Delta Lake on Databricks

data-model data-modeling databricks delta-lake schema-design

Last synced: 13 Feb 2026

https://github.com/kathleenwest/WCFStockServiceSingletonWithClientChannelFactory

This project presents a WCF Stock Service Library (StockServiceLib) that mimics a stock exchange. The service is implemented as a “singleton” and maintains persistent data between client calls and can handle multiple client sessions. The service is hosted via a console application (StockServiceHost). The client and service participate in a bi-directional/callback relationship. The client (StockClient) uses the ChannelFactory pattern as opposed to “Add Service Reference” with SVCUTIL. The client and service share a common assembly (SharedLib) that contains the key contract and data model information. Furthermore, a Utilities project is used by the client console application to facilitate user data entry and the complicated details of building and managing the WCF ChannelFactory connection implementation. The ProxyGen class inside the Utilities project abstracts the details of implementing and managing a generic ChannelFactory connection to a generic service for a client. Note: The Utilities project library was included as base code for my lab project to facilitate speedy completion; we were not expected to code this Utilities project ourselves due to complexity and time constraints. The remaining projects in the solution (SharedLib, StockClient, StockServiceHost, and StockServiceLib), I completed individually per requirements for the lab project.

callback callback-api channelfactory client-server client-server-example concurrency concurrent-programming contracts data-modeling data-models service service-library stock-market stock-market-simulator wcf wcf-bindings wcf-client wcf-proxy wcf-service wcf-service-client-demo

Last synced: 25 Apr 2025

https://github.com/web2solutions/voodux

👻 VooduX - "with batteries included" agnostic scalable "M" layer for React, Vue, ExtJS, DHTMLX

context-api data-modeling dhtmlx extjs indexeddb offline-app offline-capable offline-first personal-web-application react reactjs redux single-page-applications vue vuejs vuex

Last synced: 19 Apr 2025

https://github.com/joyceannie/data-modeling-with-postgres

The main focus of the project is data modeling with Postgres and building an ETL pipeline using Python. The first step is to define fact and dimension tables for a star schema for a particular analytic focus. The second step is to write an ETL pipeline that transfers data from files in different directories into these tables in Postgres using Python and SQL.

data-engineering data-modeling postgresql python

Last synced: 24 Mar 2025

https://github.com/virajbhutada/power-bi-resources

Comprehensive Power BI resources covering interview questions, group training materials, project portfolio ideas, and a cheat sheet for quick reference. Elevate your Power BI skills with curated content designed to enhance your proficiency and boost project success.

data-analysis data-modeling data-visualization dax-expression interview-preparation portfolio-projects powerbi quick-reference visualization-techniques

Last synced: 03 Mar 2026

https://github.com/narius2030/sakila-datawarehouse-ssis

Implement a simple data warehouse to store Sakila data - Create data pipelines to extract, transform, and load data from source to warehouse - Retrieve data from the warehouse to explore and run several analyses

data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis

Last synced: 10 Oct 2025

https://github.com/giagiannis/data-profiler

Data profiler is an attempt to model the behavior of a given operator for a set of datasets.

bhattacharyya-coefficient data-modeling data-profiling data-science dataset machine-learning similarity-matrix

Last synced: 14 Feb 2026

https://github.com/ebabel-games/fish-tank-simulation

Simulate a fish tank with violent, hungry fishes! The Blessed Fish may come and heal fishes that attack him, otherwise it's each fish for himself.

aquarium data-modeling node-js-express simulation web-api

Last synced: 10 Apr 2026

https://github.com/gansanay/dbt-teradata

dbt adapter for Teradata data warehouses

analytics data-modeling elt

Last synced: 20 Mar 2025

https://github.com/cheikhnadiouf/clear-and-sane-data-modeling

Alert: Sadly, I don't have time to maintain this, but a new 2020 edition is now available at https://www.amazon.com/dp/B086ML4PGW/ref=cm_sw_r_sms_awdb_t1_Z6oHEbGMPYFXF. The "Clear and Sane" framework for rapid data modeling, from the book "Clear and Sane engineering" (by Cheikhna Diouf), a lightweight alternative to UML-like diagrams for engineers and technicians who have to collaborate with non-technical teams and those who have to create user guides.

clear data-model data-modeling diagram engineering framework libreoffice sane schema symbols symbols-collection sysml systemaker templates uml

Last synced: 10 Mar 2026

https://github.com/bartkl/metamorph

Metamorph is a Clojure library that enables the generation of an Avro schema from a given input SHACL model.

avro-schema data-modeling logical-data-model rdf schema-generation schema-transformations semantic-modeling shacl

Last synced: 11 Apr 2025

https://github.com/lmdcma27/dataexpert.io-boot-camp

Here I share my practices and solutions for the dataExpert BootCamp. You can follow along in the repo: https://github.com/DataExpert-io/data-engineer-handbook/tree/main/bootcamp/materials

analytical-patterns apache-flink apache-spark data-modeling data-quality-patterns unit-testing-pipelines

Last synced: 25 Feb 2026

https://github.com/roti/lut

A library for data modeling in Scala.

case-classes data-model data-modeling data-modelling scala

Last synced: 12 Apr 2025