Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/datawaves-xyz/dbt_datawaves_wallet_labels
Ethereum Wallet labels built using dbt.
blockchain blockchain-analytics databricks dbt ethereum nft whales
Last synced: 02 Jul 2024
![](https://github.com/datawaves-xyz.png)
https://github.com/CartoDB/analytics-toolbox-core
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
analytics-toolbox bigquery carto databricks geospatial gis postgres redshift snowflake sql
Last synced: 21 Jun 2024
![](https://github.com/CartoDB.png)
https://github.com/thoughtworks/mlops-platforms
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
azureml data-science databricks dataiku datarobot google-ai-platform h2oai iguazio knime kubeflow machine-learning mlflow mlops pachyderm sagemaker seldon
Last synced: 20 Jun 2024
![](https://github.com/thoughtworks.png)
https://microsoft.github.io/SynapseML/
Simple and Distributed Machine Learning
ai apache-spark azure big-data cognitive-services data-science databricks deep-learning http lightgbm machine-learning microsoft ml model-deployment onnx opencv pyspark scala spark synapse
Last synced: 19 Jun 2024
![](https://github.com/microsoft.png)
https://github.com/databrickslabs/dbx
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
ci cicd databricks databricks-api databricks-cli mlops
Last synced: 16 Jun 2024
![](https://github.com/databrickslabs.png)
https://github.com/starlake-ai/jsqltranspiler
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
bigquery databricks duckdb java query redshift rewrite snowflake transpiler
Last synced: 13 Jun 2024
![](https://github.com/starlake-ai.png)
https://github.com/hystax/optscale
FinOps and MLOps platform to run ML/AI and regular cloud workloads with optimal performance and cost.
aws azure cloud cloud-cost cloud-cost-intelligence cost-optimization data-science databricks devops experiment-tracking finops gcp kubernetes ml mlflow mlops paas-instrumentation paas-profiling s3-optimization
Last synced: 13 Jun 2024
![](https://github.com/hystax.png)
https://github.com/Azure-Samples/modern-data-warehouse-dataops
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
automatedtesting azure cicd data databricks datafactory dataops devops mdw
Last synced: 10 Jun 2024
![](https://github.com/Azure-Samples.png)
https://github.com/databrickslabs/ucx
Your best companion for upgrading to Unity Catalog. UCX will guide you, the Databricks customer, through the process of upgrading your account, groups, workspaces, jobs etc. to Unity Catalog.
databricks databricks-cli-installable unity-catalog
Last synced: 10 Jun 2024
![](https://github.com/databrickslabs.png)
https://github.com/lhbench/lhbench
Lakehouse storage system benchmark
apache-hudi apache-iceberg benchmark cidr database databricks delta-lake lakehouse
Last synced: 07 Jun 2024
![](https://github.com/lhbench.png)
https://github.com/Multiwoven/multiwoven
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack. Leading Reverse ETL and Customer Data Platform (CDP) for Data Teams.
bigquery cdp customer-data-platform data-activation data-engineering data-pipeline data-warehouse databricks dbt etl hacktoberfest open-source postresql react redshift reverse-etl ruby self-hosted snowflake typescript
Last synced: 05 Jun 2024
![](https://github.com/Multiwoven.png)
https://delta-io.github.io/delta-rs/
A native Rust library for Delta Lake, with bindings into Python
databricks delta delta-lake pandas pandas-dataframe python rust
Last synced: 27 May 2024
![](https://github.com/delta-io.png)
https://github.com/databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Last synced: 25 May 2024
![](https://github.com/databrickslabs.png)
https://github.com/mlcraft-io/mlcraft
Synmetrix – open source semantic layer / Boost your LLM precision
big-data bigquery business-intelligence clickhouse cube cubejs data-engineering databricks dremio druid firebolt llm prestodb redshift semantic-layer snowflake vertica
Last synced: 19 May 2024
![](https://github.com/mlcraft-io.png)
https://github.com/getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
analytics athena bi bigquery business-intelligence dashboard databricks hacktoberfest javascript mysql postgresql python redash redshift spark spark-sql visualization
Last synced: 16 May 2024
![](https://github.com/getredash.png)
https://github.com/delta-io/delta-rs
A native Rust library for Delta Lake, with bindings into Python
databricks delta delta-lake pandas pandas-dataframe python rust
Last synced: 11 May 2024
![](https://github.com/delta-io.png)
https://github.com/ossinova/awesome-databricks
A collection of useful and awesome Databricks resources
awesome awesome-list databricks guide lists resources
Last synced: 23 Apr 2024
![](https://github.com/ossinova.png)
https://github.com/aloneguid/stowage
Bloat-free, no BS cloud storage SDK.
aws-s3 azure-storage databricks gcp-storage
Last synced: 22 Apr 2024
![](https://github.com/aloneguid.png)
https://github.com/databrickslabs/cicd-templates
Manage your Databricks deployments and CI with code.
aws azure azure-devops cd-pipeline ci databricks github-actions gitlab mlops
Last synced: 19 Apr 2024
![](https://github.com/databrickslabs.png)
https://github.com/Azure/azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
apache-spark azure-cosmos-db azure-databricks changefeed connector cosmos-db databricks databricks-notebooks jupyter-notebook lambda-architecture pyspark spark
Last synced: 19 Apr 2024
![](https://github.com/Azure.png)
https://github.com/microsoft/SynapseML
Simple and Distributed Machine Learning
ai apache-spark azure big-data cognitive-services data-science databricks deep-learning http lightgbm machine-learning microsoft ml model-deployment onnx opencv pyspark scala spark synapse
Last synced: 17 Apr 2024
![](https://github.com/microsoft.png)
https://github.com/Tencent/APIJSON
🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.
baas clickhouse crud databricks elasticsearch hadoop hive influxdb low-code lowcode milvus nocode oracle postgresql postgresql-database serverless snowflake sqlserver tdengine tidb
Last synced: 13 Apr 2024
![](https://github.com/Tencent.png)
https://github.com/getstrm/pace
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery, with definitions imported from Collibra, Datahub, ODD and the like.
bigquery data-catalog data-contracts data-governance data-processing databricks policy-enforcement snowflake
Last synced: 11 Apr 2024
![](https://github.com/getstrm.png)
https://github.com/dotnet/spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
analytics apache-spark azure bigdata csharp databricks dotnet dotnet-core dotnet-standard emr fsharp hdinsight machine-learning microsoft spark spark-sql spark-streaming streaming tpcds tpch
Last synced: 11 Apr 2024
![](https://github.com/dotnet.png)
https://github.com/yueureka/WildFireDetection
Using U-Net Model to Detect Wildfire from Satellite Imagery
ai databricks deep-learning docker satellite-imagery sparkaisummit streamlit unet-model wildfire wildfire-detection
Last synced: 08 Apr 2024
![](https://github.com/yueureka.png)
https://github.com/databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
databricks gen-ai generative-ai llm llm-inference llm-training mosaic-ai
Last synced: 01 Apr 2024
![](https://github.com/databricks.png)
https://github.com/microsoft/nutter
Testing framework for Databricks notebooks
azuredevops databricks databricks-notebooks
Last synced: 29 Mar 2024
![](https://github.com/microsoft.png)
https://github.com/Azure/mmlspark
Simple and Distributed Machine Learning
ai apache-spark azure big-data cognitive-services data-science databricks deep-learning http lightgbm machine-learning microsoft ml model-deployment onnx opencv pyspark scala spark synapse
Last synced: 13 Mar 2024
![](https://github.com/microsoft.png)