Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by databricks

A curated list of projects in awesome lists by databricks .

https://github.com/databricks/learning-spark

Example code from Learning Spark book

Last synced: 01 Aug 2024

https://github.com/databricks/koalas

Koalas: pandas API on Apache Spark

big-data data-science dataframe mlflow pandas pydata spark

Last synced: 31 Jul 2024

https://github.com/databricks/Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

Last synced: 31 Jul 2024

https://github.com/databricks/scala-style-guide

Databricks Scala Coding Style Guide

Last synced: 01 Aug 2024

https://github.com/databricks/dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

databricks gen-ai generative-ai llm llm-inference llm-training mosaic-ai

Last synced: 01 Aug 2024

https://github.com/databricks/click

The "Command Line Interactive Controller for Kubernetes"

cli kubectl kubernetes rust

Last synced: 31 Jul 2024

https://github.com/databricks/spark-sklearn

(Deprecated) Scikit-learn integration package for Apache Spark

apache-spark grid-search machine-learning parameter-tuning scikit-learn

Last synced: 31 Jul 2024

https://github.com/databricks/spark-csv

CSV Data Source for Apache Spark 1.x

Last synced: 03 Aug 2024

https://github.com/tjhunter/tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

Last synced: 24 Aug 2024

https://github.com/databricks/tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

Last synced: 31 Jul 2024

https://github.com/databricks/devrel

This repository contains the notebooks and presentations we use for our Databricks Tech Talks

Last synced: 01 Aug 2024

https://github.com/databricks/reference-apps

Spark reference applications

Last synced: 03 Aug 2024

https://github.com/databricks/spark-avro

Avro Data Source for Apache Spark

Last synced: 03 Aug 2024

https://github.com/databricks/spark-xml

XML data source for Spark SQL and DataFrames

Last synced: 06 Aug 2024

https://github.com/databricks/spark-corenlp

Stanford CoreNLP wrapper for Apache Spark

Last synced: 03 Aug 2024

https://github.com/databricks/databricks-cli

(Legacy) Command Line Interface for Databricks

Last synced: 02 Aug 2024

https://github.com/databricks/sjsonnet

Last synced: 01 Aug 2024

https://github.com/databricks/jsonnet-style-guide

Databricks Jsonnet Coding Style Guide

Last synced: 01 Aug 2024

https://github.com/databricks/mlflow

Open source platform for the machine learning lifecycle

Last synced: 02 Aug 2024

https://github.com/databricks/terraform-databricks-lakehouse-blueprints

Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorporated best practices across the industries we work with to deliver composable modules to build a workspace to comply with the highest platform security and governance standards.

financial-services hls regulated-industry-blueprints terraform

Last synced: 13 Aug 2024

https://github.com/databricks/ide-best-practices

Best practices for working with Databricks from an IDE

Last synced: 02 Aug 2024