Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/Snowflake-Labs/sfguide-recommender-pipeline

Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker

amazon-sagemaker machine-learning sagemaker sagemaker-example snowflake snowflake-guide snowflakedb

Last synced: 21 Jun 2024

https://github.com/Snowflake-Labs/sfguide-twitter-auto-ingest

Learn how to auto-ingest streaming data into Snowflake using Snowpipe.

demo snowflake snowpipe twitter

Last synced: 21 Jun 2024

https://github.com/CartoDB/analytics-toolbox-core

A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities

analytics-toolbox bigquery carto databricks geospatial gis postgres redshift snowflake sql

Last synced: 21 Jun 2024

https://github.com/Snowflake-Labs/Excelerator

This is an Excel Addin for Windows that reads and writes data to Snowflake

excel odbc-driver snowflake windows

Last synced: 16 Jun 2024

https://github.com/Snowflake-Labs/awesome-snowflake

A curated list of resources about Snowflake

awesome-list snowflake snowflakedb

Last synced: 16 Jun 2024

https://github.com/bruin-data/ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

bigquery copy-database data-ingestion data-integration data-pipeline duckdb ingestion-pipeline mssql postgresql snowflake

Last synced: 15 Jun 2024

https://github.com/starlake-ai/jsqltranspiler

Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.

bigquery databricks duckdb java query redshift rewrite snowflake transpiler

Last synced: 13 Jun 2024

https://github.com/gp2112/snowflake-cli

A simple python script for hosting a Snowflake Proxy in your python program or with it's standalone cli

onion privacy proxy python snowflake tor

Last synced: 07 Jun 2024

https://github.com/artie-labs/transfer

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.

apache-kafka bigquery cdc change-data-capture data-integration data-pipelines database debezium elt golang kafka redshift snowflake

Last synced: 07 Jun 2024

https://github.com/PeerDB-io/peerdb

Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage

bigquery cdc clickhouse cloud-native distributed-systems etl eventhubs kafka postgres postgresql realtime rust s3 snowflake sql stream-processing

Last synced: 07 Jun 2024

https://github.com/Multiwoven/multiwoven

๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ Open Source Alternative to Hightouch, Census, and RudderStack. Leading Reverse ETL and Customer Data Platform (CDP) for Data Teams.

bigquery cdp customer-data-platform data-activation data-engineering data-pipeline data-warehouse databricks dbt etl hacktoberfest open-source postresql react redshift reverse-etl ruby self-hosted snowflake typescript

Last synced: 05 Jun 2024

https://github.com/apache/doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

bigquery database dbt delta-lake elt etl hadoop hive hudi iceberg lakehouse olap query-engine real-time redshift snowflake spark sql

Last synced: 31 May 2024

https://github.com/discordlookup/discordlookup

DiscordLookup | Get more out of Discord with Discord Lookup! Snowflake Decoder, Guild List with Stats, Invite Info and more...

discord discord-bot discord-tools discordlookup experiments hacktoberfest lookup snowflake tools

Last synced: 31 May 2024

https://github.com/RobThree/IdGen

Twitter Snowflake-alike ID generator for .Net

c-sharp dotnet idgenerator snowflake twitter-snowflake unique-id

Last synced: 31 May 2024

https://github.com/zsvoboda/dbd

dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.

bigquery csv database database-schemas elt etl excel json mysql parquet postgresql python python3 redshift snowflake sql sqlite xls xlsx

Last synced: 27 May 2024

https://github.com/vd2org/snowflake

The Snowflake generator done right.

id id-generator ids snowflake snowflake-id

Last synced: 21 May 2024

https://github.com/yitter/IdGenerator

๐Ÿ’Žๅคš่ฏญ่จ€ๅฎž็Žฐ๏ผŒ้ซ˜ๆ€ง่ƒฝ็”Ÿๆˆๅ”ฏไธ€ๆ•ฐๅญ—IDใ€‚ ๐Ÿ’Žไผ˜ๅŒ–็š„้›ช่Šฑ็ฎ—ๆณ•๏ผˆSnowFlake๏ผ‰โ€”โ€”้›ช่Šฑๆผ‚็งป็ฎ—ๆณ•๏ผŒๅœจ็ผฉ็ŸญID้•ฟๅบฆ็š„ๅŒๆ—ถ๏ผŒๅ…ทๅค‡ๆž้ซ˜็žฌๆ—ถๅนถๅ‘ๅค„็†่ƒฝๅŠ›๏ผˆ50W/0.1s๏ผ‰ใ€‚ ๐Ÿ’ŽๅŽŸ็”Ÿๆ”ฏๆŒ C#/Java/Go/Rust/C/JavaScript/TypeScript/Python/Pascal ๅคš่ฏญ่จ€๏ผŒๆไพ›ๅ…ถๅฎƒ้€‚็”จไบŽๅ…ถๅฎƒ่ฏญ่จ€็š„ๅคš็บฟ็จ‹ๅฎ‰ๅ…จ่ฐƒ็”จๅŠจๆ€ๅบ“๏ผˆFFI๏ผ‰ใ€‚๐Ÿ’Žๆ”ฏๆŒๅฎนๅ™จ็Žฏๅขƒ่‡ชๅŠจๆ‰ฉๅฎน๏ผˆ่‡ชๅŠจๆณจๅ†Œ WorkerId ๏ผ‰๏ผŒๅ•ๆœบๆˆ–ๅˆ†ๅธƒๅผๅ”ฏไธ€IdGeneratorใ€‚๐Ÿ’Ž้กถๅฐ–ไผ˜ๅŒ–๏ผŒ่ถ…ๅผบๆ•ˆ่ƒฝใ€‚

c c-sharp delphi dotnet go idgenerator java javascript pascal php rust snowflake sql twritter-snowflake typescript unique-id

Last synced: 18 May 2024

https://github.com/snowflakedb/snowflake-cli

Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations.

cli data-ops devops-tools snowflake sql

Last synced: 15 May 2024

https://github.com/HotPotatoC/snowflake

โ„๏ธ Simple twitter's snowflake uniquely identifiable descriptors (IDs) format generator for Go

go golang snowflake snowflake-twitter unique-id unique-identifier

Last synced: 14 May 2024

https://github.com/omnata-labs/dbt-ml-preprocessing

A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.

bigquery dbt redshift scikit-learn snowflake

Last synced: 13 May 2024

https://github.com/ScalefreeCOM/datavault4dbt

Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.

azure-synapse bigquery datavault dbt dbt-packages exasol google-bigquery hubs links pits postgresql redshift satellites scalefree snapshots snowflake sourcemarts stagingarea

Last synced: 13 May 2024

https://github.com/get-select/dbt-snowflake-monitoring

A dbt package from SELECT to help you monitor Snowflake performance and costs

dbt snowflake

Last synced: 13 May 2024

https://github.com/datnguye/dbt-translate

โœจ Let's translate Numbers to Words โœจ

dbt money2word num2word package snowflake sql sqlserver udf

Last synced: 13 May 2024

https://github.com/bwmarrin/snowflake

A simple to use Go (golang) package to generate or parse Twitter snowflake IDs

go golang snowflake

Last synced: 13 May 2024

https://github.com/dbt-checkpoint/dbt-checkpoint

:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.

bigquery business-intelligence dbt pre-commit pre-commit-hook quality-assurance snowflake sql

Last synced: 13 May 2024

https://github.com/elementary-data/elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake

Last synced: 13 May 2024

https://github.com/ByConity/ByConity

ByConity is an open source cloud data warehouse

bytedance clickhouse clickhouse-database cloud kubernets lakehouse olap s3 snowflake sql tiktok

Last synced: 13 May 2024

https://github.com/DataKitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake

Last synced: 12 May 2024

https://github.com/airbytehq/airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

bigquery change-data-capture data data-analysis data-collection data-engineering data-integration data-pipeline elt etl java mssql mysql pipeline postgresql python redshift s3 self-hosted snowflake

Last synced: 09 May 2024

https://github.com/cuebook/CueObserve

Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases

anomaly anomaly-detection bigquery datawarehouse prophet-facebook redshift root-cause-analysis snowflake sql timeseries-analysis timeseries-forecasting

Last synced: 09 May 2024

https://github.com/snowflakedb/snowflake-kafka-connector

Snowflake Kafka Connector (Sink Connector)

java kafka kafka-connect snowflake

Last synced: 07 May 2024

https://github.com/rdagumampan/yuniql

Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!

amazon-rds azure-sql-database data-engineering database-migrations datawarehouse dotnet-core dotnet-tool mariadb mysql oracle postgresql redshift snowflake sql sqlserver yuniql

Last synced: 05 May 2024

https://github.com/snowflake-so/move-cron-parser

Cron parser library written for Move language

cron cron-parser snowflake

Last synced: 03 May 2024

https://github.com/Ahoo-Wang/CosId

Universal, flexible, high-performance distributed ID generator. | ้€š็”จใ€็ตๆดปใ€้ซ˜ๆ€ง่ƒฝ็š„ๅˆ†ๅธƒๅผ ID ็”Ÿๆˆๅ™จ

clock clock-synchronization cloud-native distributed generator gradle id id-generator idgenerator java k8s kubernetes microservice redis sharding snowflake spring spring-boot spring-cloud zookeeper

Last synced: 02 May 2024

https://github.com/muyo/sno

Compact, sortable and fast unique IDs with embedded metadata.

id-generator snowflake unique-id uuid

Last synced: 30 Apr 2024

https://github.com/edwingeng/wuid

An extremely fast globally unique number generator.

guid snowflake uuid

Last synced: 30 Apr 2024

https://github.com/osamingo/indigo

A distributed unique ID generator of using Sonyflake and encoded by Base58

base58 go id identity snowflake sonyflake

Last synced: 29 Apr 2024

https://github.com/bytebase/bytebase

The GitLab/GitHub for database DevOps. World's most advanced database DevOps and CI/CD for Developer, DBA and Platform Engineering teams.

cicd clickhouse dbeaver devops github gitlab gitops mongodb mysql oracle postgres postgresql redis schema-changes schema-migrations snowflake sql sql-client sqlserver tidb

Last synced: 29 Apr 2024

https://github.com/build-trust/ockam

Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications โ€“ at massive scale.

authentication authorization credentials distributed-systems e2ee encrypted-connections encrypted-messages encryption end-to-end-encryption identity kafka key-management messaging rust security snowflake trust zero-trust

Last synced: 29 Apr 2024

https://github.com/datafuselabs/databend

๐——๐—ฎ๐˜๐—ฎ, ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ & ๐—”๐—œ. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

ai bigdata database rust serverless snowflake

Last synced: 29 Apr 2024

https://github.com/bartonhammond/snowflake

:snowflake: A React-Native Android iOS Starter App/ BoilerPlate / Example with Redux, RN Router, & Jest with the Snowflake Hapi Server running locally or on RedHat OpenShift for the backend, or a Parse Server running locally or remotely on Heroku

async-storage bitrise eslint fastlane hapi-server i18n immutablejs jest key-mirror parse-server react-native react-native-router-flux react-native-vector-icons redux snowflake standard-js tcomb-form-native validate

Last synced: 25 Apr 2024

https://github.com/dragonfruitnetwork/onionfruit

OnionFruitโ„ข Connect - Tor access client with country selection, bridge configuration, pluggable transports and experimental DNS support

bridge dns meek obfs4 obfs4proxy onion onionfruit proxy snowflake socks5 tor tor-bridge tor-browser tor-network vpn

Last synced: 24 Apr 2024

https://github.com/marco-roy/DDO

A DBT package to perform DataOps & administrative CI/CD on your data warehouse.

data dataops datawarehouse datawarehouseautomation dbt snowflake

Last synced: 22 Apr 2024

https://github.com/rupurt/odbc-scanner-duckdb-extension

A DuckDB extension to read data directly from databases supporting the ODBC interface

analytics bigquery columnar-database cpp data-engineering db2 duckdb mariadb mssql mysql nix odbc olap oracle postgres snowflake vector-engine

Last synced: 20 Apr 2024

https://github.com/jitsucom/jitsu

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

bigquery clickhouse data-collection data-connectors data-integration golang postgres redshift snowflake

Last synced: 18 Apr 2024

https://github.com/Tencent/APIJSON

๐Ÿ† ้›ถไปฃ็ ใ€ๅ…จๅŠŸ่ƒฝใ€ๅผบๅฎ‰ๅ…จ ORM ๅบ“ ๐Ÿš€ ๅŽ็ซฏๆŽฅๅฃๅ’Œๆ–‡ๆกฃ้›ถไปฃ็ ๏ผŒๅ‰็ซฏ(ๅฎขๆˆท็ซฏ) ๅฎšๅˆถ่ฟ”ๅ›ž JSON ็š„ๆ•ฐๆฎๅ’Œ็ป“ๆž„ใ€‚ ๐Ÿ† A JSON Transmission Protocol and an ORM Library ๐Ÿš€ provides APIs and Docs without writing any code.

baas clickhouse crud databricks elasticsearch hadoop hive influxdb low-code lowcode milvus nocode oracle postgresql postgresql-database serverless snowflake sqlserver tdengine tidb

Last synced: 13 Apr 2024

https://github.com/getstrm/pace

Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery, with definitions imported from Collibra, Datahub, ODD and the like.

bigquery data-catalog data-contracts data-governance data-processing databricks policy-enforcement snowflake

Last synced: 11 Apr 2024

https://github.com/Datavault-UK/automate-dv

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

data-vault dataengineering datalake datavault datavault20 datawarehouse datawarehousing dbt elt etl metadata snowflake sql

Last synced: 10 Apr 2024

https://github.com/Snowflake-Labs/terraform-provider-snowflake

Terraform provider for managing Snowflake accounts

snowflake terraform

Last synced: 10 Apr 2024

https://github.com/rpcxio/did

้ซ˜ๆ€ง่ƒฝ็š„ID็”Ÿๆˆๅ™จ, ๅŸบไบŽrpcxๅ’ŒMemcachedๅ่ฎฎๆไพ›็ฝ‘็ปœๆœๅŠก่ฐƒ็”จ

idgen memcached snowflake sonyflake

Last synced: 09 Apr 2024

https://github.com/infinitelambda/dbt-tags

Tag-based masking policies management in Snowflake โ„๏ธ ๐Ÿท๏ธ

dbt masking-policy snowflake tags

Last synced: 08 Apr 2024

https://github.com/infinitelambda/dbt-data-diff

Data-diff solution for dbt-ers with Snowflake โ„๏ธ ๐Ÿš€

data-diff dbt package snowflake stored-procedure

Last synced: 08 Apr 2024

https://github.com/oduwsdl/tweetedat

TweetedAt tells the time of a tweet based on its tweet id

html javascript snowflake snowflake-twitter tweet-time tweetedat twitter

Last synced: 31 Mar 2024

https://github.com/astronomer/astro-sdk

Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

airflow apache-airflow bigquery dags data-analysis data-science elt etl gcs pandas postgres python s3 snowflake sql sqlite workflows

Last synced: 24 Mar 2024

https://github.com/hopinc/pika

๐Ÿ†” The friendly, pragmatic and functional ID system

id snowflake

Last synced: 23 Mar 2024

https://github.com/scottpersinger/pgwarehouse

Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.

analytics clickhouse data-warehouse postgres postgresql snowflake synchronization warehouse

Last synced: 21 Mar 2024

https://github.com/tokern/piicatcher

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

aws-athena aws-glue aws-redshift catalog data data-catalog database phi pii python snowflake

Last synced: 20 Mar 2024

https://github.com/hcodes/snowflakes

โ„๏ธ Falling snowflakes

christmas css fullscreen happy-new-year js snowflake

Last synced: 18 Mar 2024

https://github.com/getdozer/dozer

Dozer is a real-time data platform for building, deploying and maintaining data products.

api apis data ethereum etl grpc low-code postgres realtime rest rest-api rust snowflake sql streaming

Last synced: 15 Mar 2024

https://github.com/dbcli/odbc-cli

ODBC CLI with autocompletion, syntax highlighting, object preview

database mysql odbc postgres python snowflake sqlite sqlserver

Last synced: 13 Mar 2024