Projects in Awesome Lists tagged with duckdb
A curated list of projects in awesome lists tagged with duckdb .
https://github.com/canner/wrenai
π€ Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, dashboards and BI. ππππ§βπ»
agent anthropic bedrock bigquery business-intelligence charts duckdb genbi llm openai postgresql rag spreadsheets sql sqlai text-to-sql text2sql vertex
Last synced: 12 May 2025
https://github.com/ibis-project/ibis
the portable Python dataframe library
bigquery clickhouse database datafusion duckdb impala mssql mysql pandas polars postgresql pyarrow pyspark python snowflake sql sqlite trino
Last synced: 13 May 2025
https://github.com/evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis self-hosted sql svelte tailwindcss webassembly
Last synced: 13 May 2025
https://github.com/lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust
Last synced: 05 May 2025
https://github.com/dotnetcore/freesql
.NET aot orm, VB.NET/C# orm, Mysql/PostgreSQL/SqlServer/Oracle orm, Sqlite/Firebird/Clickhouse/DuckDB orm, θΎΎζ’¦/ιδ»/θθ°·/ηΏ°ι«/ι«ζ― orm, η₯ι orm, εε€§ιη¨ orm, ε½δΊ§ orm, TDengine orm, QuestDB orm, MsAccess orm.
access clickhouse codefirst dameng duckdb firebird gbase hacktoberfest highgo kingbase mysql odbc oracle orm postgresql questdb shentongdata sqlite sqlserver xugu
Last synced: 14 May 2025
https://github.com/dotnetcore/FreeSql
π¦ .NET aot orm, C# orm, VB.NET orm, Mysql orm, Postgresql orm, SqlServer orm, Oracle orm, Sqlite orm, Firebird orm, θΎΎζ’¦ orm, δΊΊε€§ιδ» orm, η₯ι orm, ηΏ°ι« orm, εε€§ιη¨ orm, θθ°· orm, ε½δΊ§ orm, Clickhouse orm, DuckDB orm, TDengine orm, QuestDB orm, MsAccess orm.
access clickhouse codefirst dameng duckdb firebird gbase hacktoberfest highgo kingbase mysql odbc oracle orm postgresql questdb shentongdata sqlite sqlserver xugu
Last synced: 21 Mar 2025
https://github.com/antonycourtney/tad
A desktop application for viewing and analyzing tabular data
csv data-analysis data-science database desktop-application duckdb parquet-viewer pivot-tables pivots tabular-data
Last synced: 22 Mar 2025
https://github.com/bruin-data/ingestr
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
bigquery copy-database data-ingestion data-integration data-pipeline duckdb ingestion-pipeline mssql postgresql snowflake
Last synced: 13 May 2025
https://github.com/rilldata/rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
bi business-analytics csv data data-analysis data-visualization dataviz duckdb gcs golang parquet parquet-tools parquet-viewer s3 sql sql-editor svelte sveltejs sveltekit
Last synced: 13 May 2025
https://github.com/pretzelai/pretzelai
The modern replacement for Jupyter Notebooks
analytics artificial-intelligence business-intelligence businessintelligence dashboard data data-analysis data-analytics data-science data-visualization duckdb notebooks open-source prql reporting sql sql-editor sql-editor-online visualization wasm
Last synced: 14 May 2025
https://github.com/Canner/WrenAI
π Open-source SQL AI Agent for Text-to-SQL. Supporting PostgreSQL, DuckDB, MySQL, MS SQL, ClickHouse, Trino, JSON, CSV, Parquet data sources, and more! π
agent ai bigquery duckdb fastapi gpt hacktoberfest llm nextjs nlp openai postgresql python rag sql sqlai text-to-sql text2sql typescript
Last synced: 25 Nov 2024
https://github.com/rilldata/rill-developer
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
bi business-analytics csv data data-analysis data-visualization dataviz duckdb gcs golang parquet parquet-tools parquet-viewer s3 sql sql-editor svelte sveltejs sveltekit
Last synced: 08 Mar 2025
https://github.com/moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
data-matching data-science deduplicate-data deduplication duckdb em-algorithm entity-resolution fuzzy-matching record-linkage spark uk-gov-data-science
Last synced: 13 May 2025
https://github.com/BemiHQ/BemiDB
Single-binary Postgres read replica optimized for analytics
analytics data-lakehouse data-movement data-warehouse duckdb iceberg olap parquet postgresql replication zero-etl
Last synced: 01 May 2025
https://github.com/scratchdata/scratchdata
Scratch is a swiss army knife for big data.
bigquery clickhouse data-warehouse duckdb hacktoberfest motherduck olap redshift snowflake
Last synced: 16 May 2025
https://github.com/duckdb/dbt-duckdb
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
Last synced: 14 May 2025
https://github.com/uwdata/mosaic
An extensible framework for linking databases and interactive views.
data duckdb interaction scalability visualization
Last synced: 14 May 2025
https://github.com/arkflow-rs/arkflow
High-performance Rust stream processing engine, providing powerful data stream processing capabilities, supporting multiple input/output sources and processors.
arkflow datafusion duckdb flow kafka mysql postgresql rust rust-lang sql sqlite stream tokio tokio-rs
Last synced: 16 May 2025
https://github.com/latitude-dev/latitude
Developer-first embedded analytics
analytics business-intelligence dashboard data data-analysis data-analytics data-app data-engineering data-science data-visualization duckdb embedded-analytics exploratory-data-analysis javascript-framework open-source react self-hosted sql svelte tailwindcss
Last synced: 31 Dec 2024
https://github.com/deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Last synced: 09 Mar 2025
https://github.com/timestored/qstudio
qStudio - Free SQL Analysis Tool
clickhouse database duckdb duckdb-database gui kdb kdb-q mysql postgresql questdb sql timescaledb
Last synced: 08 May 2025
https://github.com/canner/vulcan-sql
Data API Framework for AI Agents and Data Apps
ai ai-agent analytics api-builder bigquery clickhouse data-lake data-warehouse database duckdb ksqldb postgresql reporting restful-api snowflake spreadsheet sql typescript vulcan-sql vulcansql
Last synced: 15 May 2025
https://github.com/Canner/vulcan-sql
Data API Framework for AI Agents and Data Apps
ai ai-agent analytics api-builder bigquery clickhouse data-lake data-warehouse database duckdb ksqldb postgresql reporting restful-api snowflake spreadsheet sql typescript vulcan-sql vulcansql
Last synced: 11 Apr 2025
https://github.com/ploomber/jupysql
Better SQL in Jupyter. π
bigquery clickhouse data-engineering data-science duckdb hive jupyter mysql polars postgres presto python redshift snowflake spark-sql sql sqlite trino tsql
Last synced: 23 Jan 2025
https://github.com/litestar-org/advanced-alchemy
A carefully crafted, thoroughly tested, optimized companion library for SQLAlchemy
alembic cockroachdb duckdb fastapi flask litestar litestar-org mssql mysql oracle-db postgresql repository-pattern sanic spanner sqlalchemy sqlite starlette
Last synced: 14 May 2025
https://github.com/giorgi/duckdb.net
Bindings and ADO.NET Provider for DuckDB
ado-net data-science duckdb duckdb-database hacktoberfest
Last synced: 14 May 2025
https://github.com/paradedb/pg_analytics
DuckDB-powered data lake analytics from Postgres
analytics arrow big-data columnar database datafusion datalake deltalake duckdb iceberg lakehouse lakehouse-platform object-storage olap paradedb parquet postgres postgresql realtime-analytics sql
Last synced: 24 Mar 2025
https://github.com/incentius-foss/WhatTheDuck
WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data.
Last synced: 31 Dec 2024
https://github.com/Giorgi/DuckDB.NET
Bindings and ADO.NET Provider for DuckDB
ado-net data-science duckdb duckdb-database hacktoberfest
Last synced: 24 Mar 2025
https://github.com/ark-flow/arkflow
High-performance Rust stream processing engine, providing powerful data stream processing capabilities, supporting multiple input/output sources and processors.
datafusion duckdb flow kafka mysql postgresql rust rust-lang sql sqlite stream tokio tokio-rs
Last synced: 03 Apr 2025
https://github.com/ekzhang/inline-sql
πͺ Inline SQL in any Python program
dsl duckdb olap python query-language sql
Last synced: 13 Apr 2025
https://github.com/apecloud/myduckserver
Unified MySQL, Postgres & FlightSQL Server, Powered by DuckDB.
analytics arrow business-analytics business-intelligence columnar-storage data-engineering data-science database duckdb htap mariadb mysql olap pandas parquet polars postgres replication sql zero-etl
Last synced: 15 May 2025
https://github.com/mause/duckdb_engine
SQLAlchemy driver for DuckDB
duckdb duckdb-engine python sql sqlalchemy
Last synced: 14 May 2025
https://github.com/Mause/duckdb_engine
SQLAlchemy driver for DuckDB
duckdb duckdb-engine python sql sqlalchemy
Last synced: 24 Mar 2025
https://github.com/turbot/powerpipe
Powerpipe: Dashboards for DevOps. Visualize cloud configurations. Assess security posture against a massive library of benchmarks. Build custom dashboards with code.
aws azure cis cloud cnapp cspm dashboards devops devsecops duckdb gcp hacktoberfest hcl kubernetes mysql postgresql security sql sqlite terraform
Last synced: 16 May 2025
https://github.com/rpbouman/huey
Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM
data duckdb excel pivot-tables rolap small-data sql
Last synced: 24 Mar 2025
https://github.com/duckdb/duckdb-rs
Ergonomic bindings to duckdb for Rust
arrow database duckdb ffi ffi-bindings olap rust
Last synced: 07 Apr 2025
https://github.com/alitrack/duckdb_fdw
DuckDB Foreign Data Wrapper for PostgreSQL
duckdb duckdb-fdw fdw foreign-data-wrapper postgresql
Last synced: 24 Mar 2025
https://github.com/l1xnan/duckling
A fast viewer for CSV/Parquet files and databases such as DuckDB, SQLite, PostgreSQL, MySQL, Clickhouse, etc., base on Tauri
clickhouse duckdb mysql parquet postgresql rust sqlite tauri
Last synced: 16 May 2025
https://github.com/medama-io/medama
Self-hostable, privacy-focused website analytics.
analytics cookie-free dashboard duckdb go google-analytics medama metrics privacy self-hosted web-analytics website-analytics
Last synced: 13 Dec 2024
https://github.com/sutoiku/puffin
Serverless HTAP cloud data platform powered by Arrow Γ DuckDB Γ Iceberg
arrow duckdb iceberg serverless
Last synced: 27 Apr 2025
https://github.com/nshiab/simple-data-analysis
Easy-to-use and high-performance JavaScript library for data analysis. Works with tabular and geospatial data.
analysis bun data data-analysis data-science duckdb geospatial javascript node node-js nodejs spatial spatial-analysis sql typescript
Last synced: 10 Apr 2025
https://github.com/kraina-ai/quackosm
QuackOSM: an open-source Python and CLI tool for reading OpenStreetMap PBF files using DuckDB
duckdb geo geospatial openstreetmap osm pbf python
Last synced: 15 May 2025
https://github.com/evidence-dev/duckdb_gsheets
DuckDB extension to read and write Google Sheets using SQL
duckdb duckdb-extension google-sheets
Last synced: 23 Apr 2025
https://github.com/theseus-rs/rsql
Command line SQL interface for relational databases and common data file formats
cockroachdb command-line csv data database duckdb json mariadb mysql parquet postgres postgresql redshift snowflake sql sqlite sqlite3 sqlserver
Last synced: 16 May 2025
https://github.com/duckdb/duckdb-sqlite
DuckDB extension to read and write to SQLite databases
Last synced: 12 Apr 2025
https://github.com/duckdb/sqlite_scanner
DuckDB extension to read and write to SQLite databases
Last synced: 13 Dec 2024
https://github.com/metrico/quackpipe
QuackPipe is an OLAP API built on top of DuckDB with ClickHouse compatibility bits
api clickhouse clickhouse-server csv database duckdb duckdb-api duckdb-engine gigapipe golang lambda lambda-functions olap parquet qryn rest-api s3 server sql
Last synced: 04 Apr 2025
https://github.com/cnstlungu/portable-data-stack-dagster
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset
business-intelligence dagster data-engineering data-visualization dbt duckdb python superset
Last synced: 01 May 2025
https://github.com/isaacbrodsky/h3-duckdb
Bindings for H3 to DuckDB
duckdb geospatial h3 hexagon spatial-indexing
Last synced: 10 Apr 2025
https://github.com/quackscience/duckdb-extension-httpserver
DuckDB HTTP API Server and Query Interface in a Community Extension
api api-server community-extension duckdb extension http-server olap user-interface
Last synced: 05 Apr 2025
https://github.com/dataflareapp/dataflare
Simple, easy-to-use database manager
clickhouse cloudflare-d1 cockroachdb database database-manager databend duckdb libsql mariadb mysql postgresql questdb rqlite sqlcipher sqlite sqlserver
Last synced: 05 Apr 2025
https://github.com/DataflareApp/Dataflare
Simple, easy-to-use database manager
clickhouse cloudflare-d1 cockroachdb database database-manager databend duckdb libsql mariadb mysql postgresql questdb rqlite sqlcipher sqlite sqlserver
Last synced: 07 Mar 2025
https://github.com/buremba/universql
The bridge to effortless multi-engine data applications, currently supports Snowflake βοΈ and DuckDB π¦
databricks dbt duckdb proxy-server snowflake sql sql-proxy sqlglot
Last synced: 12 Apr 2025
https://github.com/valmi-io/valmi-activation
β‘ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks and SaaS tools like Klaviyo, Facebook Ads, Salesforce, Braze etc. Valmi.io Customer Data Platform (CDP) helps track and ingest user activity events from websites, shopify, serverside events. https://cloud.valmi.io
airbyte cdp composable-cdp dagster dbt duckdb ecommerce email-marketing etl event-ingestion event-tracking marketing-automation modern-data-stack open-source push-notifications reverse-etl shopify shopify-app shopify-events user-behavior
Last synced: 01 May 2025
https://github.com/mattf96s/quackdb
Open-source in-browser DuckDB SQL editor
apache-arrow comlink duckdb duckdb-wasm parquet remix remix-run shadcn sql sst
Last synced: 16 Apr 2025
https://github.com/dbt-labs/jaffle_shop_duckdb
Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!
Last synced: 07 Apr 2025
https://github.com/patricktrainer/duckdb-embedding-search
Fast similarity search using DuckDB
cosine-similarity duckdb embeddings gpt-4 llm openai rag search-engine semantic-search similarity-search vector vector-database vector-search
Last synced: 06 Apr 2025
https://github.com/clickhouse/jsonbench
JSONBench: a Benchmark For Data Analytics On JSON
analytics benchmark clickhouse database duckdb elasticsearch json mongodb postgresql sql
Last synced: 06 Apr 2025
https://github.com/tobilg/duckerd
CLI to create an ER Diagram from DuckDB database files
cli database-diagram duckdb erd erdiagram
Last synced: 06 Apr 2025
https://github.com/BauplanLabs/quack-reduce
A playground for running duckdb as a stateless query engine over a data lake
duckdb lambda serverless sql streamlit-dashboard
Last synced: 24 Mar 2025
https://github.com/rayokota/kwack
In-Memory Analytics for Kafka using DuckDB
Last synced: 10 Apr 2025
https://github.com/maxcountryman/warc-parquet
ποΈ A simple CLI for converting WARC to Parquet.
crawling duckdb parquet warc web-archiving
Last synced: 16 May 2025
https://github.com/evanwashere/duckdb
π¦ fast duckdb bindings for bun runtime
Last synced: 12 Apr 2025
https://github.com/igor-suhorukov/openstreetmap_h3
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowIPC/Parquet dumps
apach-sedona apache-arrow apache-spark arrow citusdb column-store converter duckdb geometry-processing geospatial java openstreetmap parquet parquet-files pbf pbf-format postgis postgresql world
Last synced: 07 Apr 2025
https://github.com/karlseguin/zuckdb.zig
A DuckDB driver for Zig
duckdb duckdb-driver zig zig-library zig-package
Last synced: 06 Apr 2025
https://github.com/ktanaka101/mcp-server-duckdb
A Model Context Protocol (MCP) server implementation for DuckDB, providing database interaction capabilities
Last synced: 09 Apr 2025
https://github.com/caioricciuti/duck-ui
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, and keyboard shortcuts, all running seamlessly in the browser using DuckDB's WebAssembly (WASM) capabilities.
data-science data-visualization dataanalysis datanalytics duckdb local
Last synced: 04 Apr 2025
https://github.com/ClickHouse/JSONBench
JSONBench: a Benchmark For Data Analytics On JSON
analytics benchmark clickhouse database duckdb elasticsearch json mongodb postgresql sql
Last synced: 07 Feb 2025
https://github.com/rupurt/odbc-scanner-duckdb-extension
A DuckDB extension to read data directly from databases supporting the ODBC interface
analytics bigquery columnar-database cpp data-engineering db2 duckdb mariadb mssql mysql nix odbc olap oracle postgres snowflake vector-engine
Last synced: 15 Feb 2025
https://github.com/elsbrock/hetzner-radar
π΅οΈ Track prices of the Hetzner dedicated server auction
cloudflare cloudflare-d1 cloudflare-workers duckdb hetzner sqlite sveltekit tailwind wasm
Last synced: 11 Apr 2025
https://github.com/josephmachado/python_essentials_for_data_engineers
Code for blog at https://www.startdataengineering.com/post/python-for-de/
data-engineering data-quality-checks duckdb polars python transformations
Last synced: 15 Apr 2025
https://github.com/marcua/ayb
ayb makes it easy to create databases, share them with collaborators, and query them from a web application or the command line
Last synced: 04 Apr 2025
https://github.com/rustyconover/duckdb-shellfs-extension
DuckDB extension allowing shell commands to be used for input and output.
duckdb duckdb-extension popen shell
Last synced: 12 May 2025
https://github.com/techascent/tmducken
tech.ml.dataset integration with duckdb
Last synced: 16 Mar 2025
https://github.com/quackscience/quackflight
DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)
airport api arrow clickhouse duckdb duckdb-server flight flight-sql fuck-trump server
Last synced: 15 Apr 2025
https://github.com/tjanif/airflow-elt-blueprint
A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.
airflow astro-cli astro-python-sdk astronomer codespaces duckdb minio streamlit
Last synced: 15 May 2025
https://github.com/gwenwindflower/octocatalog
Nicely modeled data built on the Github Archive.
dbt duckdb github-archive public-datasets
Last synced: 01 May 2025
https://github.com/quackscience/duckdb-extension-httpclient
DuckDB HTTP GET/POST Client in a Community Extension
duckdb duckdb-extension extension http-client
Last synced: 07 May 2025
https://github.com/mattf96s/QuackDB
Open-source in-browser DuckDB SQL editor
apache-arrow comlink duckdb duckdb-wasm parquet remix remix-run shadcn sql sst
Last synced: 14 May 2025
https://github.com/datawan-labs/schools
2nd Place Solid Hack 2024. Visualizing School Distribution and Population Density in Indonesia, By mapping where people live alongside school locations, this study shows how spatial data can reveal patterns of educational access across the country.
duckdb map mapbox maplibre population-density solidjs visualization
Last synced: 19 Apr 2025
https://github.com/cldellow/datasette-parquet
Add DuckDB, Parquet, CSV and JSON lines support to Datasette
datasette datasette-plugin duckdb parquet
Last synced: 13 Apr 2025
https://github.com/josephmachado/cost_effective_data_pipelines
Cost Efficient Data Pipelines with DuckDB
Last synced: 15 Apr 2025
https://github.com/tsffarias/liftoff_data
Arquitetura CRM de Baixo Custo com Gen AI, projetada para startups que precisam processar e analisar dados de vendas de forma eficiente.
ai airflow api briefer crm dbt docker duckdb etl faker fastapi minio n8n pgduckdb postgresql pydantic python sqlalchemy startup streamlit
Last synced: 02 Feb 2025
https://github.com/squashql/squashql
Official repository of SquashQL, the SQL query engine for multi-dimensional and hierarchical analysis that empowers your SQL database
bigquery clickhouse database duckdb java jdbc query querybuilder snowflake spark sql typescript
Last synced: 14 Dec 2024
https://github.com/starlake-ai/jsqltranspiler
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
abstract-syntax-tree bigquery column databricks duckdb java lineage query redshift resolver rewrite snowflake transpiler
Last synced: 16 May 2025
https://github.com/quackscience/duckdb-extension-openprompt
DuckDB Community Extension to prompt LLMs from SQL
duckdb duckdb-database extension llm llms ollama prompt sql
Last synced: 05 May 2025
https://github.com/dfinke/psduckdb
PSDuckDB is a PowerShell module that provides seamless integration with DuckDB, enabling efficient execution of analytical SQL queries directly from the PowerShell environment.
data-analysis data-science duckdb powershell sql
Last synced: 16 Mar 2025
https://github.com/dfinke/PSDuckDB
PSDuckDB is a PowerShell module that provides seamless integration with DuckDB, enabling efficient execution of analytical SQL queries directly from the PowerShell environment.
data-analysis data-science duckdb powershell sql
Last synced: 16 Dec 2024
https://github.com/quackscience/duckdb-extension-clickhouse-sql
DuckDB Community Extension implementing ClickHouse SQL Dialect macros and Custom functions for DuckDB
chdb chsql clickhouse clickhouse-database clickhouse-server duckdb duckdb-community duckdb-extension duckdb-server quackpipe sql
Last synced: 19 Dec 2024
https://github.com/rustyconover/duckdb-lindel-extension
DuckDB Extension Linearization/Delinearization, Z-Order, Hilbert and Morton Curves
duckdb duckdb-extension hilbert hilbert-curve hilbert-transform linearization morton morton-order space-filling-curves zorder-curves
Last synced: 26 Mar 2025
https://github.com/uwdata/mosaic-framework-example
Using Mosaic and DuckDB within Observable Framework
dashboard duckdb mosaic publishing visualization
Last synced: 08 Apr 2025
https://github.com/randomfractals/chicago-crimes
Exploring Chicago crimes dataset with Jupyter notebooks, DuckDB, Malloy and new Panel/PyScript data and dashboard tools.
chicago crimes duckdb julia jupyter-notebooks large-csv malloy malloydata parquet polars pyarrow
Last synced: 22 Mar 2025
https://github.com/jakthom/hercules
A DuckDB-powered Prometheus exporter
duckdb metrics monitoring observability observability-data prometheus prometheus-exporter prometheus-metrics prometheus-utility
Last synced: 15 Apr 2025