awesome-duckdb
🦆 A curated list of awesome DuckDB resources
https://github.com/davidgasquez/awesome-duckdb
Last synced: about 7 hours ago
JSON representation
-
Resources
- DuckDB setup - GitHub Action to install DuckDB in CI.
- DuckDB snippets - Collection of snippets curated by MotherDuck.
- DuckDB tldr page - DuckDB's entry in [tldr pages](https://tldr.sh/), available in CLI via the `tldr duckdb` command.
- Observable notebooks - Notebooks using DuckDB on the Observable data visualization platform.
- Official documentation - Official DuckDB documentation.
- Serverless DuckDB over S3 - Running DuckDB over a data lake on S3 using lambda.
- docker-duckdb - Docker image for DuckDB CLI.
- DuckDB AWS Lambda layer - Run DuckDB in AWS Lambda functions.
- Serverless DuckDB as API - Use DuckDB as API with Amazon API Gateway and AWS Lambda.
- Serverless Parquet Repartitioner - Use DuckDB to repartition data in S3-based Data Lakes.
- duckdb-nf - Example uses of DuckDB with Nextflow.
- DuckLake - A lakehouse format from the team behind DuckDB.
- DuckERD CLI
- DuckDB clients - Client APIs for DuckDB.
- Official blog - Official DuckDB blog.
- DuckDB RSS feed - Feed for the official DuckDB blog.
- DuckDB clients - Client APIs for DuckDB.
- DuckDB documentation PDF - The DuckDB documentation as a single PDF file.
- Compatible DuckDB extensions for AWS Lambda - Extensions specifically compiled for the AWS Lambda runtime (GLIBC 2.26).
- duckdb-docker - Official Docker image for the DuckDB CLI.
- SQLNotebook - SQL notebooks by TimerStored powered by DuckDB.
- Awesome DuckLake - A curated list of awesome DuckLake tools and resources.
- DuckDB in Science - A collection of scientific papers building on DuckDB.
- DuckDB documentation MD - The DuckDB documentation as a single Markdown file.
- DuckDB execution plan visualizer - Visualizing and understanding DuckDB EXPLAIN plans made easy.
- DuckDB documentation PDF - The DuckDB documentation as a single PDF file.
- DuckDB documentation MD - The DuckDB documentation as a single Markdown file.
- Compatible DuckDB extensions for AWS Lambda - Extensions specifically compiled for the AWS Lambda runtime (GLIBC 2.26).
- Observable notebooks - Notebooks using DuckDB on the Observable data visualization platform.
-
Client APIs
-
Tools Powered by DuckDB
-
- MotherDuck - Serverless data warehouse powered by DuckDB.
- Boiling Data - Serverless data analytics overlay on top of S3 Data Lakes.
- Hex Dataframe SQL - Hex's Dataframe SQL cells are powered by DuckDB.
- Mode - Mode uses DuckDB for their in-memory data engine.
- Honeycomb Maps - A browser-based geospatial analysis tool leveraging DuckDB Wasm.
- VulcanSQL - DuckDB can be used as a caching layer or a data connector in VulcanSQL, a Data API framework for data folks to create REST APIs by writing SQL templates.
- Bauplan - A serverless data transformation platform for data lakes.
- Excalichart.com - A fast, free dashboard for exploring your data.
- Malloy - Malloy is an experimental language for describing data relationships and transformations. Malloy connects to BigQuery, Snowflake, Trino, and Postgres, and natively supports DuckDB.
- Shaper - Open Source, SQL-driven Data Dashboards powered by DuckDB.
- ParadeDB - Postgres for Search and Analytics, powered by DuckDB-embedded-in-Postgres.
- DatalakeStudio - Load, explore, transform your datasets and expose them via API. Integration with external APIs, S3, PostgreSQL and ChatGPT.
- Spice.ai - A unified SQL query interface and portable runtime to locally materialize (using an embedded DuckDB), accelerate, and query datasets from any database, data warehouse, or data lake.
- Quackpipe - Serverless OLAP API/UI built on top of DuckDB with basic ClickHouse API compatibility and MotherDuck support.
- Census - Census's dataset diffing for incremental syncs is powered by DuckDB.
- Crunchy Bridge for Analytics - Fully managed DBaaS based in Postgres integrated with DuckDB.
- UniverSQL - An implementation of Snowflake API, enables running queries on Snowflake tables locally with DuckDB without a running warehouse.
- Honeycomb Maps - A browser-based geospatial analysis tool leveraging DuckDB-Wasm.
- Iceburst - The real-time data lake for monitoring & security.
- SQLGlot - Python transpiler that translates between 24 different SQL dialects including DuckDB.
- Latitude - Latitude uses DuckDB to power data snapshots. Drop a CSV file and query it with SQL at the speed of light.
- Parquet Explorer - Visual Studio Code extension for exploring Parquet files with SQL, powered by DuckDB.
- DQOps - Data quality platform for data engineers, data quality teams and data operations.
- Ibis Project - A DataFrame API for interacting with DuckDB (and other compute engines).
- yato - The smallest DuckDB SQL orchestrator on Earth.
- Whereabouts - Fast, accurate, open-source geocoding in Python, using DuckDB.
- Phoenix Analytics - Plug and play analytics for Phoenix applications, powered by DuckDB.
- Excalichart.com - A fast, free dashboard for exploring your data.
- Iceburst - The real-time data lake for monitoring & security.
- ADPivot - No code tool built on top of DuckDB-Wasm and Pyodide that helps build pivot tables from databases of any size with a few clicks.
- SQLMesh - A next-generation data transformation and modeling framework with support for DuckDB connections for state, transformations & running unit tests locally.
- Huey - Blazing-fast & intuitive pivot tables on Parquet, CSV, JSON files and DuckDB tables in the browser based on DuckDB-Wasm. open-source (MIT). Zero install!
- Kepler.gl - Kepler.gl is a powerful open-source geospatial analysis tool for large-scale data sets, now embeds duckdb wasm to create geospatial layers.
- DuckDB OPFS Todo List App - A fully-functional todo list application that demonstrates DuckDB WASM OPFS (Origin Private File System) persistence using a pure functional programming approach.
- Apple Embedding Atlas - A tool that provides interactive visualizations for large embeddings. Uses DuckDB.
- Definite - Definite pulls all your data into a single place for analytics and dashboards. No engineering or SQL required. Get a managed data warehouse (DuckDB), ELT, data modeling / transformations and BI in a single platform.
- Amphi ETL - Low-code data pipelines for structured and unstructured data. SQL transformations are powered by DuckDB.
- Sloggo - Minimal RFC 5424 syslog collector and viewer based on DuckDB. Runs as a single, resource-friendly process.
- Rill Data - Tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL.
- Duck-UI - Duck-UI is a web-based interface for interacting with DuckDB with a SQL editor, data import/export, data explorer, query history, theme toggle and keyboard shortcuts.
- Tailpipe - An open-source SIEM for instant log insights.
- Latitude - Latitude uses DuckDB to power data snapshots. Drop a CSV file and query it with SQL at the speed of light.
- ETLX - DuckDB-powered ETL tool written in Go, inspired by evidence.dev’s syntax. It uses a structured Markdown config where heading levels define nested blocks, yaml code blocks specify metadata, and sql code blocks handle data interactions. Enables clean, code-light orchestration with minimal setup.
- Datakit - The privacy-first data analysis toolkit.
- Hugr - An data mesh platform and high-performance GraphQL backend powered by DuckDB.
- duckdb.yazi - Preview csv/tsv, json, and Parquet files in the yazi file manager using duckdb. View the raw data, or a "summarized" view with data-types, min, max, avg etc. for all columns.
- Greybeam - Routes your Snowflake queries to a DuckDB powered warehouse to reduce costs and speed up queries.
- Boilstream - Manage with SQL, like for creating topics (tables) and derived topics (materialised views) - all landing on object storage in DuckLake as optimised Parquet files.
- Arc - Time-series data warehouse built on DuckDB.
- Boiling Data - Serverless data analytics overlay on top of S3 Data Lakes.
- DQOps - Data quality platform for data engineers, data quality teams and data operations.
-
Web Clients
- QuackDB - Open-source online DuckDB SQL playground and editor.
- SQL Playground - DuckDB-WASM based SQL playground for running queries on local or remote data, being able to show data as tables or visually as graphs.
- Sidequery - Sidequery is a privacy-preserving DuckDB-powered query editor & data exploration tool for local & remote data.
-
Web Clients (WebAssembly)
- DuckDB UI - A local web-based notebook interface for DuckDB.
- WhatTheDuck - WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV files, store them in tables, and perform SQL queries on the data.
- Online DuckDB Shell - Online DuckDB shell powered by DuckDB-Wasm.
- Sekuel Playground - Query your local Parquet, CSV, JSON. Your data will not be sent out of the device you are using.
- CSVFiddle - Free tool to explore and share insights from CSV files using SQL. Import data, write SQL, then instantly share it with anyone.
- Codapi - Embed executable code snippets directly into your product documentation, online course or blog post.
- PondPilot - Open-source, 100% client-side data exploration tool that enables users to analyze local and remote data using SQL. Zero-copy direct access to local datasets sets PondPilot apart from similar tools. It runs entirely in the browser—no servers, no cloud uploads, and no setup required.
-
-
Integrations
-
Web Clients
- Kestra DuckDB plugin - Run queries with DuckDB to schedule data transformations and process automations, and run event-driven anomaly detection pipelines.
- data load tool - DuckDB destination - Extract and load data from APIs to DuckDB using dlt.
- target-duckdb - Load data to DuckDB based on Singer spec.
- nodbi - NoSQL Database Connector for R, providing a common API across Elasticsearch, CouchDB, MongoDB, SQLite, PostgreSQL, and DuckDB.
-
- Airbyte DuckDB destination - Load data to DuckDB with Airbyte.
-
Web Clients (WebAssembly)
- SQLFlow - Enables SQL-based stream processing, powered by DuckDB.
- nf-sqldb - This plugin provides support for interacting with SQL databases in Nextflow scripts.
- MindsDB - The platform for customizing AI from enterprise data. [MindsDB integrates with DuckDB](https://docs.mindsdb.com/integrations/data-integrations/duckdb), making data from DuckDB accessible to a diverse range of AI/ML models.
- sqlite2duckdb - A CLI tool to convert SQLite database to DuckDB.
- duckplyr - Drop-in replacement for dplyr in R that uses DuckDB for performance.
- kwack - In-Memory Analytics for Kafka using DuckDB.
- xlDuckDb - Excel addin to run DuckDB queries in Excel.
- Hasura DuckDB Connector - Allows connecting to a DuckDB database or a MotherDuck-hosted DuckDB database through a GraphQL API.
- Ontop - Allows to create Virtual Knowledge Graphs directly from DuckDB.
- The Quack is Back: SAS/ACCESS Meets DuckDB - SAS/ACCESS engine support for DuckDB.
- duckdb-teradata - Teradata connector.
- Kotlin DataFrame - Supports reading from DuckDB databases using JDBC.
- Sidemantic - A semantic layer with DuckDB integration.
- dbt-duckdb - DuckDB dbt adapter.
- Airbyte DuckDB destination - Load data to DuckDB with Airbyte.
- duckplyr - Drop-in replacement for dplyr in R that uses DuckDB for performance.
- duckdb-tableau-connector - DuckDB Tableau connector.
- duckdb-power-query-connector - DuckDB Power Query Custom Connector.
- metabase_duckdb_driver - Metabase DuckDB Driver shipped as 3rd party plugin.
-
-
SQL Clients and IDE that Support DuckDB
-
- Harlequin - The DuckDB IDE for your terminal. ([GitHub](https://github.com/tconbeer/harlequin)).
-
Web Clients
- VSCode SQLTools (Free) - Free open-source VSCode extension to query and explore your DuckDB databases with latest DuckDB support.
- jOOQ - Type safe querying of DuckDB (and many other RDBMS) from Java. A [transpiler](https://www.jooq.org/translate/) from and to DuckDB is also available.
- SQL DATA LENS - A lightweight, commercial SQL IDE that supports different DBMS, including DuckDB. The focus on performance and special DBMS features.
- qStudio - A free SQL tool specialized for data analysts. It runs on every operating system and allows easy browsing of tables and charting of results.
- DuckDB SQL Tools - Free DuckDB SQL Tools for VS Code IDE. [Premium version available](https://github.com/RandomFractals/pro-data-tools/blob/main/duckdb-tools.md#duckdb-pro-tools) with advanced features.
- DataGrip - Paid SQL IDE by Jetbrains that supports many different database technologies, including DuckDB.
- manifold-sql (DuckDB for Java) - Use native DuckDB SQL of any complexity directly & type-safely in Java source with comprehensive IntelliJ support.
-
Web Clients (WebAssembly)
- Duckling - A fast viewer for CSV/Parquet files and DuckDB/SQLite, based on Tauri.
- rsql - CLI for DuckDB, LibSQL, MariaDB, MySQL, PostgreSQL, SQLite3 and SQL Server.
- rainfrog - A database TUI with experimental support for DuckDB.
- DBeaver - DBeaver is a universal database access and development tool that can be used to connect almost any type of database.
-
-
Extensions
-
Web Clients
- `postgres_scanner` - Allows DuckDB to directly read data from a running Postgres instance.
- `spatial` - Enables geospatial processing.
- `sqlite_scanner` - To read and write to SQLite databases.
- ERPL - DuckDB SAP connector using RFC, ODP, or BICS.
- Official Extensions - Official DuckDB extensions.
- `fts` - To support full text search.
- Official Extensions - Official DuckDB extensions.
- `sqlite` - To read from and write to SQLite databases.
- `mysql` - To read from and write to MySQL databases.
- `postgres` - To read from and write to PostgreSQL databases.
- `vss` - Add support for vector similarity search.
- KĂązu - Scan DuckDB tables in KĂązu, an embeddable property graph database management system.
-
- Lance - Integrate Lance (modern columnar data format for ML implemented in Rust) with DuckDB.
-
[Core Extensions](https://duckdb.org/docs/stable/core_extensions/overview)
- `arrow` - A zero-copy data integration between Apache Arrow and DuckDB.
- `avro` - For reading Avro files.
- `ducklake` - For DuckLake support.
- `aws` - For handling AWS credentials.
- `azure` - For using the Azure Blob storage.
- `delta` - For Delta Lake support.
- `iceberg` - For reading Iceberg tables.
- `inet` - For storing and handling IPv4 and IPv6 Internet addresses.
-
[Community Extensions](https://duckdb.org/community_extensions/)
- `bigquery` - Integrates DuckDB with Google BigQuery, allowing direct querying and management of BigQuery datasets.
- `chsql` - ClickHouse SQL Dialect macros for DuckDB.
- `duckherder` - Distributed execution for DuckDB queries.
- `duckpgq` - Add supports for SQL/PGQ (Property Graph Queries) introduced in the SQL:2023 standard.
- `evalexpr_rhai` - Evaluates the [Rhai](https://rhai.rs) scripting language as part of SQL.
- `fuzzycomplete` - Performs fuzzy string matching for autocompletion.
- `gsheets` - Read and write Google Sheets using SQL.
- `h3` - Adds support for the H3 discrete global grid system.
- `httpserver` - DuckDB HTTP API Server and Query Interface.
- `lindel` - Linearization/Delinearization, Z-Order, Hilbert and Morton Curves.
- `netquack` - Parsing, extracting, and analyzing domains, URIs, and paths with ease.
- `prql` - Run PRQL commands directly within DuckDB.
- `scrooge` - A set of aggregation functions and data scanners on financial data.
- `shellfs` - Allows shell commands to be used for input and output.
- `ulid` - ULID data type for DuckDB. A ULID is similar to a UUID except that it also contains a timestamp component.
- `cache_httpfs` - Adds a read caching layer to duckdb filesystem to improve query performance and reduce egress cost.
- `3fs` - Integrates DuckDB with DeepSeek 3FS distributed file system.
- `chsql` - ClickHouse SQL Dialect macros for DuckDB.
- `crypto` - Cryptographic hash functions and HMAC.
-
Other Extensions
- duckdb_engine - SQLAlchemy driver for DuckDB.
- duckdb-extension-template-zig - A Zig & Nix toolkit template for building extensions against multiple versions of DuckDB using Zig, C or C++.
- duckdb-jfr-extension - DuckDB extension to read JFR (Java Flight Recorder) files directly.
- duckdb_protobuf - Plugin for querying encoded protobuf messages (both sequences and individual messages per file).
- duckdb-pytables - DuckDB extension to allow running SQL on arbitrary data sources.
- Lance - Integrate Lance (modern columnar data format for ML implemented in Rust) with DuckDB.
- ODBC Scanner DuckDB Extension - DuckDB extension to read data directly from databases supporting the ODBC interface.
- QDuckDB - Plugin for reading DuckDB spatial tables in QGIS software.
- UC Catalog Extension - Proof-of-concept extension combining the `delta` extension with Unity Catalog.
- FlockMTL - Integrate language model (LLM) capabilities directly into your queries and workflows.
- ERPL Web - ERPL Web is a DuckDB extension that connects API-based ecosystems via standard interfaces like OData, GraphQL, and REST.
- quack-zig - The infamous DuckDB quack extension rewritten in C and built with Zig. Proof that you can develop DuckDB extensions without drowning in boilerplate.
- ERPL - DuckDB SAP connector using RFC, ODP, or BICS.
- Lance - Integrate Lance (modern columnar data format for ML implemented in Rust) with DuckDB.
- FlockMTL - Integrate language model (LLM) capabilities directly into your queries and workflows.
-
Extension Statistics
- DuckDB Extension Radar - Repository that contains DuckDB extensions on GitHub. Refreshed daily.
- DuckDB extension weekly downloads - Statistics of weekly downloads for core extensions and community extensions. Refreshed daily.
-
-
Media
-
Talks
- In-Process Analytical Data Management with DuckDB @ PyData Amsterdam - Hannes MĂĽhleisen.
- DuckDB: Bringing analytical SQL directly to your Python shell @ EuroPython - Pedro Holanda.
- DuckCon #3 playlist
- DuckDB: Bringing Analytical SQL Directly To Your Python Shell @ FOSDEM - Pedro Holanda.
- State Of The Duck @ DuckCon - Hannes MĂĽhleisen & Mark Raasveldt.
- DuckDB Extensions @ DuckCon - Pedro Holanda & Sam Ansmink.
- Developing Systems in Academia: The Good, the Bad, and the not-so-Ugly Duckling @ CIDR - Hannes MĂĽhleisen.
- DuckDB An Embeddable Analytical Database @ FOSDEM - Hannes MĂĽhleisen.
- DuckDB tutorials playlist by Learn Data with Mark - Mark Needham.
- DuckDB tutorials playlist by MotherDuck - Mehdi Ouazza (MotherDuck).
- Nextflow and database uses: powering data engineering, exploring DuckDB, and beyond - Edmund Miller.
- State of the Duck @ DuckCon #4 - Hannes MĂĽhleisen and Mark Raasveldt.
- State of the Duck @ DuckCon #3 - Hannes MĂĽhleisen and Mark Raasveldt.
- State of the Duck @ DuckCon #2 - Hannes MĂĽhleisen & Mark Raasveldt.
- DuckCon #4 playlist
- DuckDB: Bringing Analytical SQL Directly To Your Python Shell @ FOSDEM - Pedro Holanda.
- Why should you care about DuckDB? @ Dublin DuckDB meetup - Mihai Bojin.
- Exploring Monte Carlo Simulations With DuckDB @ Dublin DuckDB meetup - James McNeill.
- DuckDB Extensions @ DuckCon - Pedro Holanda & Sam Ansmink.
- Developing Systems in Academia: The Good, the Bad, and the not-so-Ugly Duckling @ CIDR - Hannes MĂĽhleisen.
- DuckDB An Embeddable Analytical Database @ FOSDEM - Hannes MĂĽhleisen.
- DuckDB and recommenders : a lightning fast synergy @ Dublin DuckDB meetup - Khalil Muhammad.
- DuckDB: The Power of a Data Warehouse in your Python Process @ PyData Yerevan - Gábor Szárnyas.
- DuckDB keynote @ Data + AI Summit 2023 - Hannes MĂĽhleisen.
- DuckLake - The SQL-Powered Lakehouse Format for the Rest of Us - Hannes MĂĽhleisen.
- Introducing DuckLake - Hannes MĂĽhleisen and Mark Raasveldt.
- DuckCon #6 playlist
- DuckDB: Crunching data anywhere from laptops to servers @ GOTO Amsterdam 2024 - Gábor Szárnyas.
- DuckDB – Overview and latest developments @ DuckCon #5 - Hannes Mühleisen and Mark Raasveldt.
- DuckCon #5 playlist
- DuckCon #4 playlist
- DuckCon #3 playlist
- DuckDB tutorials playlist by MotherDuck - Mehdi Ouazza.
- Nextflow and database uses: powering data engineering, exploring DuckDB, and beyond - Edmund Miller.
- DuckCon #6 playlist
- DuckCon #5 playlist
- DuckCon #4 playlist
- DuckCon #3 playlist
- DuckDB tutorials playlist by Learn Data with Mark - Mark Needham.
- DuckDB tutorials playlist by MotherDuck - Mehdi Ouazza.
- Nextflow and database uses: powering data engineering, exploring DuckDB, and beyond - Edmund Miller.
-
Podcasts
- The Personal Data Warehouse - Jordan Tigani.
- Software Engineering Daily: DuckDB - Hannes MĂĽhleisen.
- Move Your Database To The Data And Speed Up Your Analytics With DuckDB - Hannes MĂĽhleisen.
- Developer Voices: Implementing Hardware-Friendly Databases - Hannes MĂĽhleisen.
- The Geek Narrator: DuckDB Internals - Mark Raasveldt.
- The Analytics Engineering Podcast: The Personal Data Warehouse - Jordan Tigani.
- The Analytics Engineering Podcast: The Personal Data Warehouse - Jordan Tigani.
- The Analytics Engineering Podcast: The Personal Data Warehouse - Jordan Tigani.
- DuckDB in Research - Series in Disseminate, the Computer Science Research Podcast, with host Jack Waudby.
- Data Engineering Podcast: Move Your Database To The Data And Speed Up Your Analytics With DuckDB - Hannes MĂĽhleisen.
-
Blog Posts
- Modern Data Stack in a Box - Fast, free, and open-source Modern Data Stack deployed on a laptop using the combination of DuckDB, Meltano, dbt, and Apache Superset.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- How to use DuckDB, Motherduck and Kestra for ETL - How DuckDB can transform data, mask sensitive PII information, detect anomalies in event-driven workflows, and streamline reporting use cases.
- DuckDB vs. MotherDuck — how do they compare - What are key differences between them, and when to choose each of these options.
- Building DuckDB Extensions with Zig and Nix - For Nix users and Zig developers familiar with DuckDB looking to extend it’s capabilities with custom extensions.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Putting DuckDB in Postgres to Query Iceberg - How ParadeDB embedded DuckDB in Postgres to achieve fast analytics and Apache Iceberg compatibility from Postgres.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- DuckDB: The Rising Star in the Big Data Landscape
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- How to Make a DuckDB Extension for a Table Function? - How to make a DuckDB extension to fetch data from external sources.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- Duck Takes Flight: Streaming Data in DuckDB - Adding concurrent read/write to DuckDB with Arrow Flight.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
- DuckDB Doesn't Need Data To Be a Database - How DuckDB can provide a view over data stored in S3.
- Securing DuckDB, Improving Startup Time, and Working Offline - How to set up DuckDB and how to work with extensions in an offline (and potentially sensitive) environment.
- Datalore 2025.2 Is Out With Report Tabs, Extended DuckDB Support, and Snowflake Key Pair Authentication - Improved DuckDB support in JetBrains' Datalore collaborative data science platform
- Using DuckDB WASM + Cloudflare R2 to host and query big data (for almost free) - Demo of using Cloudflare R2 hosting and a WASM DuckDB application to store and query data
- Building DuckDB Extensions with Zig and Nix - For Nix users and Zig developers familiar with DuckDB looking to extend its capabilities with custom extensions.
- Exploring StarCraft 2 data with Airflow, DuckDB and Streamlit - Example project using DuckDB to persist API data, but also explains how to use DuckDB as a versatile data manipulation tool in data wrangling scripts.
-
Books
- DuckDB in Action - DuckDB in Action will show you how to quickly get your hands dirty with DuckDB.
- Getting Started with DuckDB - A practical guide for accelerating your data science, data analytics, and data engineering workflows.
-
-
Installers
- DuckDB version manager (`duckman`) - platform installer and version manager for DuckDB.
- Snap package - Snap package of DuckDB, e.g., for Ubuntu Linux.
- Chocolatey packages - Chocolatey package for Windows.
- Official installation page
- Brew formula
-
Libraries Powered by DuckDB
-
- Mosaic - An extensible framework for linking databases and interactive views.
- Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
- Simple-data-analysis - Easy-to-use and high-performance JavaScript library for data analysis.
- duckdb_fdw - DuckDB Foreign Data Wrapper for PostgreSQL.
- @jetblack/duckdb-react - A context manager for React and DuckDB-Wasm.
- QuackOSM - A Python library for downloading and transforming raw OpenStreetMap data into GeoParquet files.
- flapi - An API Framework that heavily relies on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of BigQuery or Snowflake for AI Agents and Data Apps.
- smallpond - A distributed data processing framework by DeepSeek built on DuckDB and 3FS.
- BemiDB - PostgreSQL read replica optimized for analytics, using DuckDB.
- jsqltranspiler - Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB-compatible SQL.
- SQLRooms - An open-source react framework for single-node data analytics powered by DuckDB.
- Omilayers - A Python library for efficient data management that wraps the APIs of SQLite and DuckDB and offers a high-level interface analytical tasks that involve fast storage, processing and retrieval of data.
- Unleasharp.DB.DuckDB - Lightweight DuckDB query-building client for C#.
- Splink - A free Python library for fast, accurate data deduplication and record linkage.
-
Web Clients
- pg_lakehouse - PostgreSQL extension embedding DuckDB-in-Postgres for fast on-disk and remote object storage analytics from Postgres. Built as a Foreign Data Wrapper with full query pushdown to DuckDB.
- PyGWalker - A Pyhon library that turns your dataframe into an interactive UI for data visualization.
-
-
Projects Powered by DuckDB
-
Web Clients
- `endoflife.date` database - Daily dumps of endoflife.date data.
-
Web Clients (WebAssembly)
- NBA Monte Carlo - Monte Carlo simulation of the NBA season, leveraging Meltano, dbt, DuckDB and Evidence.
- `transfermarkt-datasets` - Curated football datasets from [Transfermarkt](https://www.transfermarkt.co.uk/).
- duckdb-embedding-search - A search engine for DuckDB that uses embedding vectors to find similar documents.
- DuckDB PyPI stats live dashboard - Live dashboard of PyPI downloads using DuckDB, dbt, Evidence and MotherDuck with code source to build your own.
- PyStack't - Python package that supports data preparation for object-centric process mining.
- Georgia State University Book Prices - DuckDB powered WASM app where you can search how much students spend on books at Georgia State University
- Datadex - Open-source and local-friendly data platform to collaborate on Open Data using DuckDB, Dagster, dbt, and Quarto.
-
-
Chat with this page
-
Backends
- duckdb-server - A Rust-based server that runs a local DuckDB instance and supports queries over Web Sockets or HTTP/HTTPS, returning data in either Apache Arrow or JSON format.
- DuckDB API - a TypeScript-based Docker image containing DuckDB, and a Hono framework REST API with JSON or streaming Arrow responses.
- Mosaic DuckDB Server - A Python-based server that runs a local DuckDB instance and supports queries over Web Sockets or HTTP, returning data in either Apache Arrow or JSON format.
- Mosaic DuckDB Server - A Python-based server that runs a local DuckDB instance and supports queries over Web Sockets or HTTP, returning data in either Apache Arrow or JSON format.
-
Newsletters
- DuckDB Newsletter - Monthly newsletter by MotherDuck.
- Learn DuckDB by example - Newsletter by Tobias MĂĽller.
-
Logos and Icons
-
DuckDB Clients and UIs
-
Web Clients (WebAssembly)
- SQL Workbench Embedded - A lightweight JavaScript library that turns SQL code blocks into interactive, browser-based database environments. Powered by DuckDB WASM.
- PondPilot - Open-source, 100% client-side data exploration tool that enables users to analyze local and remote data using SQL. Zero-copy direct access to local datasets sets PondPilot apart from similar tools. It runs entirely in the browser—no servers, no cloud uploads, and no setup required.
- Preswald - WASM packager for Python-based interactive data apps.
- Medama - Self-hostable, privacy-focused website analytics.
- TabulaStudio - Browser-only enterprise data analytics platform with Jupyter-style notebooks, AI-powered visualizations, and enterprise performance (10M+ rows/second). Direct access to files and live databases like (Neon and Supabase ) without servers, cloud uploads, or setup—your data never leaves your browser.
- SQL Workbench - DuckDB-Wasm based SQL Workbench for running queries on local or remote data, being able to show data as tables or visually as graphs, and sharing queries via URLs.
- SQL Workbench Embedded - A lightweight JavaScript library that turns SQL code blocks into interactive, browser-based database environments. Powered by DuckDB WASM.
- QuackDB - Open-source online DuckDB SQL playground and editor.
- Sidequery - Sidequery is a privacy-preserving DuckDB-powered query editor & data exploration tool for local & remote data.
- PondPilot - Open-source, 100% client-side data exploration tool that enables users to analyze local and remote data using SQL. Zero-copy direct access to local datasets sets PondPilot apart from similar tools. It runs entirely in the browser—no servers, no cloud uploads, and no setup required.
- TabulaStudio - Browser-only enterprise data analytics platform with Jupyter-style notebooks, AI-powered visualizations, and enterprise performance (10M+ rows/second). Direct access to files and live databases like (Neon and Supabase ) without servers, cloud uploads, or setup—your data never leaves your browser.
-
-
Client-Server Setups
-
Web Clients (WebAssembly)
- MyDuck Server - A server wrapping DuckDB with MySQL and PostgreSQL wire protocol support.
- pg_analytics - PostgreSQL extension embedding DuckDB-in-PostgreSQL for fast on-disk and remote object storage analytics from Postgres. Built as a Foreign Data Wrapper with full query pushdown to DuckDB. Integrates easily with ParadeDB.
- pg_duckdb - DuckDB-powered PostgreSQL for high-performance apps & analytics.
- pg_mooncake - A PostgreSQL extension that adds native column store tables with DuckDB.
- GizmoSQL - Arrow Flight SQL Server - A C++ implementation of the [Arrow Flight SQL protocol](https://arrow.apache.org/docs/format/FlightSql.html) that runs in a client-server setup with DuckDB or SQLite as backends.
- Hatch - A Go-based implementation of a DuckDB Arrow Flight SQL Server.
- Termux DuckDB package - DuckDB CLI client for the Termux Android terminal emulator.
- pg_lake - `pg_lake` integrates Iceberg and data lake files into Postgres. Uses DuckDB to execute queries.
- Hatch - A Go-based implementation of a DuckDB Arrow Flight SQL Server.
-
-
Tutorials
-
Extension Statistics
- DBQuacks - An interactive SQL tutorial powered by DuckDB.
-
Programming Languages
Categories
Media
81
Tools Powered by DuckDB
61
Extensions
57
Client APIs
34
Resources
29
Integrations
24
Libraries Powered by DuckDB
16
SQL Clients and IDE that Support DuckDB
12
DuckDB Clients and UIs
11
Client-Server Setups
9
Projects Powered by DuckDB
8
Installers
5
Backends
4
Logos and Icons
3
Newsletters
2
Tutorials
1
Chat with this page
1
Sub Categories
Keywords
duckdb
48
sql
18
database
11
analytics
11
parquet
10
duckdb-extension
9
olap
9
mysql
6
postgres
6
data-science
6
python
6
sqlite
6
data
6
postgresql
6
snowflake
5
geospatial
4
csv
4
bigquery
3
cpp
3
data-engineering
3
redshift
3
mariadb
3
dbt
3
object-storage
3
api
3
clickhouse
3
excel
3
rust
3
s3
3
databricks
3
arrow
3
data-analysis
3
rag
2
duckdb-engine
2
databases
2
sqlalchemy
2
extension
2
ai
2
llm
2
ffi
2
ffi-bindings
2
zero-etl
2
replication
2
big-data
2
analytics-engineering
2
sqlite3
2
json
2
pivot-tables
2
parquet-viewer
2
react
2