awesome-database-learning
A list of learning materials to understand databases internals
https://github.com/pingcap/awesome-database-learning
Last synced: 9 days ago
JSON representation
-
Benchmark & Testing
-
Learned Indexes Structures
-
-
Consensus
-
Learned Indexes Structures
- Distributed consensus revised - Related algorithms, by Heidi Howard
- Ark: A Real-World Consensus Implementation
- Ark: A Real-World Consensus Implementation
-
-
Data Partitioning
-
DDL
-
HTAP
-
Network
-
Concurrency Control
- The End of Slow Networks: It's Time for a Redesign
- Accelerating Relational Databases by Leveraging Remote Memory and RDMA
- Don't Hold My Data Hostage: A Case for Client Protocol Redesign
- The End of Slow Networks: It's Time for a Redesign
- Don't Hold My Data Hostage: A Case for Client Protocol Redesign
-
-
Query Execution
-
Bloom Filter
-
Execution Framework
-
Hash Table
- Fibonacci Hashing: The Optimization that the World Forgot (or: a Better Alternative to Integer Modulo)
- All hash table sizes you will ever need - Thomas Neumann](https://databasearchitects.blogspot.com/)
-
Join
-
Vectorization vs Compilization
- Overhead of a Generalized Query Execution Engine
- MonetDB/X100: Hyper-Pipelining Query Execution
- Efficiently Compiling Efficient Query Plans for Modern Hardware
- Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last
- Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask
- Adaptive Execution of Compiled Queries
- Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last
- Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask
-
-
Query Optimizer
-
Blogs
-
Cost Model
- Approximation Schemes for Many-Objective Query Optimization
- Multi-Objective Parametric Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
- Modelling Costs for a MM-DBMS - Time Databases
- Multi-Objective Parametric Query Optimization
- Approximation Schemes for Many-Objective Query Optimization
-
Functional Dependency & Physical Properties
-
Join Order Optimization
- Analysis of Two Existing and One New Dynamic Programming Algorithm for the Generation of Optimal Bushy Join Trees without Cross Products
- How Good Are Query Optimizers, Really?
- Adaptive Optimization of Very Large Join Queries
- Analysis of Two Existing and One New Dynamic Programming Algorithm for the Generation of Optimal Bushy Join Trees without Cross Products
-
Planner Models
- 数据库内核杂谈
- 数据库内核杂谈(九):开源优化器 ORCA
- SQL 查询优化原理与 Volcano Optimizer 介绍 - zh)
- Cascades Optimizer - ming)
- Access Path Selection in a Relational Database Management System
- Query Processing in Main Memory Database Management Systems
- Query Optimization by Simulated Annealing
- Grammar-like Functional Rules for Representing Query Optimization Alternatives
- The Volcano Optimizer Generator- Extensibility and Efficient Search
- The Cascades Framework for Query Optimization
- An Overview of Query Optimization in Relational Systems
- LEO – DB2’s LEarning Optimizer
- Robust Query Processing through Progressive Optimization
- Orca: A Modular Query Optimizer Architecture for Big Data
- Parallelizing Query Optimization on Shared-Nothing Architectures
- The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database
- Access Path Selection in a Relational Database Management System
- Parallelizing Query Optimization on Shared-Nothing Architectures
- The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database
- Query Optimization by Simulated Annealing
- LEO – DB2’s LEarning Optimizer
- Query Processing in Main Memory Database Management Systems
- The Volcano Optimizer Generator- Extensibility and Efficient Search
- The Cascades Framework for Query Optimization
- Orca: A Modular Query Optimizer Architecture for Big Data
-
Statistics
- Universality of Serial Histograms
- Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports
- The History of Histograms
- An Improved Data Stream Summary: The Count-Min Sketch and its Applications
- New Estimation Algorithms for Streaming Data: Count-min Can Do More
- Adaptive Statistics in Oracle 12c
- Deep Unsupervised Cardinality Estimation
- NeuroCard: One Cardinality Estimator for All Tables
- Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches
- SEEKing the truth about ad hoc join costs
- Towards Estimation Error Guarantees for Distinct Values
- Accurate Estimation of the Number of Tuples Satisfying a Condition
- Optimal Histograms for Limiting Worst-Case Error Propagation in the Size of Join Results
- Balancing Histogram Optimality and Practicality for Query Result Size Estimation
- Improved Histograms for Selectivity Estimation of Range Predicates
- An Improved Data Stream Summary: The Count-Min Sketch and its Applications
- New Estimation Algorithms for Streaming Data: Count-min Can Do More
- Preventing Bad Plans by Bounding the Impact of Cardinality Estimation Errors
- Histograms Reloaded: The Merits of Bucket Diversity
- Exploiting Ordered Dictionaries to Efficiently Construct Histograms with Q-Error Guarantees in SAP HANA
- Adaptive Statistics in Oracle 12c
- Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities
- Deep Unsupervised Cardinality Estimation
- Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches
- Universality of Serial Histograms
- Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports
- The History of Histograms
-
Subquery Optimization
-
-
Recommended Courses, Books and Talks
-
Blogs
-
Books
- Database Systems: The Complete Book
- Designing Data-Intensive Applications
- Database Internals
- Foundations of Databases
- Readings in Database Systems, 5th Edition
- Database Design and Implementation: Second Edition (Data-Centric Systems and Applications)
- Principles of Distributed Database Systems, 4th ed
- Inside SQLite
- Architecture of a Database System
- Relational Database Index Design and the Optimizers
- Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control
- Database Systems: The Complete Book
- Foundations of Databases
- Readings in Database Systems, 5th Edition
- Designing Data-Intensive Applications
- Inside SQLite
-
Courses
-
Talks
-
-
Relational Model
-
Bloom Filter
-
Codd's Rules
-
ER to Relational Model
-
Relational Algebra
-
Relational Data Model
-
SQL - Overview
- An Overview of SQL Text Functions - rahman/)
-
-
Replication & Consistency
-
Scheduling
-
Learned Indexes Structures
- Building a Large-scale Distributed Storage System Based on Raft
- Automated Demand-driven Resource Scaling in Relational Database-as-a-Service
- MorphoSys: Automatic Physical Design Metamorphosis for Distributed Database Systems
- Autoscaling Tiered Cloud Storage in Anna
- Adaptive HTAP through Elastic Resource Scheduling
- MorphoSys: Automatic Physical Design Metamorphosis for Distributed Database Systems
-
Programming Languages
Categories
Sub Categories
Learned Indexes Structures
40
Cost Model
38
Statistics
27
Concurrency Control
25
Planner Models
25
Books
16
Disk IO
12
Blogs
10
Vectorization vs Compilization
8
Courses
7
Subquery Optimization
7
NoSQL Systems
6
Buffer Management
6
LSM-Tree
6
Bloom Filter
5
Functional Dependency & Physical Properties
5
Join Order Optimization
4
Isolation Levels
4
Join
4
B-Tree
4
Talks
2
Hash Table
2
Execution Framework
2
Relational Algebra
2
Relational Data Model
1
Codd's Rules
1
SQL - Overview
1
ER to Relational Model
1
Keywords