awesome-database-arXiv-papers
A curated list of awesome arXiv papers on database
https://github.com/aaronchenwei/awesome-database-arXiv-papers
Last synced: 12 days ago
JSON representation
-
Uncategorized
-
Uncategorized
- Using Fuzzy Matching of Queries to optimize Database workloads
- C5: Cloned Concurrency Control that Always Keeps Up
- Sparx: Distributed Outlier Detection at Scale
- Deep Learning in Business Analytics: A Clash of Expectations and Reality
- MATE: Multi-Attribute Table Extraction
- Enabling High-Performance and Energy-Efficient Hybrid Transactional/Analytical Databases with Hardware/Software Cooperation
- Use of Context in Data Quality Management: a Systematic Literature Review
- Online Aggregation based Approximate Query Processing: A Literature Survey
- Towards Polyglot Data Stores -- Overview and Open Research Questions
- Benchmarking Apache Arrow Flight -- A wire-speed protocol for data transfer, querying and microservices
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition
- Evaluating the Text-to-SQL Capabilities of Large Language Models
- HIE-SQL: History Information Enhanced Network for Context-Dependent Text-to-SQL Semantic Parsing
- UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL
- JanusAQP: Efficient Partition Tree Maintenance for Dynamic Approximate Query Processing
- Blink: Lightweight Sample Runs for Cost Optimization of Big Data Applications
- StreamingHub: Interactive Stream Analysis Workflows
- To Migrate or not to Migrate: An Analysis of Operator Migration in Distributed Stream Processing
- The Analysis of Online Event Streams: Predicting the Next Activity for Anomaly Detection
- Factor Windows: Cost-based Query Rewriting for Optimizing Correlated Window Aggregates
- Constructing and Analyzing the LSM Compaction Design Space
- Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems
- Sigma Workbook: A Spreadsheet for Cloud Data Warehouses
- BigBird: Big Data Storage and Analytics at Scale in Hybrid Cloud
- The Case for Distributed Shared-Memory Databases with RDMA-Enabled Memory Disaggregation
- Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
- Writes Hurt: Lessons in Cache Design for Optane NVRAM
- Efficient Compactions Between Storage Tiers with PrismDB
- RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL
- Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory
- How to use Persistent Memory in your Database
- Relational Memory: Native In-Memory Accesses on Rows and Columns
- Key-Value Stores on Flash Storage Devices: A Survey
- Palpatine: Mining Frequent Sequences for Data Prefetching in NoSQL Distributed Key-Value Stores
- SciTS: A Benchmark for Time-Series Database in Scientific Experiments and Industrial Internet of Things
- OLxPBench: Real-time, Semantically Consistent, and Domain-specific are Essential in Benchmarking, Designing, and Implementing HTAP Systems
- DBSP: Automatic Incremental View Maintenance for Rich Query Languages
- Prefix Filter: Practically and Theoretically Better Than Bloom
- Separate and conquer heuristic allows robust mining of contrast sets from various types of data
- Discovering Process Models from Uncertain Event Data
- QUIP: Query-driven Missing Value Imputation
- Givens QR Decomposition over Relational Databases
- Differentially Private Linear Sketches: Efficient Implementations and Applications
- LDP-IDS: Local Differential Privacy for Infinite Data Streams
- SeeSaw: interactive ad-hoc search over image databases
- bloomRF: On Performing Range-Queries in Bloom-Filters with Piecewise-Monotone Hash Functions and Prefix Hashing
- Towards Observability for Production Machine Learning Pipelines
- HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints
- Real-Time LSM-Trees for HTAP Workloads
- Sub-O(log n) Out-of-Order Sliding-Window Aggregation
-
Categories
Sub Categories