Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


awesome-modern-bigdata

A list of awesome modern big data libraries, frameworks and platforms.
https://github.com/yangliuyu/awesome-modern-bigdata


  • Ingestion

  • File Storage

    • Fluid - Kubernetes-native distributed dataset orchestrator and accelerator for data-intensive applications, such as big data and AI applications.
    • MinIO - High-performance, S3-compatible object storage.
  • OLAP Query Engine

  • Database

    • StarRocks - Next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
    • TiKV - Open-source, distributed, transactional key-value database. Unlike traditional NoSQL systems, TiKV provides not only classical key-value APIs but also transactional APIs with ACID compliance.
  • Data Lake

    • Flink Table Store
    • Iceberg - High-performance format for huge analytic tables. Iceberg brings the reliability and simplicity of SQL tables to big data, while making it possible for engines like Spark, Trino, Flink, Presto, and Hive to safely work with the same tables at the same time.
  • Data Analytics

    • Zeppelin - Web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive, and collaborative documents with SQL, Scala, and more.
  • Data Visualization

  • Orchestration

    • StreamPipes - Self-service (Industrial) IoT toolbox that enables non-technical users to connect, analyze, and explore IoT data streams.