Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by apache

A curated list of projects in awesome lists by apache .

https://github.com/datafuselabs/opendal

Apache OpenDAL: access data freely.

azblob gcs hacktoberfest redis rust s3 storage

Last synced: 06 Aug 2024

https://github.com/apache/avro

Apache Avro is a data serialization system.

avro bigdata c cplusplus csharp dotnet java perl php python ruby

Last synced: 28 Oct 2024

https://github.com/apache/nutch

Apache Nutch is an extensible and scalable web crawler

apache crawling hadoop java nutch web-crawler

Last synced: 07 Oct 2024

https://github.com/apache/incubator-age

Graph database optimized for fast analysis and real-time data processing. It is provided as an extension to PostgreSQL.

age-database agensgraph analytics graph-database graphdb multi-model-dbms postgresql postgresql-extension

Last synced: 05 Aug 2024

https://github.com/apache/maven-mvnd

Apache Maven Daemon

apache-maven build-management java maven

Last synced: 12 Oct 2024

https://github.com/apache/datafusion-sqlparser-rs

Extensible SQL Lexer and Parser for Rust

parser rust sql

Last synced: 07 Oct 2024

https://github.com/apache/commons-lang

Apache Commons Lang

commons

Last synced: 15 Oct 2024

https://github.com/apache/pdfbox

Mirror of Apache PDFBox

content java library pdfbox

Last synced: 07 Oct 2024

https://github.com/apache/cassandra-gocql-driver

GoCQL Driver for Apache Cassandra®

cassandra client database driver go golang nosql

Last synced: 07 Oct 2024

https://github.com/gocql/gocql

GoCQL Driver for Apache Cassandra®

cassandra client database driver go golang nosql

Last synced: 22 Oct 2024

https://github.com/apache/incubator-hugegraph

A graph database that supports more than 100+ billion data, high performance and scalability (Include OLTP Engine & REST-API & Backends)

big-data database graph graph-database graphdb gremlin

Last synced: 09 Oct 2024

https://github.com/CeresDB/ceresdb

Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.

cloud-native database distributed-database horaedb iot-database rust sql timeseries-analysis timeseries-database tsdb

Last synced: 11 Aug 2024

https://github.com/apache/horaedb

Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.

cloud-native database distributed-database horaedb iot-database rust sql timeseries-analysis timeseries-database tsdb

Last synced: 15 Oct 2024

https://github.com/apache/flume

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

apache api flume java jvm library

Last synced: 09 Sep 2024

https://github.com/apache/logging-flume

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

apache api flume java jvm library

Last synced: 09 Oct 2024

https://github.com/apache/parquet-java

Apache Parquet Java

apache parquet parquet-java

Last synced: 07 Oct 2024

https://github.com/apache/incubator-devlake

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl golang hacktoberfest integration jira open-source user-friendly

Last synced: 01 Nov 2024

https://github.com/apache/nuttx

Apache NuttX is a mature, real-time embedded operating system (RTOS)

embedded mcu microcontroller nuttx real-time rtos

Last synced: 15 Oct 2024

https://github.com/apache/lucene

Apache Lucene open-source search software

backend information-retrieval java lucene nosql search search-engine

Last synced: 29 Oct 2024

https://github.com/apache/geode

Apache Geode

apache datagrid geode

Last synced: 29 Oct 2024

https://github.com/apache/incubator-seata-samples

Apache Seata(incubating) Samples for Java

Last synced: 07 Oct 2024

https://github.com/apache/tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

content extraction java metadata tika

Last synced: 28 Oct 2024

https://github.com/apache/arrow-rs

Official Rust implementation of Apache Arrow

Last synced: 28 Oct 2024

https://github.com/apache/dubbo-samples

samples for Apache Dubbo

dubbo

Last synced: 07 Oct 2024

https://github.com/apache/gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

apache data ingestion management replication

Last synced: 07 Oct 2024

https://github.com/apache/ambari

Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.

ambari big-data java javascript python

Last synced: 29 Oct 2024

https://github.com/apache/mahout

Mirror of Apache Mahout

java library mahout

Last synced: 07 Oct 2024

https://github.com/apache/rocketmq-spring

Apache RocketMQ Spring Integration

rocketmq

Last synced: 07 Oct 2024

https://github.com/apache/libcloud

Apache Libcloud is a Python library which hides differences between different cloud provider APIs and allows you to manage different cloud resources through a unified and easy to use API.

apache cloud libcloud library python

Last synced: 28 Oct 2024

https://github.com/apache/flink-table-store

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

big-data data-ingestion flink paimon real-time-analytics spark streaming-datalake table-store

Last synced: 10 Aug 2024

https://github.com/apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

big-data data-ingestion flink paimon real-time-analytics spark streaming-datalake table-store

Last synced: 12 Oct 2024

https://github.com/apache/tinkerpop

Apache TinkerPop - a graph computing framework

apache graph graph-database graphdb gremlin gremlin-server tinkerpop

Last synced: 29 Oct 2024

https://github.com/apache/kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

data-lake hacktoberfest hadoop hive jdbc kubernetes spark spark-sql sql thrift

Last synced: 12 Oct 2024

https://github.com/apache/incubator-pegasus

Apache Pegasus - A horizontally scalable, strongly consistent and high-performance key-value store

distributed-database key-value-store nosql pegasus

Last synced: 15 Oct 2024

https://github.com/apache/servicecomb-pack

Apache ServiceComb Pack is an eventually data consistency solution for micro-service applications. ServiceComb Pack currently provides TCC and Saga distributed transaction co-ordination solutions by using Alpha as a transaction coordinator and Omega as an transaction agent .

servicecomb

Last synced: 07 Oct 2024

https://github.com/apache/poi

Mirror of Apache POI

content java library poi

Last synced: 07 Oct 2024

https://github.com/apache/servicecomb-java-chassis

ServiceComb Java Chassis is a Software Development Kit (SDK) for rapid development of microservices in Java, providing service registration, service discovery, dynamic routing, and service management features

servicecomb

Last synced: 04 Aug 2024

https://github.com/apache/bookkeeper

Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads

apache big-data bookkeeper distributed-log distributed-systems wal

Last synced: 28 Oct 2024

https://github.com/apache/sedona

A cluster computing framework for processing large-scale geospatial data

cluster-computing geospatial java python scala spatial-analysis spatial-query spatial-sql

Last synced: 29 Oct 2024

https://github.com/apache/kudu

Mirror of Apache Kudu

big-data cplusplus kudu

Last synced: 07 Oct 2024

https://github.com/apache/atlas

Apache Atlas

atlas

Last synced: 29 Oct 2024

https://github.com/apache/cloudstack

Apache CloudStack is an opensource Infrastructure as a Service (IaaS) cloud computing platform

cloud cloudstack iaas infrastructure java kubernetes kvm libvirt orchestration python virtual-machine virtualization vmware vsphere xcp-ng xenserver

Last synced: 15 Oct 2024

https://github.com/apache/parquet-format

Apache Parquet Format

apache parquet parquet-format

Last synced: 29 Oct 2024

https://github.com/apache/trafficserver

Apache Traffic Server™ is a fast, scalable and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.

apache cache cdn forwardproxy hacktoberfest http2 http3 proxy quic reverseproxy

Last synced: 15 Oct 2024

https://github.com/apache/incubator-seata-go

Go Implementation For Seata

at saga tcc xa

Last synced: 07 Oct 2024

https://github.com/transaction-wg/seata-golang

Go Implementation For Seata

at saga tcc xa

Last synced: 04 Aug 2024

https://github.com/seata/seata-go

Go Implementation For Seata

at saga tcc xa

Last synced: 05 Aug 2024

https://github.com/apache/httpcomponents-client

Mirror of Apache HttpClient

httpcomponents

Last synced: 28 Oct 2024

https://github.com/apache/carbondata

High performance data store solution

apache big-data carbondata data-format hadoop java scala spark

Last synced: 07 Oct 2024

https://github.com/apache/guacamole-client

Mirror of Apache Guacamole Client

c guacamole java javascript network-client network-server

Last synced: 29 Oct 2024

https://github.com/apache/servicecomb-service-center

A standalone service center to allow services to register their instance information and to discover providers of a given service

servicecomb

Last synced: 07 Oct 2024

https://github.com/apache/cassandra-java-driver

Java Driver for Apache Cassandra®

Last synced: 15 Oct 2024

https://github.com/apache/datafusion-ballista

Apache Arrow Ballista Distributed Query Engine

arrow big-data dataframe distributed olap python query-engine rust sql

Last synced: 09 Oct 2024

https://github.com/apache/arrow-ballista

Apache Arrow Ballista Distributed Query Engine

arrow big-data dataframe distributed olap python query-engine rust sql

Last synced: 09 Sep 2024

https://github.com/apache/inlong

Apache InLong - a one-stop, full-scenario integration framework for massive data

data-streaming event-streaming framework full-scenario-service inlong massive-data-integration one-stop-service

Last synced: 15 Oct 2024

https://github.com/apache/struts

Mirror of Apache Struts

java struts web-framework

Last synced: 29 Oct 2024

https://github.com/apache/rocketmq-client-go

Apache RocketMQ go client

rocketmq

Last synced: 29 Oct 2024

https://github.com/apache/fineract

Apache Fineract

fineract

Last synced: 15 Oct 2024

https://github.com/apache/ratis

Open source Java implementation for Raft consensus protocol.

consensus consensus-protocol java raft

Last synced: 15 Oct 2024

https://github.com/apache/plc4x

PLC4X The Industrial IoT adapter

ab ads bacnet c can cpp ethernetip firmata go iot java knx modbus net opcua python s7 siemens

Last synced: 14 Oct 2024

https://github.com/apache/incubator-teaclave-sgx-sdk

Apache Teaclave (incubating) SGX SDK helps developers to write Intel SGX applications in the Rust programming language, and also known as Rust SGX SDK.

confidential-computing rust sgx tee trusted-execution-environment universal-secure-computing

Last synced: 01 Nov 2024

https://github.com/baidu/rust-sgx-sdk

Apache Teaclave (incubating) SGX SDK helps developers to write Intel SGX applications in the Rust programming language, and also known as Rust SGX SDK.

confidential-computing rust sgx tee trusted-execution-environment universal-secure-computing

Last synced: 07 Sep 2024

https://github.com/apache/griffin

Mirror of Apache griffin

griffin

Last synced: 07 Oct 2024

https://github.com/apache/nano

Nano is now part of Apache CouchDB. Repo moved to https://GitHub.com/apache/couchdb-nano

nano

Last synced: 01 Oct 2024

https://github.com/apache/pekko

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

actor-model cloud-native concurrency distributed-actors distributed-systems high-performance pekko reactive streaming

Last synced: 09 Oct 2024

https://github.com/apache/incubator-pekko

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

actor-model cloud-native concurrency distributed-actors distributed-systems high-performance pekko reactive streaming

Last synced: 09 Sep 2024

https://github.com/apache/impala

Apache Impala

impala

Last synced: 15 Oct 2024

https://github.com/apache/rocketmq-dashboard

The state-of-the-art Dashboard of Apache RoccketMQ provides excellent monitoring capability. Various graphs and statistics of events, performance and system information of clients and application is evidently made available to the user.

Last synced: 07 Oct 2024

https://github.com/apache/jena

Apache Jena

apache jena rdf sparql

Last synced: 15 Oct 2024

https://github.com/apache/accumulo

Apache Accumulo

accumulo big-data hacktoberfest

Last synced: 15 Oct 2024

https://github.com/apache/incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

arrow clickhouse simd spark-sql vectorization velox

Last synced: 07 Oct 2024

https://github.com/apache/polaris

The interoperable, open source catalog for Apache Iceberg

apache2 catalog iceberg

Last synced: 07 Oct 2024

https://github.com/apache/solr

Apache Solr open-source search software

backend information-retrieval java lucene nosql search search-engine solr

Last synced: 14 Oct 2024

https://github.com/apache/phoenix

Apache Phoenix

database hbase java phoenix sql

Last synced: 07 Oct 2024

https://github.com/apache/systemds

An open source ML system for the end-to-end data science lifecycle

dml java python systemds

Last synced: 15 Oct 2024

https://github.com/apache/trafficcontrol

Apache Traffic Control is an Open Source implementation of a Content Delivery Network

apache cdn contentdelivery hacktoberfest trafficcontrol

Last synced: 15 Oct 2024

https://github.com/apache/commons-io

Apache Commons IO

commons

Last synced: 07 Oct 2024

https://github.com/apache/usergrid

Mirror of Apache Usergrid

usergrid

Last synced: 01 Oct 2024

https://github.com/apache/apisix-dashboard

Dashboard for Apache APISIX

api api-management apisix dashboard devops

Last synced: 07 Oct 2024

https://github.com/apache/freemarker

Apache Freemarker

freemarker

Last synced: 07 Oct 2024

https://github.com/apache/openoffice

Apache OpenOffice

content cplusplus openoffice

Last synced: 07 Oct 2024

https://github.com/apache/mina

Mirror of Apache MINA

java mina network-client network-server

Last synced: 07 Oct 2024

https://github.com/apache/ranger

Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond

apache authz docker java python ranger

Last synced: 29 Oct 2024

https://github.com/apache/flink-training

Apache Flink Training Excercises

Last synced: 07 Oct 2024

https://github.com/apache/mina-sshd

Apache MINA sshd is a comprehensive Java library for client- and server-side SSH.

apache java library ssh

Last synced: 29 Oct 2024

https://github.com/apache/incubator-livy

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

apachelivy bigdata livy spark

Last synced: 07 Oct 2024

https://github.com/apache/celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

bigdata shuffle spark

Last synced: 29 Oct 2024