Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Apache Cassandra

Apache Cassandra is a free, open source, distributed, wide column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

https://github.com/viklover/storageservice

In-memory key-value storage via binary splay tree

aspnetcore cassandra dotnet splay-tree

Last synced: 14 Jan 2025

https://github.com/oracle-quickstart/oci-cassandra

Terraform module to deploy Cassandra on Oracle Cloud Infrastructure (OCI)

cassandra cloud nosql oci oracle oracle-led terraform

Last synced: 07 Nov 2024

https://github.com/ansrivas/fwatcher

An application to watch a given directory for new files, read it and publish to Kafka ( using actors )

actors cassandra filewatcher golang kafka protoactor-go

Last synced: 28 Jan 2025

https://github.com/findinpath/cassandra-select-distinct-partition-keys

Demo on how to select the distinct partition keys of a Cassandra table

cassandra distinct-partition-keys testcontainers

Last synced: 29 Jan 2025

https://github.com/findinpath/search-alert

Proof of concept project on implementing both near-real-time & batched search agent functionality

cassandra elasticsearch kafka kafka-consumer percolator

Last synced: 29 Jan 2025

https://github.com/findinpath/spring-data-cassandra-repository-methods-timing

Proof of concept on timing spring data cassandra repository methods

cassandra micrometer monitoring testcontainers

Last synced: 29 Jan 2025

https://github.com/zsomborjoel/pyspark-streaming

Pocs with Kafka and Spark streaming by using Python

cassandra hbase kafka spark-streaming

Last synced: 11 Feb 2025

https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer

This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.

cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark

Last synced: 13 Feb 2025

https://github.com/anicetkeric/spring-webflux-cassandra

REST CRUD API with Spring WebFlux and Spring Data Cassandra.

cassandra cassandra-cql docker docker-compose spring-boot spring-webflux

Last synced: 29 Dec 2024

https://github.com/simplyatul/cassandrademo

A cache using Spring-boot and Cassandra

cassandra java spring-boot

Last synced: 16 Jan 2025

https://github.com/samirprakash/go-bookstore

REST and oAUTH micro services in Go with Cassandra, MySQL and ElasticSearch

cassandra ddd-architecture domain-driven-design golang microservices modelviewcontrollerpattern mvc-architecture postgresql

Last synced: 26 Jan 2025

https://github.com/martishin/cqrs-hotel-management

CQRS and Event Sourcing-based hotel management system, built using Kotlin, Ktor, Akka, and Cassandra

akka cassandra cqrs docker event-sourcing gradle kotlin kotlin-coroutines ktor

Last synced: 11 Feb 2025

https://github.com/rizkimufrizal/cassandra-data

Library for Testing Insert And Read Performance Cassandra Database

apache-thrift cassandra performance performance-test

Last synced: 08 Jan 2025

https://github.com/prekshivyas/datastreamingetl

Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming pipeline

apache-airflow apache-kafka apache-spark apache-zookeeper cassandra data-engineering data-ingestion data-pipeline data-processing data-visualization docker docker-compose

Last synced: 13 Feb 2025

https://github.com/samdvr/simplesparkstreaming

Simple Kafka SparkStreaming example app

apache-spark cassandra kafka

Last synced: 03 Feb 2025

https://github.com/andromedarabbit/docker-dd-agent-cassandra

dd-agent with cassandra nodetool

cassandra datadog docker

Last synced: 12 Jan 2025

https://github.com/gurbaj5124871/chat-microservice-http-server

Chat microservice HTTP server (expressJS, cassandra, mongodb and redis)

cassandra expressjs microservice mongodb nodejs redis

Last synced: 13 Feb 2025

https://github.com/knands42/data-modeling-with-cassandra

Build a simple ETL and model a few cassandra databases to recieve this values and query from it

cassandra etl python

Last synced: 25 Jan 2025

https://github.com/bilalvdemir/cassandra-spring-example

Spring Boot Connect To Cassadra and Execute Query

cassandra cassandra-cql spring-boot

Last synced: 09 Feb 2025

https://github.com/flynnfc/bagginsdb

⚡A cassandra inspired distrubuted wide column nosql database⚡

cassandra database distrubuted-systems go nosql

Last synced: 11 Jan 2025

https://github.com/hengxin/lwt-tla

TLA+ Specification of LWT (Lightweight Transactions) in Cassandra, ScyllaDB, and CASPaxos

caspaxos cassandra lightweight-transactions paxos scylladb tla

Last synced: 07 Jan 2025

https://github.com/idealista/cassandra_role

Ansible role to install an Apache Cassandra server/cluster

ansible ansible-role cassandra cassandra-cluster cassandra-database debian

Last synced: 06 Feb 2025

https://github.com/dinel13/anak-unhas-be

web service as beckend for https://anak-unhas.web.app/ who enable student to search other student and chat them

cassandra golang mongodb websocket

Last synced: 13 Feb 2025

https://github.com/konradmalik/scala-seed

Seed project for dockerized Scala with included Spark and Cassandra.

cassandra docker makefile multimodule sbt scala seed spark template typesafe-config

Last synced: 17 Jan 2025

https://github.com/mramshaw/python_cassandra

Getting familiar with accessing Cassandra from Python

cassandra cassandra-database cassandra-driver cql cqlsh database docker python

Last synced: 14 Jan 2025

https://github.com/sombriks/sample-cassandra

exploratory project to see how to consume Apache Cassandra / AWS Keyspaces from spring-boot-kotlin project

cassandra docker docker-compose keyspaces kotlin spring spring-boot spring-web-mvc testcontainers

Last synced: 16 Jan 2025

https://github.com/castaglia/proftpd-mod_sql_cassandra

ProFTPD module for interacting with Cassandra

c cassandra proftpd

Last synced: 03 Feb 2025

https://github.com/itsmandrew/de-e2e-kakfatest

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage. All components are containerized with Docker for easy deployment and scalability.

airflow apache-kafka apache-spark apache-zooker cassandra data-engineering docker etl-pipeline

Last synced: 17 Jan 2025

https://github.com/alokjani/vagrant.cassandra-cluster

Apache Cassandra Cluster with Ansible on CentOS 7

cassandra cluster vagrant

Last synced: 11 Feb 2025

https://github.com/andfanilo/lyon2-nosql

Redis/mongo/orientdb tutorials with Jupyter notebooks. Used for tutorial at university.

cassandra elasticsearch jupyter-notebook mongodb nosql orientdb python redis vagrant

Last synced: 03 Nov 2024

https://github.com/tohideq/users-list-demo

A simple C# project to learn Cassandra Database and File Class

cassandra csharp

Last synced: 18 Feb 2025

https://github.com/tkrs/agni

Fully functional Cassandra client for Scala

cassandra cats-effect monix shapeless twitter-util

Last synced: 21 Jan 2025

https://github.com/dariocm/taco-cloud

Spring boot in action project example

cassandra jpa lombok maven spring-boot spring-mvc thymeleaf

Last synced: 21 Jan 2025

https://github.com/ndomah/data-engineering

Links to data engineering projects and learning materials.

airflow aws azure cassandra data-engineering databricks elt etl kafka pipelines snowflake

Last synced: 15 Feb 2025

https://github.com/erikpelli/bigmetric

Scalable system to collect data from multiple temperature sensors using Spring Boot

cassandra cluster docker grafana java kafka microservices spring

Last synced: 15 Feb 2025

https://github.com/thriving-dev/social-platform-feed-reactive-cassandra

POC Implementation of a social platform feed backend, with an RESTful API. Built with reacting programming using Kotlin, Quarkus and Mutiny, with ScyllaDB as the database (Apache Cassandra Protocol). This POC was featured on #WeAreDevelopers World Congress 2024.

cassandra cassandra-cql gradle kotlin quarkus reactive-programming scylladb

Last synced: 10 Feb 2025

https://github.com/satta/balboa-backend-cassandra

🛑 Experimental Cassandra backend for balboa

balboa cassandra golang passive-dns pdns

Last synced: 07 Feb 2025

https://github.com/rhzs/sentry-cassandra-docker

Sentry with Cassandra ScyllaDB as Nodestorage in docker

cassandra docker scylladb sentry

Last synced: 18 Feb 2025

https://github.com/jamestang12/bookstore-items-api

Book store API is a microservice architecture API that group 3 individual API and 2 custom build library into a docker container which is highly maintainable and testable

cassandra docker elasticsearch gin gocql golang mux mysql

Last synced: 18 Feb 2025

https://github.com/axonops/axonops-kafka-cassandra-demo

A series of demo working applications showing Cassandra® and Kafka® working together using AxonOps™

axonops cassandra flink kafka

Last synced: 10 Feb 2025

https://github.com/mrkem598/info-handling-cassandra-app

:memo: Info-handling-cassandra-app. A single page web application with REST for information handling. Student information created using cassandra db and maven as a project directory. The following tools and frameworks are applied to develop the app. Spring v4.0, Cassandra, AngularJS V1.3.0, Bootstrap v3.1.1, HTML4, CSS3,

angular bootstrap cassandra css html java jboss maven rest-api spring

Last synced: 03 Feb 2025

https://github.com/riptl/cqlcopy

Efficient replacement for Cassandra's cqlsh COPY

apache-cassandra big-data cassandra cassandra-cql pipeline

Last synced: 23 Jan 2025

https://github.com/nezorflame/tuidconv

Small utility for the extraction of datetime from UUID v1 and v2

cassandra datetime go golang timeuuid

Last synced: 03 Feb 2025

https://github.com/ysden123/ys-scala-cassandra

Lightweight wrapper over the DataStax Java Driver for Cassandra

cassandra datastax driver scala wrapper

Last synced: 27 Jan 2025

https://github.com/sanogotech/cassandradbkillrvideo-sample-schema

Sample Cassandra CQL Schema.

cassandra nosql

Last synced: 23 Jan 2025

https://github.com/mvharsh/big-data

This repository contains all my Big Data files

cassandra database mongodb neo4j oracle-database weka

Last synced: 13 Feb 2025

https://github.com/duyledat197/messenger

Design a messenger platform that can serve for around more than 100M users. The platform supports web and mobile apps(android, ios).

cassandra clean-code golang grpc grpc-ecosystem grpc-gateway mqtt opensearch postgresql protoc redis scylladb swagger webrtc websocket

Last synced: 23 Jan 2025

https://github.com/sanogotech/docker-airflowsparkkafkadata-engineeringend-to-end

Docker Apache Airflow Data Engineering End-to-End Project — Spark, Kafka, Airflow, Docker, Cassandra, Python

airflow cassandra cassandra-database dataengineering docker kafka python spark

Last synced: 23 Jan 2025

https://github.com/michelderu/cassandra-on-kubernetes

Example on deploying a 3 node Cassandra cluster on Kubernetes using Minikube and cass-operator.

cass-operator cassandra kubernetes minikube

Last synced: 20 Jan 2025

https://github.com/smok-serwis/cassandra-docker-dev

A Cassandra image for development only. Supports writing in a pre-provided schema.

cassandra ci docker nosql nosql-database

Last synced: 28 Dec 2024

https://github.com/bousettayounes/real-time-user-data-streaming

Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system

airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming

Last synced: 09 Nov 2024

https://github.com/dananderson/cassandra-test

Cassandra Test is a Java test framework for writing unit tests and integration tests against a Cassandra database.

cassandra cassandra-database cassandra-test java spring-test unit-testing

Last synced: 21 Jan 2025

https://github.com/dimits-ts/large-scale-data

Distributed computing for data science tasks, executed on a Ubuntu server.

cassandra kafka map-reduce spark vagrant

Last synced: 21 Jan 2025

https://github.com/navicore/azureblobtocassandra

A demo app to exercise reading from Azure blob storage and writing to Cassandra from Spark 2.x

apache-cassandra apache-spark azure-blob azure-storage cassandra spark

Last synced: 21 Jan 2025

https://github.com/leleueri/dw-cassandra-healthcheck

A Dropwizard HealthCheck implementation for Cassandra cluster. This implementation allow to check the ConsistencyLevel requirement on a keyspace.

apache-cassandra cassandra cassandra-database

Last synced: 21 Jan 2025

https://github.com/sgangopadhyay/python-cassandra

Basic CRUD Operations using the Python Cassandra Driver

apache apache-cassandra cassandra cassandra-database python python3

Last synced: 21 Jan 2025

https://github.com/vuanhtuan1012/data-modeling-with-cassandra

Design an Apache Cassandra database which can create queries on song play data to answer the questions of the analysis team of a music streaming application.

apache-cassandra cassandra etl-pipeline jupyter-notebook music-streaming-application python3

Last synced: 21 Jan 2025

https://github.com/dina-hosny/learning-apache-cassandra

Simple Tasks to Learn and Practice the Apache Cassandra NoSQL database queries.

apache apache-cassandra cassandra cassandra-database cql nosql

Last synced: 21 Jan 2025

https://github.com/dmarks84/coursework_capstone_full_data_engineering

Final Project for IBM Data Engineering & Python Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification

apache-airflow apache-hadoop apache-kafka apache-spark api beautifulsoup cassandra dags etl mongodb nosql pandas plotly postgresql python scipy seaborn sql

Last synced: 21 Jan 2025

https://github.com/mmncit/rucasrar

Simple Rails application using Cassandra NoSQL database

cassandra cassandra-database cequel rails5 rails5-app

Last synced: 09 Feb 2025

https://github.com/yosrak5/data-streaming

This project involves the development of a robust data engineering pipeline that orchestrates the seamless ingestion, processing, and storage of data .

airflow-dags apache cassandra docker etl kafka python spark

Last synced: 11 Dec 2024

https://github.com/saurabhkumarr99/reader-s_haven

Reader's Haven is an online BookStore . This is developed in Spring Boot (Java 17) , Postgress Db ,Cassandra DB , Redis Catch.

cassandra jwt-token kafka postgresql redis spring-boot spring-security

Last synced: 21 Jan 2025

https://github.com/pixelcaliber/chat-app-message-service

chat-application messaging service: Enables one-to-one messaging using websockets, load balanced using nginx and uses redis cluster for caching

cassandra cql-queries flask-application messa python redis socket-io

Last synced: 21 Jan 2025

https://github.com/ankitjaadoo/web-scraping-with-python-fastapi-celery-nosql

Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.

astradb cassandra cassandra-driver celery fastapi nosql-databases python python3 requests-html scheduled-tasks

Last synced: 21 Jan 2025

https://github.com/ilieschibane/projet-iot-cloud-bigdata

Implémentation d'une pipeline permettant de faire la prédiction de la maladie de parkinson via des outils d'IoT, Cloud, et Big Data

big-data cassandra cloud flask hadoop-hdfs iot kafka machine-learning mongodb mqtt python rest-api sickit-learn spark

Last synced: 21 Jan 2025

https://github.com/bujowskis/put-bd-project

Distributed system for library management

cassandra docker docker-compose flask

Last synced: 09 Feb 2025

https://github.com/intina47/hopper

e-commerce crawler

cassandra cpp curl gumbo webcrawler

Last synced: 23 Jan 2025

https://github.com/fcbento/cassandra-java

CRUD operations using Java, Spring Boot, Cassandra. Authentication and authorization. JWT

cassandra cassandra-database java spring-boot

Last synced: 09 Feb 2025

https://github.com/akhich551995/data-streaming-project-airflow-kafka-spark-t-cassandra-docker

building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.

airflow airflow-dags cassandra docker kafka postgresql python spark zookeeper

Last synced: 21 Jan 2025

https://github.com/bluecube246/youtube-streaming-pipeline

Data pipeline that sends raw youtube data to the Kafka producer. The data is processed using Spark Streaming where the data is cleaned with sentiment analysis. Final output is saved to Cassandra

cassandra kafka sentiment-analysis spark-streaming youtube-api

Last synced: 21 Jan 2025

https://github.com/pregismond/working-with-nosql-databases

Final Assignment Submission: Working with NoSQL Databases

cassandra coursera ibm-cloud ibm-cloudant ibm-skills-network mongodb nosql

Last synced: 21 Jan 2025

https://github.com/aykhans/oh-my-url

Simple url shortener implementation with go and postgresql / cassandra.

cassandra go postgresql url-shortener

Last synced: 21 Jan 2025