Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Apache Cassandra

Apache Cassandra is a free, open source, distributed, wide column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

https://github.com/dimits-ts/large-scale-data

Distributed computing for data science tasks, executed on a Ubuntu server.

cassandra kafka map-reduce spark vagrant

Last synced: 21 Jan 2025

https://github.com/sangqle/message-platform

Implement a high-load messaging system, use Cassandra for heavy writing, and Kafka for a message broker and message queue system

cassandra high-performance kafka message-broker mysql redis

Last synced: 11 Jan 2025

https://github.com/navicore/azureblobtocassandra

A demo app to exercise reading from Azure blob storage and writing to Cassandra from Spark 2.x

apache-cassandra apache-spark azure-blob azure-storage cassandra spark

Last synced: 21 Jan 2025

https://github.com/leleueri/dw-cassandra-healthcheck

A Dropwizard HealthCheck implementation for Cassandra cluster. This implementation allow to check the ConsistencyLevel requirement on a keyspace.

apache-cassandra cassandra cassandra-database

Last synced: 21 Jan 2025

https://github.com/m0t0k1ch1/zabbix-cassandra-template

a Zabbix template for Cassandra

cassandra jmx zabbix

Last synced: 21 Jan 2025

https://github.com/kurtosis-tech/cassandra-package

A Kurtosis Starlark Package that spins up a Cassandra Network

cassandra distributed-systems docker-compose kurtosis kurtosis-package

Last synced: 03 Nov 2024

https://github.com/sgangopadhyay/python-cassandra

Basic CRUD Operations using the Python Cassandra Driver

apache apache-cassandra cassandra cassandra-database python python3

Last synced: 21 Jan 2025

https://github.com/vuanhtuan1012/data-modeling-with-cassandra

Design an Apache Cassandra database which can create queries on song play data to answer the questions of the analysis team of a music streaming application.

apache-cassandra cassandra etl-pipeline jupyter-notebook music-streaming-application python3

Last synced: 21 Jan 2025

https://github.com/dina-hosny/learning-apache-cassandra

Simple Tasks to Learn and Practice the Apache Cassandra NoSQL database queries.

apache apache-cassandra cassandra cassandra-database cql nosql

Last synced: 21 Jan 2025

https://github.com/dmarks84/coursework_capstone_full_data_engineering

Final Project for IBM Data Engineering & Python Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification

apache-airflow apache-hadoop apache-kafka apache-spark api beautifulsoup cassandra dags etl mongodb nosql pandas plotly postgresql python scipy seaborn sql

Last synced: 21 Jan 2025

https://github.com/saurabhkumarr99/reader-s_haven

Reader's Haven is an online BookStore . This is developed in Spring Boot (Java 17) , Postgress Db ,Cassandra DB , Redis Catch.

cassandra jwt-token kafka postgresql redis spring-boot spring-security

Last synced: 21 Jan 2025

https://github.com/pixelcaliber/chat-app-message-service

chat-application messaging service: Enables one-to-one messaging using websockets, load balanced using nginx and uses redis cluster for caching

cassandra cql-queries flask-application messa python redis socket-io

Last synced: 21 Jan 2025

https://github.com/kuzznya/timepicker

Simple project to choose the date for the event with friends. It uses Kotlin, Quarkus, Kafka, Cassandra

cassandra kafka kotlin quarkus websockets

Last synced: 09 Jan 2025

https://github.com/mrgraversen/docker-distributed-hashing

💻 Distributed hashing example using Docker 🐳 Compose, Spring Cloud Eureka, Zuul, Redis, and Cassandra

cassandra eureka redis spring-cloud zuul-proxy

Last synced: 29 Jan 2025

https://github.com/ankitjaadoo/web-scraping-with-python-fastapi-celery-nosql

Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.

astradb cassandra cassandra-driver celery fastapi nosql-databases python python3 requests-html scheduled-tasks

Last synced: 21 Jan 2025

https://github.com/ilieschibane/projet-iot-cloud-bigdata

Implémentation d'une pipeline permettant de faire la prédiction de la maladie de parkinson via des outils d'IoT, Cloud, et Big Data

big-data cassandra cloud flask hadoop-hdfs iot kafka machine-learning mongodb mqtt python rest-api sickit-learn spark

Last synced: 21 Jan 2025

https://github.com/akhich551995/data-streaming-project-airflow-kafka-spark-t-cassandra-docker

building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.

airflow airflow-dags cassandra docker kafka postgresql python spark zookeeper

Last synced: 21 Jan 2025

https://github.com/bluecube246/youtube-streaming-pipeline

Data pipeline that sends raw youtube data to the Kafka producer. The data is processed using Spark Streaming where the data is cleaned with sentiment analysis. Final output is saved to Cassandra

cassandra kafka sentiment-analysis spark-streaming youtube-api

Last synced: 21 Jan 2025

https://github.com/mmncit/rucasrar-event-manager

Simple event sourcing with RabbitMQ, Redis, Cassandra

cache cassandra message-queue rabbitmq rails redis

Last synced: 19 Jan 2025

https://github.com/mmikhail2001/highload_youtube

Расчетно-пояснительная записка. Проектирование высоконагруженного сервиса YouTube.

bgp-anycast cassandra cdn clickhouse ecmp envoy geo-dns internet-exchange k8s kafka load-balancing mapreduce s3 tarantool

Last synced: 23 Jan 2025

https://github.com/pregismond/working-with-nosql-databases

Final Assignment Submission: Working with NoSQL Databases

cassandra coursera ibm-cloud ibm-cloudant ibm-skills-network mongodb nosql

Last synced: 21 Jan 2025

https://github.com/aykhans/oh-my-url

Simple url shortener implementation with go and postgresql / cassandra.

cassandra go postgresql url-shortener

Last synced: 21 Jan 2025

https://github.com/komodoooo/cqldump

A primitive cassandra dumper

cassandra cassandra-export cqldump

Last synced: 21 Jan 2025

https://github.com/saadkh1/real-time_sales_data_pipeline_kafa_spark_cassandra_redash

This repository implements a real-time sales data pipeline leveraging Apache Kafka, Apache Spark, Apache Cassandra, and Redash. It facilitates the efficient ingestion, processing, storage, and visualization of sales data streams.

cassandra fastapi kafka redash spark

Last synced: 21 Jan 2025

https://github.com/anant/example-cql-arithmetic-operators

CQL Arithmetic Operators are now supported in Cassandra 4.0!

cassandra cql docker gitpod

Last synced: 19 Jan 2025

https://github.com/mark-eskander/health-monitoring-system-samsung-capstone

Project simulates health monitoring data visualization by utilizing a real-time data pipeline built with Kafka and Spark. The pipeline streams health data, processes it using Spark, and feeds it into Power BI for dynamic visualization, enabling real-time monitoring and insights.

cassandra data-engineering docker kafka mongodb powerbi spark spark-streaming

Last synced: 01 Feb 2025

https://github.com/anant/example-cassandra-cql-copy

Learn how to do data operations with CQL Copy

cassandra cql csv docker

Last synced: 19 Jan 2025

https://github.com/anant/example-cassandra-nifi

Learn how to connect Apache Cassandra and Apache Nifi

apache cassandra cqlsh docker nifi

Last synced: 19 Jan 2025

https://github.com/anant/cassandra.sitecore

Tools to connect Sitecore with real time Cassandra

cassandra sitecore

Last synced: 19 Jan 2025

https://github.com/sumukhahe/click-event-analysis

The project is it capture , Monitor and analyze user click events on the e-commerce website, specifically focusing on instances where users explore product pages but do not complete purchases.

cassandra kafka python3 scala spark-sql

Last synced: 21 Jan 2025

https://github.com/bartekbh/akka-shop

REST API with event sourcing for online shop written in Scala and Akka

akka akka-http akka-persistence cassandra event-sourcing scala

Last synced: 21 Jan 2025

https://github.com/smatiolids/aws-glue-astra-loader

How to load data from AWS S3 to AstraDB/Cassandra using AWS GLue

astradb aws-glue cassandra pyspark

Last synced: 21 Jan 2025

https://github.com/derder3010/django-cockroach-astra

Django Project with Django REST Framework, SimpleJWT, Cockroachlabs, Astra Cassandra, Redis, and R2 Cloudflare

astra cassandra cloudflare cockroach django django-rest-framework python

Last synced: 21 Jan 2025

https://github.com/skngetich/teamcloud-setup

This is setup for teamcloud using docker

cassandra docker

Last synced: 21 Jan 2025

https://github.com/bousettayounes/real-time-user-data-streaming

Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system

airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming

Last synced: 09 Nov 2024

https://github.com/james-leste/big-data-platform

This is a course project focusing on designing, implementing and operating a big data platform

cassandra javascript mongodb python

Last synced: 21 Jan 2025

https://github.com/nunum/cassandra-phantom-scala-driver

This is an example/tutorial demonstrating how to use phantom Cassandra driver

cassandra driver phantom scala toturial

Last synced: 24 Jan 2025

https://github.com/ecomclub/cassandra-to-csv

Shell script to import and export Cassandra table data to CSV

cassandra csv-export

Last synced: 23 Jan 2025

https://github.com/n3011/airbnb_template

Simple demo for Airbnb like property management app

cassandra flask

Last synced: 29 Jan 2025

https://github.com/manenko/cassander

Cassandra driver for Rust which utilizes the DataStax C/C++ driver

cassandra cassandra-driver database-driver rust

Last synced: 21 Jan 2025

https://github.com/findinpath/cassandra-migration-spring-boot-demo

Proof of concept on how to perform Cassandra database schema migrations on application startup.

cassandra migration spring-boot testcontainers

Last synced: 29 Jan 2025

https://github.com/travelxml/netflix-clone-with-astradb-graphql-prod

Netflix Clone with Cassandra, GraphQL and Node, it's Prod Release

astradb cassandra cassandra-database cloud graphql node node-module nodejs

Last synced: 21 Jan 2025

https://github.com/adamatti/learncassandra

Pet project to play with Cassandra and Spring

cassandra groovy java jvm spring

Last synced: 19 Jan 2025

https://github.com/anthonytedja/rplace

r/Place Collaborative Canvas Distributed System with AWS Services

aws cassandra docker express pubsub redis websockets

Last synced: 30 Jan 2025

https://github.com/anthonytedja/redirectv2

URL Shortener Service - High Level

cassandra docker docker-swarm java redis visualizer

Last synced: 30 Jan 2025

https://github.com/sapvs/cassandra-docker

Low profile cassandra image on alpine linux, Uses headless JRE

alpine-image cassandra docker

Last synced: 05 Jan 2025

https://github.com/himanshuchopade97/socialmediaengagementanalysis

Analyze social media data effortlessly with DataStax AstraDB and LangFlow. This project integrates scalable cloud-based storage with AI-driven workflows using LangChain, OpenAI, and Google GenAI to uncover performance insights and trends. Ideal for analysts, marketers, and AI enthusiasts.

astradb cassandra datastax google-generative-ai langflow openai

Last synced: 05 Jan 2025

https://github.com/findinpath/cassandra-select-distinct-partition-keys

Demo on how to select the distinct partition keys of a Cassandra table

cassandra distinct-partition-keys testcontainers

Last synced: 29 Jan 2025

https://github.com/findinpath/search-alert

Proof of concept project on implementing both near-real-time & batched search agent functionality

cassandra elasticsearch kafka kafka-consumer percolator

Last synced: 29 Jan 2025

https://github.com/findinpath/spring-data-cassandra-repository-methods-timing

Proof of concept on timing spring data cassandra repository methods

cassandra micrometer monitoring testcontainers

Last synced: 29 Jan 2025

https://github.com/samirprakash/go-bookstore

REST and oAUTH micro services in Go with Cassandra, MySQL and ElasticSearch

cassandra ddd-architecture domain-driven-design golang microservices modelviewcontrollerpattern mvc-architecture postgresql

Last synced: 26 Jan 2025

https://github.com/samdvr/simplesparkstreaming

Simple Kafka SparkStreaming example app

apache-spark cassandra kafka

Last synced: 03 Feb 2025

https://github.com/bilalvdemir/cassandra-spring-example

Spring Boot Connect To Cassadra and Execute Query

cassandra cassandra-cql spring-boot

Last synced: 17 Dec 2024

https://github.com/simplyatul/cassandrademo

A cache using Spring-boot and Cassandra

cassandra java spring-boot

Last synced: 16 Jan 2025

https://github.com/konradmalik/scala-seed

Seed project for dockerized Scala with included Spark and Cassandra.

cassandra docker makefile multimodule sbt scala seed spark template typesafe-config

Last synced: 17 Jan 2025

https://github.com/dedpixta/distributed-computing

microservice application for tweet storage and processing

cassandra kafka postrgesql redis

Last synced: 21 Jan 2025

https://github.com/itsmandrew/de-e2e-kakfatest

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage. All components are containerized with Docker for easy deployment and scalability.

airflow apache-kafka apache-spark apache-zooker cassandra data-engineering docker etl-pipeline

Last synced: 17 Jan 2025

https://github.com/vitalvas/cassandra-redis-proxy

Redis Server with Cassandra as storage backend

cassandra redis redis-proxy redis-proxy-service redis-server

Last synced: 19 Nov 2024

https://github.com/fabritsius/investor

Helper tool for managing your stock assets

cassandra golang grpc investing microservices tinkoff

Last synced: 21 Jan 2025

https://github.com/riptl/cqlcopy

Efficient replacement for Cassandra's cqlsh COPY

apache-cassandra big-data cassandra cassandra-cql pipeline

Last synced: 23 Jan 2025

https://github.com/sanogotech/cassandradbkillrvideo-sample-schema

Sample Cassandra CQL Schema.

cassandra nosql

Last synced: 23 Jan 2025

https://github.com/duyledat197/messenger

Design a messenger platform that can serve for around more than 100M users. The platform supports web and mobile apps(android, ios).

cassandra clean-code golang grpc grpc-ecosystem grpc-gateway mqtt opensearch postgresql protoc redis scylladb swagger webrtc websocket

Last synced: 23 Jan 2025

https://github.com/shixi99/spam-classification-rest-api

Spam Classification Rest API using Keras, FastAPI & NoSQL

cassandra fastapi keras nosql python tensorflow

Last synced: 05 Jan 2025

https://github.com/sanogotech/docker-airflowsparkkafkadata-engineeringend-to-end

Docker Apache Airflow Data Engineering End-to-End Project — Spark, Kafka, Airflow, Docker, Cassandra, Python

airflow cassandra cassandra-database dataengineering docker kafka python spark

Last synced: 23 Jan 2025

https://github.com/oneananda/nosql_deepdive

Welcome to NoSQL_DeepDive, a comprehensive repository designed to explore and understand the various aspects of NoSQL databases. This repository aims to provide in-depth knowledge, practical examples, and advanced techniques for working with different types of NoSQL databases.

arangodb cassandra couchdb dynamodb hbase mongodb neo4j nosql nosql-database redis

Last synced: 28 Dec 2024

https://github.com/martishin/cqrs-akka-kotlin-example

CQRS and Event Sourcing-based hotel management system, built using Kotlin, Ktor, Akka, and Cassandra

akka cassandra cqrs docker event-sourcing gradle kotlin kotlin-coroutines ktor

Last synced: 31 Oct 2024

https://github.com/deathhunterx/simple-datapipeline

A simple data pipeline using Apache Kafka, Cassandra and Jupyter Notebook

apache-kafka cassandra jupyter-notebook

Last synced: 16 Dec 2024

https://github.com/bousettayounes/real-time-processing-of-users-data

Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system

airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming

Last synced: 05 Jan 2025

https://github.com/mkorangestripe/platform

Automation, build, and performance testing utilities

apache-tomcat cassandra python spinnaker vsphere

Last synced: 20 Jan 2025