Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Apache Cassandra
![](https://explore-feed.github.com/topics/cassandra/cassandra.png)
Apache Cassandra is a free, open source, distributed, wide column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.
- GitHub: https://github.com/topics/cassandra
- Wikipedia: https://en.wikipedia.org/wiki/Apache_Cassandra
- Repo: https://github.com/apache/cassandra
- Created by: Apache Software Foundation
- Released: July 2008
- Related Topics: language, dotnet,
- Aliases: apache-cassandra,
- Last updated: 2025-02-13 00:04:43 UTC
- JSON Representation
https://github.com/mvharsh/big-data
This repository contains all my Big Data files
cassandra database mongodb neo4j oracle-database weka
Last synced: 13 Feb 2025
https://github.com/duyledat197/messenger
Design a messenger platform that can serve for around more than 100M users. The platform supports web and mobile apps(android, ios).
cassandra clean-code golang grpc grpc-ecosystem grpc-gateway mqtt opensearch postgresql protoc redis scylladb swagger webrtc websocket
Last synced: 23 Jan 2025
https://github.com/sadafasad/realtime-data-streaming
Realtime user streaming data pipeline
apache-airflow apache-kafka apache-spark api cassandra python shell-script
Last synced: 21 Jan 2025
https://github.com/sanogotech/docker-airflowsparkkafkadata-engineeringend-to-end
Docker Apache Airflow Data Engineering End-to-End Project — Spark, Kafka, Airflow, Docker, Cassandra, Python
airflow cassandra cassandra-database dataengineering docker kafka python spark
Last synced: 23 Jan 2025
https://github.com/saurabhkumarr99/reader-s_haven
Reader's Haven is an online BookStore . This is developed in Spring Boot (Java 17) , Postgress Db ,Cassandra DB , Redis Catch.
cassandra jwt-token kafka postgresql redis spring-boot spring-security
Last synced: 21 Jan 2025
https://github.com/pixelcaliber/chat-app-message-service
chat-application messaging service: Enables one-to-one messaging using websockets, load balanced using nginx and uses redis cluster for caching
cassandra cql-queries flask-application messa python redis socket-io
Last synced: 21 Jan 2025
https://github.com/ankitjaadoo/web-scraping-with-python-fastapi-celery-nosql
Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.
astradb cassandra cassandra-driver celery fastapi nosql-databases python python3 requests-html scheduled-tasks
Last synced: 21 Jan 2025
https://github.com/hatamiarash7/kubernetes-cassandra
Deploy Apache Cassandra in Kubernetes
apache apache-cassandra cassandra cassandra-cluster cassandra-database database kubernetes
Last synced: 21 Jan 2025
https://github.com/ilieschibane/projet-iot-cloud-bigdata
Implémentation d'une pipeline permettant de faire la prédiction de la maladie de parkinson via des outils d'IoT, Cloud, et Big Data
big-data cassandra cloud flask hadoop-hdfs iot kafka machine-learning mongodb mqtt python rest-api sickit-learn spark
Last synced: 21 Jan 2025
https://github.com/akhich551995/data-streaming-project-airflow-kafka-spark-t-cassandra-docker
building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.
airflow airflow-dags cassandra docker kafka postgresql python spark zookeeper
Last synced: 21 Jan 2025
https://github.com/himanshuchopade97/socialmediaengagementanalysis
Analyze social media data effortlessly with DataStax AstraDB and LangFlow. This project integrates scalable cloud-based storage with AI-driven workflows using LangChain, OpenAI, and Google GenAI to uncover performance insights and trends. Ideal for analysts, marketers, and AI enthusiasts.
astradb cassandra datastax google-generative-ai langflow openai
Last synced: 05 Jan 2025
https://github.com/terror-1/scalable-apps
This is a demo of a massively scalable e-commerce website built using Docker Compose. It includes services such as a web server, database, caching layer, and load balancer, all orchestrated with Docker Compose for easy deployment and scaling.
cassandra docker docker-compose elasticsearch gatling java kafka maven postgressql redis spring-boot
Last synced: 30 Jan 2025
https://github.com/bluecube246/youtube-streaming-pipeline
Data pipeline that sends raw youtube data to the Kafka producer. The data is processed using Spark Streaming where the data is cleaned with sentiment analysis. Final output is saved to Cassandra
cassandra kafka sentiment-analysis spark-streaming youtube-api
Last synced: 21 Jan 2025
https://github.com/ndomah/realtime-data-streaming-of-random-user-data
End-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage.
apache-airflow apache-kafka apache-spark apache-zookeeper big-data cassandra containerization data-engineering data-pipeline data-processing data-storage docker etl-pipeline postgresql python real-time-analytics
Last synced: 31 Dec 2024
https://github.com/yosrak5/data-streaming
This project involves the development of a robust data engineering pipeline that orchestrates the seamless ingestion, processing, and storage of data .
airflow-dags apache cassandra docker etl kafka python spark
Last synced: 11 Dec 2024
https://github.com/vishalgattani/quixotic-kafka
Python Stream Processing for Apache Kafka, Spark, Cassandra.
cassandra cassandra-database docker kafka kafka-consumer kafka-producer kafka-streams quixstream ros ros-noetic spark-sql spark-streaming
Last synced: 11 Dec 2024
https://github.com/blugavere/cassandra-repository
cassandra persistence repositories repository repository-pattern
Last synced: 08 Feb 2025
https://github.com/pregismond/working-with-nosql-databases
Final Assignment Submission: Working with NoSQL Databases
cassandra coursera ibm-cloud ibm-cloudant ibm-skills-network mongodb nosql
Last synced: 21 Jan 2025
https://github.com/aykhans/oh-my-url
Simple url shortener implementation with go and postgresql / cassandra.
cassandra go postgresql url-shortener
Last synced: 21 Jan 2025
https://github.com/kurtosis-tech/cassandra-package
A Kurtosis Starlark Package that spins up a Cassandra Network
cassandra distributed-systems docker-compose kurtosis kurtosis-package
Last synced: 03 Nov 2024
https://github.com/nashtech-labs/triple-manipulation.g8
Stores the RDF Triple and Search the value of Object on the basis of Subject and Predicate.
cassandra rdf semantic-web triple-store triples
Last synced: 23 Dec 2024
https://github.com/intina47/hopper
e-commerce crawler
cassandra cpp curl gumbo webcrawler
Last synced: 23 Jan 2025
https://github.com/erikpelli/bigmetric
Scalable system to collect data from multiple temperature sensors using Spring Boot
cassandra cluster docker grafana java kafka microservices spring
Last synced: 23 Dec 2024
https://github.com/fcbento/cassandra-java
CRUD operations using Java, Spring Boot, Cassandra. Authentication and authorization. JWT
cassandra cassandra-database java spring-boot
Last synced: 09 Feb 2025
https://github.com/komodoooo/cqldump
A primitive cassandra dumper
cassandra cassandra-export cqldump
Last synced: 21 Jan 2025
https://github.com/aelesbao/yelp-dataset-challenge
apache-spark cassandra spark yelp-dataset
Last synced: 05 Jan 2025
https://github.com/rafaelsouzaribeiro/web-chat-websocket-in-golang
Web chat with WebSocket, Redis, and Cassandra, including notifications for logged-in and logged-out users, and emoji support, implemented in Go and JavaScript.
cassandra chat clean-architecture emojis golang javascrit login logout redis websocket
Last synced: 21 Jan 2025
https://github.com/daggerok/docker-files
This repository contains docker / docker-compose files examples
alpine cassandra dind docker docker-compose docker-in-docker dockerfile gitlab java jenkins mongo mysql nginx oracle postgres rabbitmq redis sonarqube spring-boot stomp
Last synced: 10 Jan 2025
https://github.com/daggerok/datastax-astra-db-spring-boot-app
Spring Data Cassandra + Datastax Astra DB
aastra-db astra cassandra cassandra-database datastax datastax-astra datastax-astra-db datastax-cassandra-driver spring-boot spring-data spring-data-astra spring-data-astra-db spring-data-cassandra spring-data-cassandra-astra spring-data-datastax
Last synced: 10 Jan 2025
https://github.com/daggerok/cassandra
Embedded Cassandra
cassandra cassandra-cluster cassandra-cql cassandra-database cassandra-java embedded embedded-cassandra embedded-database github-release-maven-plugin github-release-plugin github-releases jutzig
Last synced: 10 Jan 2025
https://github.com/mmncit/rucasrar
Simple Rails application using Cassandra NoSQL database
cassandra cassandra-database cequel rails5 rails5-app
Last synced: 09 Feb 2025
https://github.com/ytake/builderscon-example
this is a pen
cassandra example kafka php spark-streaming
Last synced: 06 Feb 2025
https://github.com/bousettayounes/real-time-user-data-streaming
Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system
airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming
Last synced: 09 Nov 2024
https://github.com/yaninyzwitty/llamaindex-astradb-quiz-app
A terminal quiz app for asking questions about the great gatsby
astradb cassandra cassandra-vector-db llamaindex nodejs tpescript
Last synced: 21 Jan 2025
https://github.com/olejek88/absorber
Crypto data acquisition and visualizing
cassandra highcharts nodejs socket-io
Last synced: 17 Jan 2025
https://github.com/dwoz/pytest-cassandra
pytest cassandra cluster fixture
cassandra ccm pytest-plugin python testing-tools
Last synced: 10 Jan 2025
https://github.com/naren-jha/betterreads
Cassandra demo app - BetterReads
cassandra cassandra-cql github-login spring-boot
Last synced: 24 Dec 2024
https://github.com/naren-jha/inbox-app
Cassandra demo app - An email application where millions of users can send emails/messages to one another
cassandra cassandra-cql github-login spring-boot
Last synced: 24 Dec 2024
https://github.com/naren-jha/betterread-data-loader
Spring boot app for loading data into remotely hosted Cassandra cluster on datastax
Last synced: 24 Dec 2024
https://github.com/saadkh1/real-time_sales_data_pipeline_kafa_spark_cassandra_redash
This repository implements a real-time sales data pipeline leveraging Apache Kafka, Apache Spark, Apache Cassandra, and Redash. It facilitates the efficient ingestion, processing, storage, and visualization of sales data streams.
cassandra fastapi kafka redash spark
Last synced: 21 Jan 2025
https://github.com/sumukhahe/click-event-analysis
The project is it capture , Monitor and analyze user click events on the e-commerce website, specifically focusing on instances where users explore product pages but do not complete purchases.
cassandra kafka python3 scala spark-sql
Last synced: 21 Jan 2025
https://github.com/bartekbh/akka-shop
REST API with event sourcing for online shop written in Scala and Akka
akka akka-http akka-persistence cassandra event-sourcing scala
Last synced: 21 Jan 2025
https://github.com/smatiolids/aws-glue-astra-loader
How to load data from AWS S3 to AstraDB/Cassandra using AWS GLue
astradb aws-glue cassandra pyspark
Last synced: 21 Jan 2025
https://github.com/guilhermezuriel/reduce.me
URL shortener service
cassandra docker spring-boot tailwindcss thymeleaf
Last synced: 30 Jan 2025
https://github.com/aditya1191/cloud-data-engineering
ETL Python files for loading data to cloud
airflow aws aws-s3 cassandra datalake-etl lambda postgresql redshift snowflake
Last synced: 30 Jan 2025
https://github.com/shrikantnaidu/data-engineering-by-udacity
Data Engineering Nanodegree Course Content
apache-airflow apache-spark aws aws-redshift cassandra data-engineering postgresql udacity-nanodegree
Last synced: 10 Jan 2025
https://github.com/shinigami92/dse-gremlin-schema-migrator
DataStax gremlin schema-migrator
cassandra database database-schema datastax datastax-enterprise dse gremlin java kotlin migration-tool schema-migration
Last synced: 26 Dec 2024
https://github.com/navicore/akka-http-phantom.g8
A giter8 generator for a working Akka HTTP API server persisting to Cassandra with the Phantom DSL
akka akka-http cassandra giter8 giter8-template phantom-dsl
Last synced: 26 Dec 2024
https://github.com/michelderu/wikipedia-streamlit
Real-time enterprise grade RAG pipeline using Pulsar and Cassandra (with Astra Streaming and Astra DB, named as a Leader in the Forrester Wave for Vector DBs)
astra astradb cassandra enterprise pulsar
Last synced: 23 Oct 2024
https://github.com/dedpixta/distributed-computing
microservice application for tweet storage and processing
cassandra kafka postrgesql redis
Last synced: 21 Jan 2025
https://github.com/tashi-2004/db
I've created files with solutions, named them with their following conventions and order. You can download, copy, and run them on a compiler or software for your information. There are no copyrights attached to these files; they are provided for educational purposes only.
cassandra database database-management eer-diagram erdiagram keys mariadb mysql oracle-database sql
Last synced: 26 Dec 2024
https://github.com/nfo94/infraestrutura-cassandra-pd
cassandra jupyter python spark
Last synced: 26 Dec 2024
https://github.com/vermicida/data-modeling-cassandra
Data Modeling with Cassandra, the code corresponding the project #2 of the Udacity's Data Engineer Nanodegree Program
cassandra data-engineering data-modeling etl-pipeline python
Last synced: 26 Dec 2024
https://github.com/vishalbansal28/end-to-end-realtime-data-streaming
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
apache-airflow apache-kafka apache-spark apache-zookeeper big-data cassandra containerization data-engineering data-pipeline data-processing data-storage docker etl-pipeline postgresql real-time-analytics
Last synced: 23 Jan 2025
https://github.com/instaclustr/ccm-java8
CCM extension that starts Cassandra (and related tools) under Java 8
cassandra ccm-extension netapp-public
Last synced: 02 Jan 2025
https://github.com/michelderu/wikipedia-pulsar-astra
Real-time enterprise grade RAG pipeline using Pulsar and Cassandra (with Astra Streaming and Astra DB, named as a Leader in the Forrester Wave for Vector DBs)
astra astradb cassandra enterprise pulsar
Last synced: 23 Oct 2024
https://github.com/mikeacosta/data-model-cassandra
Data modeling and ETL pipeline using Apache Cassandra
cassandra data-model etl jupyter-notebook python
Last synced: 10 Jan 2025
https://github.com/derder3010/django-cockroach-astra
Django Project with Django REST Framework, SimpleJWT, Cockroachlabs, Astra Cassandra, Redis, and R2 Cloudflare
astra cassandra cloudflare cockroach django django-rest-framework python
Last synced: 21 Jan 2025
https://github.com/musale/demo-go-cassandra
Golang and Cassandra db
cassandra cassandra-database demo golang
Last synced: 02 Feb 2025
https://github.com/dekelev/feathers-service-tests-cassandra
A test harness for Feathers service implementations with Cassandra DB
cassandra database db feathers feathersjs service tests testsuites
Last synced: 04 Jan 2025
https://github.com/skngetich/teamcloud-setup
This is setup for teamcloud using docker
Last synced: 21 Jan 2025
https://github.com/stargate/dynamodb-adapter-example
Example project for cassandra-dynamoDB-adapter to demonstrate usage
adapter cassandra dynamodb stargate
Last synced: 11 Jan 2025
https://github.com/captainirs/hyper-office-server
HyperOffice - A smart document management system. Winning entry for PS RK795 of Smart India Hackathon 2022
cassandra hyperledger-fabric ipfs mantine-ui sih2022
Last synced: 11 Jan 2025
https://github.com/vitalvas/cassandra-redis-proxy
Redis Server with Cassandra as storage backend
cassandra redis redis-proxy redis-proxy-service redis-server
Last synced: 19 Nov 2024
https://github.com/rupeshtr78/airflow_pipeline
Airflow Pipeline streaming data kafka to cassandra
Last synced: 12 Jan 2025
https://github.com/rupeshtr78/mqttspark
IOT Device MQTT Spark Streaming
cassandra gpio iot mqtt mqtt-broker mqtt-client raspberry-pi spark spark-streaming yarn
Last synced: 12 Jan 2025
https://github.com/rupeshtr78/blog
Big Data Spark Hadoop Kafka Flink Spark Streaming
aws bigdata cassandra elasticsearch emr-cluster flink hadoop hive hue kafka mapreduce mongodb oozie spark sparkstreaming yarn
Last synced: 12 Jan 2025
https://github.com/moritzrinow/dockerarch
Collection of scripts and compose files to run common services and architectures with docker
cassandra docker docker-compose elastic kafka
Last synced: 18 Jan 2025
https://github.com/shrikantnaidu/data-modeling-with-cassandra
Data Modeling with Cassandra
cassandra data-modeling etl-pipeline
Last synced: 21 Jan 2025
https://github.com/aliartiza75/livecassandrareader
A script to read data from the Cassandra table in real time.
cassandra cassandra-database cassandra-reader cassandra-table python3
Last synced: 19 Jan 2025
https://github.com/atlasoflivingaustralia/cmigrate
Tool for migrating between cassandra clusters
ala-product-biocache cassandra data-migration
Last synced: 19 Jan 2025
https://github.com/bibhushankarki/appdev-week3-graphql
Netflix clone from DataStax workshop.
astradb cassandra datastax graphql netflix-clone reactjs
Last synced: 19 Jan 2025
https://github.com/anant/example-redash-and-cassandra
business-intelligence cassandra dashboards nosql redash
Last synced: 19 Jan 2025
https://github.com/james-leste/big-data-platform
This is a course project focusing on designing, implementing and operating a big data platform
cassandra javascript mongodb python
Last synced: 21 Jan 2025
https://github.com/anant/example-azure-cassandra-proxy
Learn how to use the Azure Dual Write Cassandra Proxy
Last synced: 19 Jan 2025
https://github.com/anant/example-cassandra-terraform-astra-provider
cassandra datastax datastax-astra gitpod terraform
Last synced: 19 Jan 2025
https://github.com/bujowskis/put-bd-project
Distributed system for library management
cassandra docker docker-compose flask
Last synced: 09 Feb 2025
https://github.com/anant/example-cassandra-instaclustr
api cassandra instaclustr javascript nextjs react
Last synced: 19 Jan 2025
https://github.com/mhio/casserole
:stew: Casserole - Cassandra object mapper for Node.js
cassandra nodejs npm-module objectmapper orm
Last synced: 19 Jan 2025
https://github.com/manenko/cassander
Cassandra driver for Rust which utilizes the DataStax C/C++ driver
cassandra cassandra-driver database-driver rust
Last synced: 21 Jan 2025
https://github.com/ammarnajjar/docker-kong
Docker compose configurations for Kong with cassandra
cassandra docker docker-compose kong
Last synced: 31 Jan 2025
https://github.com/pixelcaliber/sentinel
A Complete, Highly Scalable and Powerful Chat Application facilitating messaging between individuals for seamless communication.
cassandra chat-application firebase firebase-notifications flask jwt-authentication kafka nginx postgresql python reactjs redis socket-io workers
Last synced: 15 Jan 2025
https://github.com/dina-hosny/data-engineering-capstone-project
Data Engineering Capstone Project - Udacity Data Engineering Expert Track.
analytics cassandra data-engineering data-pipelines data-science etl fwd spark udacity
Last synced: 13 Jan 2025
https://github.com/mikma03/databases
Main purpose of this repository is to generate knowledge about databases in general view.
cassandra graphql hadoop mongodb msql neo4j newsql nosql oracle-database postgresql redis sql
Last synced: 09 Jan 2025
https://github.com/niravpatel27/cassandra-operator-workshop
cassandra golang kubernetes kubernetes-operator minikube
Last synced: 04 Feb 2025
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 13 Jan 2025
https://github.com/billxsheng/oubre-sentiment-analysis
Complete data platform that performs sentiment analysis on tweets. Built using Cassandra, Kafka, Spark, Node, and React.
cassandra etl-pipeline java kafka nodejs sentiment-analysis spark twitter-api
Last synced: 17 Jan 2025
https://github.com/bousettayounes/real-time-processing-of-users-data
Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system
airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming
Last synced: 05 Jan 2025