Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Apache Cassandra

Apache Cassandra is a free, open source, distributed, wide column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

https://github.com/mvharsh/big-data

This repository contains all my Big Data files

cassandra database mongodb neo4j oracle-database weka

Last synced: 13 Feb 2025

https://github.com/duyledat197/messenger

Design a messenger platform that can serve for around more than 100M users. The platform supports web and mobile apps(android, ios).

cassandra clean-code golang grpc grpc-ecosystem grpc-gateway mqtt opensearch postgresql protoc redis scylladb swagger webrtc websocket

Last synced: 23 Jan 2025

https://github.com/sanogotech/docker-airflowsparkkafkadata-engineeringend-to-end

Docker Apache Airflow Data Engineering End-to-End Project — Spark, Kafka, Airflow, Docker, Cassandra, Python

airflow cassandra cassandra-database dataengineering docker kafka python spark

Last synced: 23 Jan 2025

https://github.com/saurabhkumarr99/reader-s_haven

Reader's Haven is an online BookStore . This is developed in Spring Boot (Java 17) , Postgress Db ,Cassandra DB , Redis Catch.

cassandra jwt-token kafka postgresql redis spring-boot spring-security

Last synced: 21 Jan 2025

https://github.com/pixelcaliber/chat-app-message-service

chat-application messaging service: Enables one-to-one messaging using websockets, load balanced using nginx and uses redis cluster for caching

cassandra cql-queries flask-application messa python redis socket-io

Last synced: 21 Jan 2025

https://github.com/ankitjaadoo/web-scraping-with-python-fastapi-celery-nosql

Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.

astradb cassandra cassandra-driver celery fastapi nosql-databases python python3 requests-html scheduled-tasks

Last synced: 21 Jan 2025

https://github.com/ilieschibane/projet-iot-cloud-bigdata

Implémentation d'une pipeline permettant de faire la prédiction de la maladie de parkinson via des outils d'IoT, Cloud, et Big Data

big-data cassandra cloud flask hadoop-hdfs iot kafka machine-learning mongodb mqtt python rest-api sickit-learn spark

Last synced: 21 Jan 2025

https://github.com/akhich551995/data-streaming-project-airflow-kafka-spark-t-cassandra-docker

building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.

airflow airflow-dags cassandra docker kafka postgresql python spark zookeeper

Last synced: 21 Jan 2025

https://github.com/himanshuchopade97/socialmediaengagementanalysis

Analyze social media data effortlessly with DataStax AstraDB and LangFlow. This project integrates scalable cloud-based storage with AI-driven workflows using LangChain, OpenAI, and Google GenAI to uncover performance insights and trends. Ideal for analysts, marketers, and AI enthusiasts.

astradb cassandra datastax google-generative-ai langflow openai

Last synced: 05 Jan 2025

https://github.com/terror-1/scalable-apps

This is a demo of a massively scalable e-commerce website built using Docker Compose. It includes services such as a web server, database, caching layer, and load balancer, all orchestrated with Docker Compose for easy deployment and scaling.

cassandra docker docker-compose elasticsearch gatling java kafka maven postgressql redis spring-boot

Last synced: 30 Jan 2025

https://github.com/bluecube246/youtube-streaming-pipeline

Data pipeline that sends raw youtube data to the Kafka producer. The data is processed using Spark Streaming where the data is cleaned with sentiment analysis. Final output is saved to Cassandra

cassandra kafka sentiment-analysis spark-streaming youtube-api

Last synced: 21 Jan 2025

https://github.com/yosrak5/data-streaming

This project involves the development of a robust data engineering pipeline that orchestrates the seamless ingestion, processing, and storage of data .

airflow-dags apache cassandra docker etl kafka python spark

Last synced: 11 Dec 2024

https://github.com/pregismond/working-with-nosql-databases

Final Assignment Submission: Working with NoSQL Databases

cassandra coursera ibm-cloud ibm-cloudant ibm-skills-network mongodb nosql

Last synced: 21 Jan 2025

https://github.com/aykhans/oh-my-url

Simple url shortener implementation with go and postgresql / cassandra.

cassandra go postgresql url-shortener

Last synced: 21 Jan 2025

https://github.com/kurtosis-tech/cassandra-package

A Kurtosis Starlark Package that spins up a Cassandra Network

cassandra distributed-systems docker-compose kurtosis kurtosis-package

Last synced: 03 Nov 2024

https://github.com/nashtech-labs/triple-manipulation.g8

Stores the RDF Triple and Search the value of Object on the basis of Subject and Predicate.

cassandra rdf semantic-web triple-store triples

Last synced: 23 Dec 2024

https://github.com/intina47/hopper

e-commerce crawler

cassandra cpp curl gumbo webcrawler

Last synced: 23 Jan 2025

https://github.com/erikpelli/bigmetric

Scalable system to collect data from multiple temperature sensors using Spring Boot

cassandra cluster docker grafana java kafka microservices spring

Last synced: 23 Dec 2024

https://github.com/fcbento/cassandra-java

CRUD operations using Java, Spring Boot, Cassandra. Authentication and authorization. JWT

cassandra cassandra-database java spring-boot

Last synced: 09 Feb 2025

https://github.com/komodoooo/cqldump

A primitive cassandra dumper

cassandra cassandra-export cqldump

Last synced: 21 Jan 2025

https://github.com/rafaelsouzaribeiro/web-chat-websocket-in-golang

Web chat with WebSocket, Redis, and Cassandra, including notifications for logged-in and logged-out users, and emoji support, implemented in Go and JavaScript.

cassandra chat clean-architecture emojis golang javascrit login logout redis websocket

Last synced: 21 Jan 2025

https://github.com/mmncit/rucasrar

Simple Rails application using Cassandra NoSQL database

cassandra cassandra-database cequel rails5 rails5-app

Last synced: 09 Feb 2025

https://github.com/bousettayounes/real-time-user-data-streaming

Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system

airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming

Last synced: 09 Nov 2024

https://github.com/yaninyzwitty/llamaindex-astradb-quiz-app

A terminal quiz app for asking questions about the great gatsby

astradb cassandra cassandra-vector-db llamaindex nodejs tpescript

Last synced: 21 Jan 2025

https://github.com/olejek88/absorber

Crypto data acquisition and visualizing

cassandra highcharts nodejs socket-io

Last synced: 17 Jan 2025

https://github.com/dwoz/pytest-cassandra

pytest cassandra cluster fixture

cassandra ccm pytest-plugin python testing-tools

Last synced: 10 Jan 2025

https://github.com/naren-jha/betterreads

Cassandra demo app - BetterReads

cassandra cassandra-cql github-login spring-boot

Last synced: 24 Dec 2024

https://github.com/naren-jha/inbox-app

Cassandra demo app - An email application where millions of users can send emails/messages to one another

cassandra cassandra-cql github-login spring-boot

Last synced: 24 Dec 2024

https://github.com/naren-jha/betterread-data-loader

Spring boot app for loading data into remotely hosted Cassandra cluster on datastax

cassandra java spring-boot

Last synced: 24 Dec 2024

https://github.com/saadkh1/real-time_sales_data_pipeline_kafa_spark_cassandra_redash

This repository implements a real-time sales data pipeline leveraging Apache Kafka, Apache Spark, Apache Cassandra, and Redash. It facilitates the efficient ingestion, processing, storage, and visualization of sales data streams.

cassandra fastapi kafka redash spark

Last synced: 21 Jan 2025

https://github.com/sumukhahe/click-event-analysis

The project is it capture , Monitor and analyze user click events on the e-commerce website, specifically focusing on instances where users explore product pages but do not complete purchases.

cassandra kafka python3 scala spark-sql

Last synced: 21 Jan 2025

https://github.com/bartekbh/akka-shop

REST API with event sourcing for online shop written in Scala and Akka

akka akka-http akka-persistence cassandra event-sourcing scala

Last synced: 21 Jan 2025

https://github.com/willfaught/cuckle

CQL syntax builder

cassandra cql go golang

Last synced: 25 Dec 2024

https://github.com/smatiolids/aws-glue-astra-loader

How to load data from AWS S3 to AstraDB/Cassandra using AWS GLue

astradb aws-glue cassandra pyspark

Last synced: 21 Jan 2025

https://github.com/navicore/cassandra

cassandra docker image to run in k8s

cassandra dockerfile

Last synced: 26 Dec 2024

https://github.com/navicore/akka-http-phantom.g8

A giter8 generator for a working Akka HTTP API server persisting to Cassandra with the Phantom DSL

akka akka-http cassandra giter8 giter8-template phantom-dsl

Last synced: 26 Dec 2024

https://github.com/michelderu/wikipedia-streamlit

Real-time enterprise grade RAG pipeline using Pulsar and Cassandra (with Astra Streaming and Astra DB, named as a Leader in the Forrester Wave for Vector DBs)

astra astradb cassandra enterprise pulsar

Last synced: 23 Oct 2024

https://github.com/dedpixta/distributed-computing

microservice application for tweet storage and processing

cassandra kafka postrgesql redis

Last synced: 21 Jan 2025

https://github.com/tashi-2004/db

I've created files with solutions, named them with their following conventions and order. You can download, copy, and run them on a compiler or software for your information. There are no copyrights attached to these files; they are provided for educational purposes only.

cassandra database database-management eer-diagram erdiagram keys mariadb mysql oracle-database sql

Last synced: 26 Dec 2024

https://github.com/vermicida/data-modeling-cassandra

Data Modeling with Cassandra, the code corresponding the project #2 of the Udacity's Data Engineer Nanodegree Program

cassandra data-engineering data-modeling etl-pipeline python

Last synced: 26 Dec 2024

https://github.com/vishalbansal28/end-to-end-realtime-data-streaming

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

apache-airflow apache-kafka apache-spark apache-zookeeper big-data cassandra containerization data-engineering data-pipeline data-processing data-storage docker etl-pipeline postgresql real-time-analytics

Last synced: 23 Jan 2025

https://github.com/ecomplus/webhooks-queue

Service to store and run webhooks with Node.js and Cassandra

cassandra express nodejs queue rest-api webhooks

Last synced: 26 Dec 2024

https://github.com/instaclustr/ccm-java8

CCM extension that starts Cassandra (and related tools) under Java 8

cassandra ccm-extension netapp-public

Last synced: 02 Jan 2025

https://github.com/michelderu/wikipedia-pulsar-astra

Real-time enterprise grade RAG pipeline using Pulsar and Cassandra (with Astra Streaming and Astra DB, named as a Leader in the Forrester Wave for Vector DBs)

astra astradb cassandra enterprise pulsar

Last synced: 23 Oct 2024

https://github.com/mikeacosta/data-model-cassandra

Data modeling and ETL pipeline using Apache Cassandra

cassandra data-model etl jupyter-notebook python

Last synced: 10 Jan 2025

https://github.com/derder3010/django-cockroach-astra

Django Project with Django REST Framework, SimpleJWT, Cockroachlabs, Astra Cassandra, Redis, and R2 Cloudflare

astra cassandra cloudflare cockroach django django-rest-framework python

Last synced: 21 Jan 2025

https://github.com/musale/demo-go-cassandra

Golang and Cassandra db

cassandra cassandra-database demo golang

Last synced: 02 Feb 2025

https://github.com/dekelev/feathers-service-tests-cassandra

A test harness for Feathers service implementations with Cassandra DB

cassandra database db feathers feathersjs service tests testsuites

Last synced: 04 Jan 2025

https://github.com/skngetich/teamcloud-setup

This is setup for teamcloud using docker

cassandra docker

Last synced: 21 Jan 2025

https://github.com/stargate/dynamodb-adapter-example

Example project for cassandra-dynamoDB-adapter to demonstrate usage

adapter cassandra dynamodb stargate

Last synced: 11 Jan 2025

https://github.com/captainirs/hyper-office-server

HyperOffice - A smart document management system. Winning entry for PS RK795 of Smart India Hackathon 2022

cassandra hyperledger-fabric ipfs mantine-ui sih2022

Last synced: 11 Jan 2025

https://github.com/pabmonrol/cassandra

Cassandra Database

cassandra python

Last synced: 12 Jan 2025

https://github.com/vitalvas/cassandra-redis-proxy

Redis Server with Cassandra as storage backend

cassandra redis redis-proxy redis-proxy-service redis-server

Last synced: 19 Nov 2024

https://github.com/rupeshtr78/airflow_pipeline

Airflow Pipeline streaming data kafka to cassandra

airflow cassandra kafka

Last synced: 12 Jan 2025

https://github.com/moritzrinow/dockerarch

Collection of scripts and compose files to run common services and architectures with docker

cassandra docker docker-compose elastic kafka

Last synced: 18 Jan 2025

https://github.com/aliartiza75/livecassandrareader

A script to read data from the Cassandra table in real time.

cassandra cassandra-database cassandra-reader cassandra-table python3

Last synced: 19 Jan 2025

https://github.com/atlasoflivingaustralia/cmigrate

Tool for migrating between cassandra clusters

ala-product-biocache cassandra data-migration

Last synced: 19 Jan 2025

https://github.com/james-leste/big-data-platform

This is a course project focusing on designing, implementing and operating a big data platform

cassandra javascript mongodb python

Last synced: 21 Jan 2025

https://github.com/anant/example-azure-cassandra-proxy

Learn how to use the Azure Dual Write Cassandra Proxy

azure cassandra docker proxy

Last synced: 19 Jan 2025

https://github.com/bujowskis/put-bd-project

Distributed system for library management

cassandra docker docker-compose flask

Last synced: 09 Feb 2025

https://github.com/mhio/casserole

:stew: Casserole - Cassandra object mapper for Node.js

cassandra nodejs npm-module objectmapper orm

Last synced: 19 Jan 2025

https://github.com/manenko/cassander

Cassandra driver for Rust which utilizes the DataStax C/C++ driver

cassandra cassandra-driver database-driver rust

Last synced: 21 Jan 2025

https://github.com/ammarnajjar/docker-kong

Docker compose configurations for Kong with cassandra

cassandra docker docker-compose kong

Last synced: 31 Jan 2025

https://github.com/pixelcaliber/sentinel

A Complete, Highly Scalable and Powerful Chat Application facilitating messaging between individuals for seamless communication.

cassandra chat-application firebase firebase-notifications flask jwt-authentication kafka nginx postgresql python reactjs redis socket-io workers

Last synced: 15 Jan 2025

https://github.com/axonops/axonops-workbench-containers

A project for generating the container images used in AxonOps™ Workbench

ami axonops axonops-workbench cassandra docker kafka packer podman vm wireguard

Last synced: 28 Jan 2025

https://github.com/dina-hosny/data-engineering-capstone-project

Data Engineering Capstone Project - Udacity Data Engineering Expert Track.

analytics cassandra data-engineering data-pipelines data-science etl fwd spark udacity

Last synced: 13 Jan 2025

https://github.com/mikma03/databases

Main purpose of this repository is to generate knowledge about databases in general view.

cassandra graphql hadoop mongodb msql neo4j newsql nosql oracle-database postgresql redis sql

Last synced: 09 Jan 2025

https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra

Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.

cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python

Last synced: 13 Jan 2025

https://github.com/ridwanbejo/terraform-cassandra-admin

Terraform module for managing Cassandra role, grant and database

acl apache automation cassandra database devops hashicorp hcl iac iam rbac sysadmin terraform

Last synced: 08 Jan 2025

https://github.com/billxsheng/oubre-sentiment-analysis

Complete data platform that performs sentiment analysis on tweets. Built using Cassandra, Kafka, Spark, Node, and React.

cassandra etl-pipeline java kafka nodejs sentiment-analysis spark twitter-api

Last synced: 17 Jan 2025

https://github.com/bousettayounes/real-time-processing-of-users-data

Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system

airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming

Last synced: 05 Jan 2025