Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Apache Cassandra
Apache Cassandra is a free, open-source, distributed, wide-column NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.
- GitHub: https://github.com/topics/cassandra
- Wikipedia: https://en.wikipedia.org/wiki/Apache_Cassandra
- Repo: https://github.com/apache/cassandra
- Created by: Apache Software Foundation
- Released: July 2008
- Related Topics: language, dotnet
- Aliases: apache-cassandra
- Last updated: 2025-02-05 00:04:38 UTC
https://github.com/deathhunterx/simple-datapipeline
A simple data pipeline using Apache Kafka, Cassandra and Jupyter Notebook
apache-kafka cassandra jupyter-notebook
Last synced: 16 Dec 2024
https://github.com/bousettayounes/real-time-processing-of-users-data
Developing a data pipeline to stream user data from a user generator API, apply necessary transformations, and seamlessly insert the processed data into a storage system
airflow cassandra dataengineering datastreaming docker kafka postgresql spark streaming
Last synced: 05 Jan 2025
https://github.com/mkorangestripe/platform
Automation, build, and performance testing utilities
apache-tomcat cassandra python spinnaker vsphere
Last synced: 20 Jan 2025
https://github.com/ndomah/realtime-data-streaming-of-random-user-data
End-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage.
apache-airflow apache-kafka apache-spark apache-zookeeper big-data cassandra containerization data-engineering data-pipeline data-processing data-storage docker etl-pipeline postgresql python real-time-analytics
Last synced: 31 Dec 2024