Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-cassandra
A curated list of the best resources in the Cassandra community.
https://github.com/Anant/awesome-cassandra
Last synced: about 8 hours ago
JSON representation
-
Resources
-
Books
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
- Expert Apache Cassandra Administration
-
Blogs
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- Datastax - DataStax, Inc. is a data management company that provides commercial support, software, and cloud database-as-a-service based on Cassandra.
- Instaclustr - Managed and supported open source solutions for Cassandra, Kafka, Elasticsearch & Redis.
- DOAN DuyHai's Blog: Cassandra - Duyhai Doan is a freelance big data and cloud architect who values sharing knowledge and contributing to the technology community.
- Amy Tobert - Amy Tobert is a full-stack engineer & leader with passion for sustainable systems and people-centered leadership. Her blog details different Cassandra deployments amont other topics.
- Distributed Bytes: Cassandra - Tim Ojo is the creator of Distributed Bytes and software engineer at Capital one. These are a collection of his posts surrounding the topic of Cassandra.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- Spotify R&D / Engineering Blog : Cassandra - Cassandra related posts on Spotify's official technology blog.
- Ryan Svilha - Ryan Svilha is a principle engineer at DataStax. His blog posts covers topics surround Cassandra and associated tools.
- Anant - Anant builds and manages business platforms of which they connect customer experiences and information systems with real-time data platforms.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
- The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
-
Communities
- Cassandra Slack
- Stack Overflow: Cassandra
- Stack Overflow: cql
- Stack Overflow: spark-cassandra-connector
- Stack Overflow: Astra DataStax - ASP.NET Core - Answered question regarding connecting DataStax Astra and an ASP.NET Core API published to Microsoft Azure.
- Quora: Cassandra
- Meetups: Cassandra
-
Videos
- Best Practices for Running Cassandra on AWS - Joint webinar between Amazon Web Services (AWS) and Stackdriver, an AWS Technology partner, to learn best practices that apply to storing, analyzing and managing queries that equate to over 1+ billion measurements a day.
- Monitoring Cassandra: Don't Miss a Thing (Alain Rodriguez, The Last Pickle) | C* Summit 2016 - Talk given by Alain Rodriguez, Consultant at The Last Pickle, discussing what to monitor in Cassandra, how, and why.
- Working with .NET and Cassandra/DataStax Enterprise - Getting a C# .NET core application started to work against a Cassandra or DSE database.
-
Slides
- Cassandra DataTables Using Restful API - How to create a performant API using Python / Flash.
- GumGum: Multi-Region Cassandra in AWS - Presentation that details how Gumgum scaled out from one local Cassandra datacenter to a multi-datacenter Cassandra cluster and all the problems they encountered and choices they made while implementing it.
- Hardening Cassandra for Compliance or Paranoia - Includes details on configuring SSL, setting up a certificate authority and creating certificates and trust chains for the JVM.
- Securing Cassandra - Ben Bromhead CTO of Instaclustr, will explore the various ways in which you can setup and secure Cassandra appropriately for your threat environment.
- Tuning the Spark Cassandra Connector - Slides by Russell Spitzer maintainer of the Spark Cassandra connector.
-
-
General
-
Cassandra Use Cases
- Kaa application based on Raspberry Pi and DHT11 sensor - Cassandra IoT usecase with Raspberry Pi and a DHT11 Sensor.
- Simple Node.js Express 4 Cassandra Application - MySubscribers is a very simple application (Start of an application) which allows you to create, read, update and delete users/subscribers. This application was only created to aid the YouTube course.
-
Cassandra Distributions
- YugaByte Database - YugaByteDB is a transactional, high-performance database for building distributed cloud services. It supports Cassandra-compatible and Redis-compatible APIs, with PostgreSQL in Beta.
- DDAC/Luna - Datastax Distribution of Cassandra, a production ready distribution with a bulk loader supported by Datastax. DDAC is Deprecated now, but Datastax is still supporting Cassandra with it's new Luna Service.
- ScyllaDB - NoSQL data store using the seastar framework, compatible with Cassandra.
- Instaclustr Managed Cassandra as a Service - Instaclustr provides a fully managed and SOC 2 certified hosted & managed service for Cassandra® on AWS, Azure, GCP and IBM Cloud.
- DataStax Enterprise - Most widely used commercial distribution of Cassandra, integrated with Apache Spark (for SparkSQL, analytics), Apache Solr (for secondary index), Apache TinkerPop based Graph stored in Cassandra, and OpsCenter.
-
Using Cassandra
- Cassandra Data Copy Tool - Java tool to copy data from one cassandra table to another.
- Spring Data Cassandra Examples - Examples for the Spring Data Cassandra Project.
- Import CSV files with spark - How to import a file from S3 into cassandra using Spark.
- Installing the Cassandra / Spark OSS Stack - Installation process and user guide for the Cassandra / Spark OSS Stack.
- The Cassandra Query Language - Documentation for CQL.
- The LIMIT Clause in Cassandra might not work as you think - Blog post for the considerations on the efficiency of the LIMIT clause.
- Building a Performant API using Go and Cassandra - Tutorial documenting how to build a RESTful API using Go and Cassandra.
- From Cassandra to S3, with Spark - Blog post showing how to connect Spark to Cassandra, analyze event data from Cassandra, and store the results of the analysis into S3, making it available for reporting or further analysis.
- Cloud DevOps with Cassandra - Using Packer, Ansible/SSH and AWS command line tools to create and DBA manage EC2 Cassandra instances in AWS.
-
Cassandra Data Modeling
- killrvideo-sample-schema - Sample Cassandra CQL Schema for a YouTube clone.
- Scalar DB - Transaction library for Cassandra that makes non-ACID distributed databases/storages ACID-compliant.
- CQL: This is not the SQL you are Looking For - Presentation that explores and explains the differences between the CQL and SQL languages.
- Spring Data Cassandra Examples - Maven project that contains examples showcasing the features and functionality of the Spring Data Cassandra project.
- Common Problems in Cassandra Data Models - Presentation and Article on wide partions, tombstones, and data skew.
- Casandra Time Series Data Modeling for Massive Scale - Blog post discussing a common Cassandra data modeling technique called bucketing.
- Cassandra Data Modeling Notes - Simple notes on how to estimate the size of your cluster.
-
Cassandra Maintenance
- Cassandra backup util - Instaclustr's cassandra backup tool.
- Cassy - Simple and integrated backup tool for Cassandra.
- Medusa - Cassandra backup system.
- Running commands cluster-wide without any management tool - Some tips and tricks to do basic Cluster operations without tools like Chef, Ansible, or Salt.
- Limiting Nodetool Parallel Threads - Little known tool to do nodetool operations with less resources.
- Bootstrapping Cassandra Nodes - Indepth article on how to add nodes to a running Cassandra cluster.
- Node Replacement without Bootstrapping - How to avoid the long bootstrapping process.
- Cassandra Backup and Restore - Backup in AWS using EBS Volumes - Indepth article about Backup and recovery in AWS.
- Intro to CStar - Tutorial on how to use CStar.
-
Cassandra History
- ZDNet: Cassandra Turns 10 - Highlights of the growth of Cassandra over it's first 10 years.
-
Cassandra from Relational
- RDBMS to NoSQL - Your roadmap to understanding whether NoSQL is right for you.
- Cassandra Query Language: CQL vs SQL - Blog post documenting similarities and differences between CQL and SQL.
- Real-Time Replication from MySQL to Cassandra - Demonstration of migrating data from MySQL to Cassandra.
- Cassandra Tutorial for Beginners - Lesson plan for users just starting out with Cassandra.
-
Cassandra Architecture
- The Gossip Protocol - Inside Cassandra. - Good visual explanation of how Cassandra keeps consistent.
- Introduction To The Cassandra 3.x Storage Engine - The 3.x storage engine makes it easier for Cassandra to get bytes off disk.
- Dropping columns in Cassandra 3.0 - Blog post describing the steps Cassandra takes when a column is dropped.
- Hinted Handoff and GC Grace Demystified - Tuning the balance between GC Grace and Hinted Handoff.
- Deletes and Tombstones - Explains how deletes create tombstones in Cassandra and what they are.
- About Deletes and Tombstones in Cassandra - Deleting distributed and replicated data from a system such as Cassandra is far trickier than in a relational database.
- Null bindings on prepared statements and undesired tombstone creation - Good follow up to the last article on Tombstones.
- Undetectable tombstones in Cassandra - Indepth analysis of cell and range tombstones.
- Common Problems with Cassandra Tombstones - Large number of tombstones causes Latency and heap pressure.
- Curious Case of Tombstones - How someone dealt with tombstone issues and reclaimed space in their cluster.
- Understanding the Nuance of Compaction in Cassandra - Overview of how Cassandra manages data on disk.
- Cassandra Architecture and Operations - High level overview in one page of how Cassandra works.
- Improving Cassandra's Front Door and Backpressure - Explore how an incoming request was processed by Cassandra before, see what they changed, and look at new relevant configuration knobs available.
-
Cassandra Monitoring
- Resources for Monitoring Datastax, Cassandra, Spark, & Solr Performance - Blog post detailing different types of monitoring tools and their purpose.
- How to Monitor Cassandra - Guide to help you monitor Cassandra performance and work metrics regardles of which monitoring tool you choose to use.
- Cassandra metrics and their use in Grafana - Case study of using Cassandra metrics in Grafana.
- Cassandra Monitoring - Introduction (1/2) - Blog post detailing how Cassandra metrics can be gathered.
- Cassandra Monitoring - Graphite/InfluxDB & Grafana on Docker (2/2) - Continuation of the previous entry exploring the topic of Cassandra metric reporters mentioned in Part I. The goal is to configure a reporter that sends metrics to an external time series database.
- Monitoring Cassandra using Intel Snap and Grafana - Blog post describing how to monitor Cassandra using the Intel Snap open source telemetry framework.
-
Cassandra Performance Tuning
- Amy's Cassandra 2.1 tuning guide - Guide to tracking down performance issues in production level Cassandra clusters.
- Secret HotSpot option improving GC pauses on large heaps
- Analyzing Cassandra Performance with Flame Graphs - Visually examining Cassandra performance visually using Flamegraphs.
- Garbage Collection Tuning for Cassandra - Optimizing garbage collection for better performance.
- TWCS part 1 - how does it work and when should you use it? - Best suited for time series data that expires, Time Window Compaction Strategy comes with some caveats.
- Performing User Defined Compactions in Cassandra - Documenting a process by which we tell Cassandra to create a compaction task for one or more tables explicitly.
- Graphing cassandra-stress - Benchmarking schemas and configuration changes using the cassandra-stress tool, before pushing such changes out to production is one of the things every Cassandra developer should know and regularly practice.
- Modeling real life workloads with cassandra-stress is hard - Blog post detailing caveats with cassandra-stress when modeling real workloads.
- A Deeper Dive - Diagnosing DSE Performance Issues with Ttop and Multidump - A good review of how to look deeper into Cassandra threads. -->
-
Cassandra Security
- Hardening Cassandra Step by Step: Part 1 - Inter-Node Encryption (And a Gentle Intro to Certificates).
- Encrypting EC2 ephemeral volumes with LUKS and AWS KMS - The example used here is Cassandra data stored on ephemeral disks.
-
Cassandra Deployment
- How To Setup A Highly Available Multi-AZ Cassandra Cluster On AWS EC2
- Setting Up Cassandra Cluster Through Ansible - Guide detailing how to set up a Cassandra cluster with automation using Ansible.
- Running Cassandra on DC/OS (Mesos) - Blog that shows how to setup DC/OS in the Amazon cloud, how to install Cassandra on a DC/OS cluster, and finally new ways to interact with and Cassandra after it is installed.
- Docker Meet Cassandra. Cassandra Meet Docker - Article reviewing how to setup a complete Cassandra application with monitoring on Docker.
- K8ssandra.io - Kubernetes + Cassandra - K8ssandra provides a production-ready platform for running Cassandra on Kubernetes. This includes automation for operational tasks such as repairs, backups, and monitoring.
- Sky UK - Cassandra Kubernetes Operator - Kubernetes operator that manages Cassandra clusters inside Kubernetes. Well designed and organized.
- Rook.io - Cassandra on Kubernetes - Rook is an open source cloud-native storage orchestrator, providing the platform, framework, and support for a diverse set of storage solutions to natively integrate with cloud-native environments. They have a special operator for Cassandra amongst other providers.
-
Integrating with Cassandra
- DataStax C# Driver Documentation - Documentation on the C# Driver for Cassandra from DataStax.
- CQL data types to C# types - Documentation on CQL data types to C# types.
- Connect to Cassandra with C# - Instaclustr article on how to connect to Cassandra with C#.
- Access Amazon Keyspaces with a Cassandra .NET Core Driver - Article shows how to connect to Amazon Keyspaces by using a .NET Core client driver.
- Cassandra ADO.NET Driver - Cassandra ADO.NET Data Provider enables user to easily connect to Cassandra data from .NET applications.
- Cassandra Pagination with ASP.NET Core C# - Article covering how to create infinite scroll pagination with Cassandra and ASP.NET Core C#.
- Elassandra - Elassandra = Elasticsearch as a Cassandra secondary index.
- cassandra-trigger - Cassandra trigger to push realtime updates to elasticsearch.
-
-
Databases
-
Timeseries Databases
- Cassandra Schema — KairosDB 1.0.1 documentation - KairosDB documentation.
- Hawkular.org - Time series / distributed tracing database powered by Cassandra by Redhat.
- Hawkular GitHub - Hawkular's GitHub resources.
-
Graph
- DSE Graph | Datastax - Successor to TitanDB , Commercial Tinkerpop / Gremlin compatible large scale Graph Database on DSE.
- Introduction to TitanDB - Introductory slides about TitanDB.
- Large Scale Graph Analytics with JanusGraph - Slides detailing deployment options and technical aspects of JanusGraph.
- Architecture Overview · GitBook - Documentation for HugeGraph.
-
Miscellaneous
- Cassandra vs MongoDB - Article comparing the two popular NoSQL databases.
- Meet Stargate, DataStax's GraphQL for databases. First stop - Cassandra - Introduction and high-level overview of Stargate.
- Building Your Own BaaS With Apache Usergrid & Docker: Lessons Learned At Scale - Introductory presentation to Apache UserGrid.
-
-
Packages
-
Tools
- JetBrains Datagrip DB IDE - The Cross-Platform IDE for Databases & SQL by JetBrains, with support for Cassandra.
- Ansible-Galaxy: Cassandra - Documentation for Ansible-Galaxy: Cassandra.
- dseansible - DSE Installation and Upgrade Ansible Playbooks/Roles for Ubuntu Linux.
- DBeaver - Free Universal Database Tool - Third party tool for dealing with all sorts of databases including Cassandra.
- RazorSQL - Multi DB Manager Tool - Multi-db tool for Linux, Mac, and Windows that works with Cassandra.
- Cassandra Reaper - Automated repairs for Cassandra. Supports all versions.
- Web: Cassandra Calculator - Simple calculator to see how size / replication factor affect the system's consistency.
- Cassandra-web - Web interface for Cassandra.
- Presto - Distributed SQL Query Engine for Big Data. Presto allows querying data where it lives, including Hive, Cassandra, relational databases or even proprietary data stores.
- CQL Data Modeler - Very useful tool to test out a CQL schema and visualize what the partition would like in relationship to the columns and rows.
- Datadog - Third party tool that allows monitoring and metrics for Cassandra nodes and clusters.
- tlp-cluster - Provisioning tool for Cassandra designed for developers looking to benchmark and test Cassandra. It assists with builds and starting instances on AWS.
-
Open Source Applications
- Cassandra-Tools - Python Fabric scripts to help automate the launching and managing of cluster testing on AWS.
- Cherami - Distributed, scalable, durable, and highly available message queue system.
-
Logging /Metrics
- cassandra-log4j-appender - Cassandra appenders for Log4j.
- Metrics Collector for Cassandra - Metric collection and Dashboards for Cassandra (2.2, 3.0, 3.11, 4.0) clusters. Comes with dashboards for Graphana.
-
Categories
Sub Categories
Blogs
42
Books
15
Cassandra Architecture
13
Tools
12
Cassandra Maintenance
9
Cassandra Performance Tuning
9
Using Cassandra
9
Integrating with Cassandra
8
Communities
7
Cassandra Deployment
7
Cassandra Data Modeling
7
Cassandra Monitoring
6
Slides
5
Cassandra Distributions
5
Graph
4
Cassandra from Relational
4
Videos
3
Miscellaneous
3
Timeseries Databases
3
Cassandra Security
2
Open Source Applications
2
Logging /Metrics
2
Cassandra Use Cases
2
Cassandra History
1
Keywords
cassandra
5
database
3
cpp
2
distributed-database
2
nosql
2
kubernetes
2
backup
2
postgresql
1
oracle-database
1
mysql
1
microservices
1
microservice
1
dynamodb
1
distributed-systems
1
distributed-databases
1
cosmos-db
1
amazon-aurora
1
sql
1
scale-out
1
multi-region
1
multi-cloud
1
high-performance
1
distributed-sql-database
1
distributed-sql
1
cloud-native
1
seastar
1
scylla
1
c-plus-plus
1
prometheus
1
grafana
1
collectd
1
storage
1
sstables
1
sstable
1
s3
1
restoring
1
restore
1
oracle
1
ops
1
netapp-public
1
minio
1
gcp
1
clouds
1
ceph
1
backuping
1
azure
1
aws
1
apache
1
transaction
1