Apache Cassandra is an open-source NoSQL database used by organizations known for their large-scale operations, such as CERN and Instagram. Designed to handle large volumes of data across commodity servers, Cassandra benefits from being containerized. In fact, it is one of the most popular images on Docker Hub, with over 5 million pulls.
This guide presents best practices for running Cassandra in Docker containers. You’ll learn how to:
- Accelerate performance by colocating containers and their data on the same hosts
- Simplify deployments using Cassandra's NetworkTopologyStrategy for replica placement
- Improve cluster recovery time after node failure
- Increase container density by safely running multiple rings on the same hosts
- Achieve these benefits using your scheduler of choice: Kubernetes, DC/OS, or Swarm