Cassandra Architecture - Mastering Apache Cassandra

Database Reference

In-Depth Information

Understanding the architecture of

Cassandra

Cassandra is a relative latecomer in the distributed data-store war. It takes advantage of two

proven and closely similar data-store mechanisms, namely Bigtable: A Distributed Storage

System for Structured Data , 2006 ( http://static.googleusercontent.com/external_content/un-

Amazon's Highly Available Key-value Store , 2007 ( http://www.read.seas.harvard.edu/

~kohler/class/cs239-w08/decandia07dynamo.pdf ). The following diagram displays the read

throughputs that show linear scaling of Cassandra:

Like BigTable, it has a tabular data presentation. It is not tabular in the strictest sense. It is

rather a dictionary-like structure where each entry holds another sorted dictionary/map.

This model is more powerful than the usual key-value store and it is named a table,

formerly known as a column family. The properties such as eventual consistency and de-

centralization are taken from Dynamo.

We'll discuss column family in detail in a later chapter. For now, assume a column family is

a giant spreadsheet, such as MS Excel. But unlike spreadsheets, each row is identified by a

row key with a number (token), and unlike spreadsheets, each cell may have its own unique

Search WWH ::

Custom Search

Home