Performance Tuning - Mastering Apache Cassandra

data is persisted) to be done only after the flush is done or the batch window is over,

whichever is sooner. This means the batch modes will always be slower than the periodic

modes.

For most practical cases, the default periodic value and default fsync() period of ten

seconds will do just fine.

column_index_size_in_kb

The column_index_size_in_kb property tells Cassandra to add a column index if the serialized size of a row grows beyond the number of KB set by this property. In other words, if the row size is 314 KB and column_index_size_in_kb is set to 64 (KB), there will be a column index with at least five entries, each containing the start and the finish column name in the chunk, along with the chunk's offset and width.
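The arithmetic behind the "at least five entries" figure can be sketched as follows. This is a simplified model, not Cassandra's actual indexing code: the function name estimated_index_entries is hypothetical, and it assumes the row is cut into chunks of exactly column_index_size_in_kb, whereas real chunks end on column boundaries, so the true count may differ.

```python
import math


def estimated_index_entries(row_size_kb: float, column_index_size_in_kb: int = 64) -> int:
    """Rough number of column index entries for one row (simplified model)."""
    if row_size_kb <= column_index_size_in_kb:
        # The row has not grown beyond the threshold, so no column index is added.
        return 0
    # One entry per chunk of (at most) column_index_size_in_kb.
    return math.ceil(row_size_kb / column_index_size_in_kb)


print(estimated_index_entries(314, 64))  # 5
print(estimated_index_entries(40, 64))   # 0
```

Under this model, raising the property from 64 to 128 would drop the 314 KB row from five entries to three, which is the kind of trade-off discussed next.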

If the row contains many columns (wide rows) or you have columns with really large values, you may want to increase the default. This has a downside: with a large column_index_size_in_kb, Cassandra needs to read at least that much data even when only a single column of a small row with small values needs to be read. On the other hand, with a small value for this property, a large amount of index data needs to be read at each access. The default is okay for most cases.

commitlog_total_space_in_mb

The commit log file is memory mapped (mmap). This means that the file takes up virtual address space. When the commit log grows beyond the size set by this property, Cassandra flushes any unflushed MemTables that exist in the oldest memory-mapped commit log segment to the disk. From the I/O point of view, it does not make sense to keep this property small because the total space of smaller commit logs will be filled up quickly, requiring frequent writes to the disk and higher disk I/O. On the other hand, we do not want commit logs to hog all the memory. Note that data that was not flushed to the disk is replayed from the commit log in the event of a shutdown. So, the larger the commit log, the longer the replay takes on restart.

The default is 32 MB for a 32-bit JVM and 1024 MB for a 64-bit JVM. You may tune it based on the memory available on the node.
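To get a feel for these defaults, the following minimal sketch estimates how many commit log segments fit in the configured total space before the oldest segment has to be flushed. The function names are hypothetical, and the 32 MB segment size is an assumption (it corresponds to the commitlog_segment_size_in_mb setting; check the value in your own cassandra.yaml).

```python
def default_total_space_in_mb(jvm_is_64_bit: bool) -> int:
    """Default commitlog_total_space_in_mb as stated in the text above."""
    return 1024 if jvm_is_64_bit else 32


def segments_before_flush(total_space_in_mb: int, segment_size_in_mb: int = 32) -> int:
    """Whole segments that fit in the total space (assumed 32 MB segments)."""
    if segment_size_in_mb <= 0:
        raise ValueError("segment_size_in_mb must be positive")
    # At least one segment is always in use, even if the budget is smaller.
    return max(1, total_space_in_mb // segment_size_in_mb)


print(segments_before_flush(default_total_space_in_mb(True)))   # 32
print(segments_before_flush(default_total_space_in_mb(False)))  # 1
```

Under these assumptions, a 64-bit node holds many segments before a forced flush, while a 32-bit node forces flushes almost continuously, which illustrates why a very small total space increases disk I/O.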
