Building high-availability solutions with NoSQL - Making Sense of NoSQL

Databases Reference

In-Depth Information

move on to another site. There are other factors, not as easily measured, which can

lead to losses as well, such as poor reputation and lack of confidence.

Measuring overall system availability is more than generating a single number. To

fairly evaluate NoSQL systems, you need an understanding of the subtleties of avail-

ability measurements.

If a business unit indicates they can't afford to be down more than eight hours in a

calendar year, then you want to build an infrastructure that would provide three-nines

availability. Most land-line telephone switches are designed for five-nines availability,

or no more than five minutes of downtime per year. Today five nines is considered the

gold standard for data services, with few situations warranting greater availability.

Although the use of counted nines is a common way to express system availability,

it's usually not detailed enough to understand business impact. An outage for 30 sec-

onds may seem to users like a slow day on the web. Some systems may show partial out-

age but have other functions that can step in to take their place, making the system

only appear to work slowly. The end result is that no simple metric can be used to

measure overall system availability. In practice, most systems look at the percentage of

service requests that go beyond a specific threshold.

As a result, the term service level agreement or SLA is used to describe the detailed

availability targets for any data service. An SLA is a written agreement between a ser-

vice provider and a customer who uses the service. The SLA doesn't concern itself with

how the data service is provided. It defines what services will be provided and the avail-

ability and response time goals for the service. Some items to consider when creating

an SLA are

 What are the general service availability goals of the service in terms of percent-

age uptime over a one-year period?

 What are the typical average response times for the service under normal oper-

ations? Typically these are specified in milliseconds between service request and

response.

 What is the peak volume of requests that the service is designed to handle? This

is typically specified in requests per second.

 Are there any cyclical variations in request volumes? For example, do you

expect to see peaks at specific times of day, days of the week or month, or times

of the year like holiday shopping or sporting events?

 How will the system be monitored and service availability reported?

 What is the shape of the service-call response distribution curve? Keeping track

of the average response time may not be useful. Organizations focus on the

slowest 5% of service calls.

 What procedures should be followed during a service interruption?

NoSQL system configuration may be dependent on some of the exceptions to the

general rules. The key focus should not be a single availability metric.

Search WWH ::

Custom Search

Home