Database Reference
In-Depth Information
Whichever solution you decide on, you are probably going to want to roll
your own PowerShell scripts to build your cluster so that it is a repeatable
process.
Manageability
Operational considerations are often overlooked when considering
deployment factors. Consider the following:
• How will the system be monitored?
• Which human resources will be needed to support the environment?
• What availability requirements exist, and what disaster recovery
strategy is in place?
• How will security be addressed?
These are all common operational questions that you need to answer.
Deployment Topologies
Now that we have an understanding of the factors that might influence a
deployment, we can focus on the topologies themselves before moving on to
compare them with each other.
In this next section we'll compare the following options:
• On-Premise Hadoop
• Infrastructure as a Service Hadoop
• Platform as a Service Hadoop
Hadoop on Premise
You can always follow the traditional path, which is to build your Hadoop
cluster on premise. Most Hadoop clusters in the world today are built using
Linux as the operating system. However, as you learned in Chapter 1,
Hortonworks has a distribution for Windows.
The biggest challenge when picking Hadoop on premise as your deployment
option is knowing how to size it. How many data nodes will you really need ?
Of course, after you have done that, you then need to procure all the
hardware and rack and configure it. You will also have taken on the
management and monitoring of the cluster and so will need to figure that
Search WWH ::




Custom Search