Database Reference
In-Depth Information
Figure 13.18 HDInsight JavaScript console results
With that, your HDInsight sandbox tour is complete. Unlike the Amazon
EMR sandbox, you must manually delete your cluster when you finish.
Failure to delete your cluster will result in charges for idle-processing time,
which is wasteful and runs counter to the paradigm discussed previously.
Storing Your Data in the Cloud
Previously in this chapter, you worked with cloud storage as you set up
and provisioned your Amazon and Microsoft Hadoop clusters. This section
looks more closely at cloud storage and some of the common features and
rationalizations as they relate to big data.
Storing Data
Storing your big data in the cloud is a more natural fit than it might appear
at first glance. Cloud data storage works best when the “write once, read
many” or WORM principle is applied. This principle implies that data is
written once and read many times from many different sources. This is one
of the core principles of big data and Hadoop data systems in general.
Cloud service providers offer cheap, scalable, and redundant data storage
that is highly and even globally available when necessary, making it an
 
Search WWH ::




Custom Search