Database Reference
In-Depth Information
Note
The Java runtime is also deployed in the same directory.
Summary
In this chapter, you read about enabling Remote Desktop and logging on to the HDInsight cluster's name node with
proper cluster credentials. The name node is the heart of the cluster, and you can do all the operations from the name
node that you can from the management portal or the .NET SDK and PowerShell scripts.
The name node gives you access to the Hadoop command line and the web interfaces that are available with the
distribution. HDInsight simulates WASB as HDFS behind the scenes for the end users. You saw how actually all
the input and output files are saved back to your Azure storage account dedicated for the cluster through the Azure
Management portal. The WASB mechanism is an abstraction to the user, who sees a simulation of HDFS when dealing
with file system operations. You learned to execute basic HDFS/MapReduce commands using the command line and
about the different unique Windows services for Hadoop. You also had a look at the different supporting projects like
Hive, Sqoop, and Pig and how they can be invoked from the command line as well as from PowerShell scripts. Finally,
we navigated through the installation files and folder hierarchies of Hadoop and the other projects in the C:\apps\
dist directory of the name node.
 
 
Search WWH ::




Custom Search