Database Reference
In-Depth Information
Figure 6-6. The Hadoop command line
This will look very familiar to traditional Hadoop users because this is exactly what you find in the Apache Open
Source project. Again, the point to be noted here is HDInsight is built on top of core Hadoop, so it supports all the
interfaces available with core Hadoop, including the command prompt.
For example, you can run the standard ls command to list the directory and file structure of the current
directory. The command in Listing 6-1 lists the files and folders you have in the root of your container.
Listing 6-1. The HDFS directory structure
hadoop dfs -ls /
This command lists the files and folders in the root of your storage account container in Azure, as shown
in Figure 6-7 .
Figure 6-7. The ls command output
You can run the word-count MapReduce job through the command prompt on the source file provided in the
/example/data/gutenburg directory in your WASB to generate the output file much like you did from the .NET
and PowerShell code in Chapter 5. The command to invoke the MapReduce job is provided in Listing 6-2.
 
Search WWH ::




Custom Search