Operational Big Data Management - Microsoft Big Data Solutions

Database Reference

In-Depth Information

Installing the Hadoop Metrics Sink

In these next steps, we will prepare the Hadoop Metrics Sink files to be used

later in the process.

1. Create a c:\Ambari folder on each host within the cluster.

2. Retrieve the Microsoft JDBC Driver 4.0 for SQL Server sqljdbc4.jar

file from here ( http://www.microsoft.com/en-us/download/

details.aspx?id=11774 ). Download the Linux version of the driver,

extract it within the Downloads folder, and then find the

sqljdbc4.jar file in the extraction. Copy and place this

sqljdbc4.jar file on each node in the folder created in step 1

( c:\Ambari ).

3. Retrieve the metrics-sink-version.jar file from the

metrics-sink.zip package extracted during the SQL Server

configuration and place it on each node in the folder created in step 1.

Your c:\Ambari folder should now look like Figure 16.3 , with only

these two files.

Figure 16.3 Ambari folder on each node in the cluster

Now you must set up the Hadoop Metrics2 Interface on each node in the

cluster. This will allow it to use the SQLServerSink and send the metric

information from your Hadoop cluster to SQL Server.

1. Edit the Hadoop-metrics2.properties file on each node in the

cluster located by default at {C:\hadoop\install\dir}\bin folder.

On your single node cluster from Chapter 3, “Setting Up for Big Data

with Microsoft,” this location is

c:\hdp\hadoop\hadoop-1.2.0.1.3.0.0-380\bin\hadoop-metrics2.properties .

Replace Server, port, username, and password with the SQL Server

name and port that you configured earlier, along with the username and

password that you created for access to the HadoopMetrics database.

Your Hadoop-metrics2.properties file should look similar to

Figure 16.4 when you are done:

Search WWH ::

Custom Search

Home