Database Reference
In-Depth Information
These errors were caused by incorrect configuration file entries while I was learning to use Hunk. I changed the
configuration file entries and restarted Hunk to solve this problem.
If you encounter any of these or additional errors, another good resource to consult is the answers section of the
splunk.com website, including the support menu options. Try to be a good Splunk citizen by adding as much detail
as possible to any questions or answers that you might post on the forum. If you find a solution to your problem, then
post that solution to help future users.
Talend Reports
Expanding on the big data ETL work discussed in Chapter 10, it's time to examine the reporting capabilities of the
Talend Enterprise big data product—specifically, the profiling functionality. With Talend, you can check the quality of
Hive-based data and build reports from table-based data.
Installing Talend
To download the Talend Enterprise big data application, I go to the URL www.talend.com/download , then select
the Big Data tab and click the Download Free Trial button. I need to enter my details, so I can't simply execute the
Linux wget command from the Linux command line, as in previous download examples in this topic. The package
download is 2 GB, so it will take some time.
The license that was automatically emailed to me does not allow me to access the profiling function. I need
to request a different license from Talend to “unlock” the profiling function on the user interface. (You can contact
Talend via www.talend.com/contact to request similar access.) The Sales Solutions Group Manager at Talend kindly
supplied the license, while others at Talend offered help and documentation so I could develop the example I will
present here.
I install the Talend software on the Centos Linux host hc1nn, using that machine as a client to access the Centos 6
Cloudera CDH5 Hadoop cluster whose name node resides on the server hc2nn. To unpack the software, I use the root
account, accessing it via the Linux su (switch user) command:
[hadoop@hc1nn ~]$ su -
I move to the Linux hadoop account Downloads directory, where the package was downloaded, and I examine
the downloaded file using the Linux ls command to create a long listing:
[root@hc1nn ~]# cd /home/hadoop/Downloads
[root@hc1nn Downloads]$ ls -lh
-rw-r--r-- 1 hadoop hadoop 2.0G Nov 3 18:01 Talend-Tools-Installer-r118616-V5.5.1-installer.zip
The Talend release file is a zipped archive; I unpack it with the Linux unzip command:
[root@hc1nn Downloads]# unzip Talend-Tools-Installer-r118616-V5.5.1-installer.zip
 
Search WWH ::




Custom Search