Figure 11-18. Talend's profiling perspective
The Repository pane on the left of the screen shown in Figure 11-18 lists previously created analysis
reports and metadata database connections. Earlier, I created a Hive-based Cloudera CDH5 connection named
hive_db_connection 0.1, which you can see in the list under DB Connections. I also opened the column analysis
report for the Hive rawtrans table, named rawtrans_col_analysis 0.1. The items are expanded in Figure 11-18 to
familiarize you with the display.
To run Talend profiling reports against a CDH5-based Hive data warehouse, I need to know several
properties of the Hive installation: the host it is running on, the port number to connect to, the
Linux user name of the account to use, the password for that account, and the version of Hive in use. I know
that Hive is installed on my cluster on the server hc2nn, and that the account to use is called hadoop. I also know
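
As a rough sketch of how these connection properties fit together, the short Python example below gathers them into one record and builds a Hive JDBC-style URL from them. The host hc2nn and the account hadoop come from the text above. Everything else is an assumption made for illustration: the class name HiveConnection, the port 10000 (the usual HiveServer2 default), the database name default, and the placeholder password. The Hive version matters here because HiveServer2 uses the jdbc:hive2:// scheme, while the older HiveServer1 uses jdbc:hive://.

```python
from dataclasses import dataclass


@dataclass
class HiveConnection:
    """Hypothetical container for the properties a Hive connection needs."""
    host: str                  # server running Hive, e.g. hc2nn
    port: int                  # assumed 10000, the usual HiveServer2 default
    user: str                  # Linux account name, e.g. hadoop
    password: str              # password for that account (placeholder below)
    database: str = "default"  # assumed database name
    hiveserver2: bool = True   # depends on the Hive version in use

    def jdbc_url(self) -> str:
        # HiveServer2 uses the "hive2" scheme; HiveServer1 uses "hive".
        scheme = "hive2" if self.hiveserver2 else "hive"
        return f"jdbc:{scheme}://{self.host}:{self.port}/{self.database}"


# Host and user are taken from the text; port and password are assumptions.
conn = HiveConnection(host="hc2nn", port=10000, user="hadoop",
                      password="<password>")
print(conn.jdbc_url())
```

A URL of this general shape, together with the user name and password, is what a JDBC-based client needs in order to reach Hive. The exact fields that Talend asks for in its connection dialog may differ from this sketch.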
 