Database Reference
In-Depth Information
Figure 11-32. Regular-expression data-quality report on “paydate” column
The Figure 11-32 report shows the output and the fact that over 92 percent of the data in the “paydate” column
has failed the basic date-format test, probably because it is null. The content here is not important, but the usefulness
of these reports is. If you take care when selecting the rules for generating a data-quality report, you can produce a set
of reports like this that have great importance, especially when you are attempting to ensure the quality of HDFS- and
Hive-based data.
Potential Errors
I did encounter a few errors while using Talend's profiling. Generally these were not problems with Talend itself but,
rather, were configuration issues. By examining my solutions, you will be able to either avoid these errors or use them
to devise your own solutions.
I received the following error, followed by a message stating that I needed to install the library Zql.tar:
zql/ParseException
 
Search WWH ::




Custom Search