Databases Reference
In-Depth Information
Let's create the job that does this:
1. Create a new job and name it
Normalize
.
2. Let's create a
File delimited
metadata component for our input file,
categories-to-normalize.csv
, as we have done previously. Note that the
field separator in the file is |. Make sure that this is set in step 3 of the metadata
wizard as, by default, the Studio will try to use
;
as a field separator.
3. Now drag the new metadata component onto the Job Designer and select
tFileInputDelimited
from the pop-up window.
4. We need a normalize component, so search for
normalize
in the Palette
and drop this onto the Job Designer.
5. We also need a delimited output file, so search for
delimited
in the Palette
and add a
tFileOutputDelimited
component to the Job Designer.
6.
Connect the components together—right-click on the input delimited
component, select
Row | Main
, and drop the connector onto the
normalize
component. Connect from the
normalize
component to the delimited output
using the same
Row | Main
method.
7.
Going back to the
normalize
component, we need to specify how it
normalizes. There are two key settings to consider—
Column to normalize
and
Item separator
. In our case, we want to normalize the categories, so
select that from the drop-down list. Our categories are separated by a semi-
colon, so enter
;
in the
Item separator
box.
Search WWH ::
Custom Search