Database Reference
In-Depth Information
cleansing process took 7 seconds.
[DQS Cleansing] Information: DQS Cleansing component
records chunk status count - Invalid: 0, Autosuggest: 21,
Corrected: 979, Unknown: 0, Correct: 0.
[DQS Cleansing] Information: DQS Cleansing component
records total status count - Invalid: 0, Autosuggest:
115, Corrected: 4885, Unknown: 0, Correct: 0.
DQS Extensions on CodePlex
A number of DQS extensions are available from CodePlex. These are listed in Table
5-3 . They do not come with SQL Server 2014, but they are very useful for automating
data cleansing scenarios. A brief description on how to use each one is included here.
You can find more information on how to use them from their project pages on
CodePlex.
Table 5-3 . DQS Extensions on CodePlex
Extension
Description
DQS
Matching
This transform allows you to do automated data deduplication within an SSIS
data flow. It provides similar capabilities as the SSIS Fuzzy Grouping transform
but also leverages the DQS matching policy defined within your knowledge
base to give more accurate results.
DQS Do-
main Value
Import
This destination component allows you to bulk load values into a DQS domain.
It is useful for automation scenarios where your domain values are defined with-
in an external system (such as Master Data Services).
Publish
DQS
Knowledge
Base
This task is used to commit changes to your knowledge base (referred to as pub-
lishing in DQS terminology). The task is typically used in conjunction with the
DQS Domain Value Import transform.
Note The CodePlex extensions for DQS were created by OH22 Data ( ht-
tp://data.oh22.net/ ) and are freely available. They are not officially supported
by Microsoft. The extensions can be downloaded from ht-
tps://ssisdqsmatching.codeplex.com/ and ht-
tps://domainvalueimport.codeplex.com/ .
 
Search WWH ::




Custom Search