Database Reference
In-Depth Information
discovered analyzing the content of the source database in the requirements
phase.
Finally, for human resource management (Columns 4a-4c in Table
10.1
),
we need to analyze sales by employee supervisors. Thus, in dimension
Employee
, we defined a recursive hierarchy denoted
Supervision
.
Check Data Availability and Specify Mappings
The next step in the method is to check the availability of data in the
source systems for all elements included in the data warehouse schema. In our
example, the logical schema of the data source is depicted in Fig.
10.4
,thus
facilitating the task of specifying mappings. In the absence of a conceptual
representation of the source systems, their logical structures can be used
instead. Table
10.3
shows an example of a table that specifies the way in which
source tables and attributes of the operational databases are related to the
levels and attributes of the data warehouse. The rightmost column indicates
whether a transformation is required. For example, data representing the
ProductName
,
QuantityPerUnit
,and
UnitPrice
of products in the operational
database can be used without any transformation in the data warehouse for
the corresponding attributes of the
Product
level. Note that Table
10.3
is
just a simplification of the information that should be collected. Additional
documentation should be delivered that includes more detailed specification
of the required mappings and transformations.
Develop Final Conceptual Schema and Mappings
Revision and additional consultation with users are required in order to
adapt the multidimensional schema to the content of the data sources. When
this has been done, the final schema and the corresponding mappings are
developed. In our example, some of the issues found during the revision
process were:
We need to create and populate the dimension
Time
. The time interval of
this dimension must cover the dates contained in the table
Orders
of the
Northwind operational database.
The dimensions
Customer
and
Suppliers
share the geographic hierarchy
starting with
City
. However, this information is incomplete in the opera-
tional database. Therefore, the data for the hierarchy
State
,
Country
,and
Continent
must be obtained from an external source.
Metadata for the source systems, the data warehouse, and the ETL
processes are also developed in this step. Besides the specification of trans-
formations, the metadata include abstract descriptions of various features
mentioned earlier in this section. For example, for each source system, its
Search WWH ::
Custom Search