Databases Reference
In-Depth Information
FIGURE 10.9
Data virtualization-based Big Data integration.
Operational costs—in this architecture the operational cost calculation has fixed and variable cost
components. The variable costs are related to processing and computing infrastructure and labor
costs. The fixed costs are related to maintenance of the data virtualization platform and its related
costs.
Pitfalls to avoid:
Loosely coupled data integration.
Incorrect data granularity across the different systems.
Poor metadata across the systems.
Lack of data governance.
Complex data integration involving too many computations at the integration layer.
Poorly designed semantic architecture.
There are many more possible architectural deployments to integrate Big Data and create the next-
generation data warehouse platform. This chapter's goal is to provide you a starter kit to begin look-
ing at what it will take for any organization to implement the next-generation data warehouse. In the
next section we discuss the semantic framework approach.
Semantic framework
Building the next-generation data warehouse requires strong metadata architecture for data integra-
tion, but that does not solve the data exploration requirements. When data from multiple sources and
systems is integrated together, there are multiple layers of hierarchies including jagged and skewed
hierarchies, data granularity at different levels, and data quality issues especially with unstructured
 
Search WWH ::




Custom Search