Database Reference
In-Depth Information
infrastructure as quickly as possible. In Chapter 10, we provide an example
that illustrates the Information Server design canvas, using drag-and-drop
gestures to create a flow that involves a Hadoop data source.
InfoSphere Data Replication integrates with the analytics-based IBM Pure-
Data Systems through its low-impact, low-latency database monitoring that
provides real-time replicated data to either repository. IBM's Information Inte-
gration satisfies the batch and real-time integration requirements of Big Data
projects, because it captures rich design and operational metadata to support
data lineage and data governance analysis. This enables business and IT users
to understand how enterprise data is being used within the Big Data platform.
And because the Big Data platform can leverage Information Integration in
this way, Big Data projects can leverage just about any data source that matters.
InfoSphere Master Data Management
IBM has demonstrated BigInsights working with the IBM InfoSphere Master
Data Management (MDM) suite of products. This work was done by the IBM
Research team in conjunction with a number of clients who wanted to draw
on events and entity resolution from Big Data sources to populate master
profiles in the enterprise. BigInsights is used to analyze raw data and to
extract entities, such as customers and suppliers, for entity analysis algo-
rithms. The data is then further refined (identifying relationships between
entities, for example) before being loaded into the MDM system.
The MDM probabilistic matching engine has also been integrated with
BigInsights, enabling matching on Big Data sets. Raw data is refined and
structured, and then matched against other records. Many customers ask
about the use of Big Data with MDM. The link is a natural one. Think of
MDM as “bookending” the process; it can provide clean master data entities
to a Big Data system for further analysis, and the insight gleaned from the
Big Data system can be fed back to MDM for action.
InfoSphere Guardium
InfoSphere Guardium Database Security (Guardium) is IBM's leading data
activity monitoring (DAM) solution, whose benefits were recently extended to
Hadoop. Guardium integrates with BigInsights as well as open source Hadoop,
to monitor Hadoop systems and to ensure the security of your enterprise's
 
Search WWH ::




Custom Search