Composable Data at Cerner - Hadoop: The Definitive Guide

Moving Forward

We are looking to take two major steps to get more value from this system, and to do so more efficiently.

First, we want to create prescriptive practices around the Hadoop ecosystem and its supporting
libraries. A number of good practices are defined in this book and elsewhere, but
they often require significant expertise to implement effectively. We are using and building
libraries that make such patterns explicit and accessible to a larger audience. Crunch offers
some good examples of this, with a variety of join and processing patterns built into the
library.

Second, our growing catalog of datasets has created a demand for simple and prescriptive
data management to complement the processing features offered by Crunch. We have been
adopting the Kite SDK to meet this need in some use cases, and expect to expand its use
over time.

The end goal is a secure, scalable catalog of data to support many needs in healthcare,
including problems that have not yet emerged. Hadoop has shown it can scale to our data and
processing needs, and higher-level libraries are now making it usable by a larger audience
for many problems.
