the section “Example of Using Emerging Science to Address Regulatory Issues
and Support Decision-Making: ToxCast Program” in Chapter 3).
As information trends move from long-term data to data that are gathered
in nearly real time from dispersed geographic sites, there will not be time for a
traditional cycle in which the desired information needs to be extracted from the
original compilation, reformatted to a specific standard, and finally loaded into
an analytic application. It will instead be necessary to “send the algorithm
to the data” and collect the results centrally. In other words, the complex
formulas developed to analyze the data may be applied at the site and time of
data collection, rather than the raw data being sent to a central
data-processing site for analysis. That approach, described by Google in 2004,
is named MapReduce and is based on a functional programming model (Dean and
Ghemawat 2004). Hadoop, a widely used implementation of MapReduce, is available
in open-source form and from several major vendors. Not only can Hadoop
programming parallelize the problem of accessing widely distributed data; it is
also especially useful for processing unstructured data or combining them with
traditional structured data.
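The idea of sending the algorithm to the data can be illustrated with a minimal, single-process sketch in Python. It is not Hadoop code and does not use any Hadoop API; the site names, pollutant labels, readings, and function names below are all hypothetical, chosen only to show the shape of the computation. Each site runs a map step on its own readings and returns a small partial summary; a central reduce step merges the summaries without ever seeing the raw data.

```python
from collections import defaultdict
from functools import reduce

# Hypothetical readings held at three separate monitoring sites.
# In a real deployment each list would stay on its own machine.
SITE_DATA = {
    "site_a": [("pm25", 12.0), ("pm25", 18.0), ("ozone", 40.0)],
    "site_b": [("pm25", 30.0), ("ozone", 50.0), ("ozone", 60.0)],
    "site_c": [("pm25", 20.0)],
}


def map_site(readings):
    """Map step, run where the data live: reduce raw readings to
    a compact {pollutant: (sum, count)} summary."""
    partial = defaultdict(lambda: (0.0, 0))
    for pollutant, value in readings:
        total, count = partial[pollutant]
        partial[pollutant] = (total + value, count + 1)
    return dict(partial)


def merge(left, right):
    """Reduce step, run centrally: combine two partial summaries."""
    merged = dict(left)
    for pollutant, (total, count) in right.items():
        prev_total, prev_count = merged.get(pollutant, (0.0, 0))
        merged[pollutant] = (prev_total + total, prev_count + count)
    return merged


def mean_by_pollutant(site_data):
    """Collect per-site summaries and compute overall means."""
    partials = [map_site(readings) for readings in site_data.values()]
    totals = reduce(merge, partials, {})
    return {p: total / count for p, (total, count) in totals.items()}


if __name__ == "__main__":
    print(mean_by_pollutant(SITE_DATA))
```

Only the (sum, count) pairs cross the network in this arrangement, which is why the pattern suits data gathered in nearly real time at dispersed sites. Sums and counts are used instead of per-site means because they can be merged in any order and still give the correct overall mean.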
REFERENCES
Baumgartner, C., M. Osl, M. Netzer, and D. Baumgartner. 2011. Bioinformatic-driven
search for metabolic biomarkers in disease. J. Clin. Bioinform. 1:2,
doi:10.1186/2043-9113-1-2.
Beran, B., and M. Piasecki. 2009. Engineering new paths to water data. Comput. Geosci.
35(4):753-760.
Bois, F.Y. 2000. Statistical analysis of Fisher et al. PBPK model of trichloroethylene
kinetics. Environ. Health Perspect. 108(suppl. 2):275-282.
Casey, M., C. Gennings, W.H. Carter, V.C. Moser, and J.E. Simmons. 2004. Detecting
interaction(s) and assessing the impact of component subsets in a chemical mixture
using fixed-ratio mixture ray designs. J. Agr. Biol. Environ. Stat. 9(3):339-361.
Cockcroft, A. 2011. Net Cloud Architecture. Velocity Conference, June 14, 2011
[online]. Available:
http://www.slideshare.net/adrianco/netflix-velocity-conference-2011
[accessed Apr. 10, 2012].
Dean, J., and S. Ghemawat. 2004. MapReduce: Simplified data processing on large
clusters. Pp. 137-149 in Proceedings of the 6th Symposium on Operating Systems
Design and Implementation (OSDI '04), December 5, 2004, San Francisco, CA
[online]. Available:
http://static.usenix.org/event/osdi04/tech/full_papers/dean/dean.pdf
[accessed Mar. 30, 2012].
Dockery, D.W., C.A. Pope, III, X. Xu, J.D. Spengler, J.H. Ware, M.E. Fay, B.G. Ferris,
and F.A. Speizer. 1993. An association between air pollution and mortality in six
US cities. N. Engl. J. Med. 329(24):1753-1759.
Dominici, F., R.D. Peng, M.L. Bell, L. Pham, A. McDermott, S.L. Zeger, and J.M.
Samet. 2006. Fine particulate air pollution and hospital admission for
cardiovascular and respiratory diseases. JAMA 295(10):1127-1134.
Dzemydienė, D., S. Maskeliūnas, and K. Jacobsen. 2008. Sustainable management of
water resources based on web services and distributed data warehouses. Technol.
Econ. Dev. Econ. 14(1):38-50.