Database Reference
In-Depth Information
paradigms. (Hive-SQL or Pig Latin anyone?) Don't expect to find any one
single person to handle the entire ecosystem himself. This solution will
likely require a team of individuals, and they will likely need some help
getting ramped up on some or all of the technologies. If you are using
existing staff with little to no experience with the Hadoop ecosystem, be
prepared to get them some training and to provide ample time for ramp
up. You can expect several months of ramp-up time for your staff to get
comfortable enough with big data technologies in order for them to create
an enterprise ready solution.
Training opportunities abound. Hortonworks provides training for their
Hortonworks Data Platform (HDP) on Windows ( http://hortonworks.com/
hadoop-training/hadoop-on-windows-for-developers/ ).Thisisagoodplace
to start because they walk through the basics of Hadoop and Hortonworks
Data Platform. In addition, they will walk through the ecosystem of C#, Pig,
Hive, HCatalog, Sqoop, Oozie, and Microsoft Excel.
Coursera is another great place to provide very applicable training to your
employees on big data concepts and technology. Courses on linear algebra
provide a good refresher or ramp-up for the concepts and methods of linear
algebra and how to use those concepts to think about computational
problems that arise in computer science. Several statistics classes provide
the principles of the collection, display, and analysis of data to make valid
and appropriate conclusions about said data. Other courses that are
applicable are Machine Learning, data mining, and statistical pattern
recognition classes.
Finally, many universities offer graduate-level courses in big data and
business analytics. Universities see the future need for workers able to
traverse and understand large sets of data and so are providing the
necessary classes to provide that workforce. Carnegie Mellon offers a
Masters of Information Technology Strategy with a concentration in big
data and analytics ( http://www.cmu.edu/mits/curriculum/concentration/
bigdata.html ). The MITS degree provides a multidisciplinary education that
allows students to understand and conceptualize the development and
management of big data information technology solutions.
Stanford University offers a Graduate Certificate on Mining Massive Data
Sets ( http://scpd.stanford.edu/public/category/
courseCategoryCertificateProfile.do?method=load&certificateId=10555807 ).
The four-course certificate teaches “powerful techniques and algorithms for
Search WWH ::




Custom Search