Data Science on Hadoop: How Cloudera Impala Unlocks New Productivity and Insights

Justin Erickson (Cloudera), Marcel Kornacker (Cloudera, Inc.)
Data Science, Hadoop: Tools & Technology, Beekman / Sutton North (NY Hilton)
Average rating: 4.00 (4 ratings)

This talk covers which tools and techniques work well, and which do not, for data scientists working on Hadoop today, and how Cloudera Impala increases the productivity of data science and analysis on Hadoop. Cloudera Impala builds on experience and leading-edge technology from big data systems at Facebook, Google, and Yahoo.

Justin Erickson

Cloudera

Product manager for HDFS, HBase, and part of Hive at Cloudera, the standard for Hadoop. He previously led development of the new high availability and disaster recovery solution for Microsoft SQL Server 2012 and is a Stanford University graduate.

Marcel Kornacker

Cloudera, Inc.

Tech lead at Cloudera for new products. Marcel graduated in 2000 with a PhD in databases from UC Berkeley, then held engineering jobs at a few database-related startup companies. He joined Google in 2003, where he worked on several ad-serving and storage infrastructure projects. His last engagement there was as the tech lead for the distributed query engine component of Google’s F1 project.

Comments

David Magaha
11/11/2012 9:29am EST

Justin,

Are you publishing the materials in PDF format? What about the source Hive code and the configuration information?

Thanks

Dave
