Securing the Apache Hadoop Ecosystem

Aaron Myers (Cloudera, Inc.), Shreepadma Venugopalan (Cloudera)
Hadoop Platform Grand Ballroom East
Slides:   1-PPTX 

When a Hadoop deployment processes sensitive data, internal policies, government regulations, or both may impose security requirements: strong authentication, selective authorization for access to data and resources, and data confidentiality. This session covers in detail how components of the Hadoop ecosystem and external applications can interact securely, providing authentication, authorization, and confidentiality when accessing services and when transferring data to, from, and between services.

The session also covers Hive security and how Apache Sentry (incubating) can be used to implement a fine-grained, role-based access control policy for Hive. In particular, we will discuss how Sentry can be used in a multi-tenant environment to secure access to data warehouse objects (tables, schemas, views, partitions, UDFs, and SerDes) as well as to queries that contain subqueries or involve cross-database joins. Finally, the session covers Kerberos authentication, web UI authentication, file system permissions, delegation tokens, MR/YARN access control lists, ProxyUser impersonation, and network encryption.

Aaron Myers

Cloudera, Inc.

Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering at Amie Street, where he worked on all components of the software stack, including operations, infrastructure, and customer-facing feature development. Aaron holds both an Sc.B. and Sc.M. in Computer Science from Brown University.

Shreepadma Venugopalan


Shreepadma Venugopalan is a software engineer in the Platform Team at Cloudera. Prior to Cloudera, Shreepadma was a member of the Server Technologies group at Oracle where she focused on the relational engine, query optimizer, and unstructured data management. She holds a Master’s degree in Computer Science from the University of Wisconsin-Madison.

Comments on this page are now closed.


Aaron Myers
11/01/2013 4:01pm EDT

Thanks for asking, Marek. I believe I’ve just done what it takes to post the slides online. Please let me know if it didn’t work.

Marek Kolodziej
10/30/2013 8:33pm EDT

Would it be possible to post the slides here, like the other speakers have?

