Druid: Interactive Queries Meet Real-time Data

Eric Tschetter (Metamarkets), Danny Yuan (Netflix Platform Engineering Team)
Beyond Hadoop Great America Ballroom J

On-the-fly aggregation with human-time (or “interactive”) queries against fresh, at-the-moment data represents a growing trend. Many newly announced systems are starting to provide interactive queries on batched data streams. This talk will discuss how Druid allows users to have interactive queries on real-time data at scale; we feature a case study with Netflix leveraging Druid to obtain at-the-moment insight as it ingests over two terabytes per hour.

Photo of Eric Tschetter

Eric Tschetter

Metamarkets

Eric Tschetter is the lead architect of Druid, Metamarkets’ distributed, in-memory database. He held senior engineering positions at Ning and LinkedIn before joining Metamarkets. At LinkedIn, Eric productized LinkedIn’s PYMK with Hadoop. He holds bachelors degrees in Computer Science and Japanese from the University of Texas at Austin, and a M.S. from the University of Tokyo in Computer Science.

Photo of Danny Yuan

Danny Yuan

Netflix Platform Engineering Team

Danny Yuan is a cloud system architect in the Platform Engineering Team of Netflix. He leads the effort of building and operating Netflix’s data collection pipeline, as well as the real-time insight project of the Platform Engineering Team. He also built Netflix’s crypto service, which manages all the crypto keys used by Netflix applications in the cloud and serves billions of crypto operations every day.

Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
@oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata contacts