On-the-fly aggregation with human-time (or “interactive”) queries against fresh, at-the-moment data represents a growing trend. Many newly announced systems are starting to provide interactive queries on batched data streams. This talk will discuss how Druid allows users to have interactive queries on real-time data at scale; we feature a case study with Netflix leveraging Druid to obtain at-the-moment insight as it ingests over two terabytes per hour.
Eric Tschetter is the lead architect of Druid, Metamarkets’ distributed, in-memory database. He held senior engineering positions at Ning and LinkedIn before joining Metamarkets. At LinkedIn, Eric productized LinkedIn’s PYMK with Hadoop. He holds bachelors degrees in Computer Science and Japanese from the University of Texas at Austin, and a M.S. from the University of Tokyo in Computer Science.
Danny Yuan is a cloud system architect in the Platform Engineering Team of Netflix. He leads the effort of building and operating Netflix’s data collection pipeline, as well as the real-time insight project of the Platform Engineering Team. He also built Netflix’s crypto service, which manages all the crypto keys used by Netflix applications in the cloud and serves billions of crypto operations every day.
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at firstname.lastname@example.org
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata contacts