Schedule: Hadoop & Beyond sessions

Hadoop & Beyond, Grand West (NY Hilton)
Michael Manoochehri (Google, Inc.), Jim Caputo (Google, Inc.)
Average rating: ***..
(3.47, 15 ratings)
Google’s Dremel is a scalable, interactive ad-hoc query system capable of running SQL-like queries over trillion-row tables in seconds. BigQuery is the externalization of this technology as a REST API and web app. This session will discuss the capabilities of Dremel and dive into the design challenges necessary to make this technology accessible and performant for developers and business users. Read more.
Hadoop & Beyond, Grand West (NY Hilton)
Frank Weigel (Couchbase, Inc.)
Average rating: ****.
(4.00, 4 ratings)
OMGPOP’s Draw Something broke all records when it went viral, skyrocketing to more than 50 million downloads and billions of drawings within a few weeks of launch – with no downtime. This session highlights the application architecture and data management technology that enabled this growth, and provides a real-time data management model for developers of any interactive web application. Read more.
Hadoop & Beyond, Grand West (NY Hilton)
Rich Hickey (Datomic)
Average rating: ***..
(3.33, 3 ratings)
The big data movement has highlighted the value of historical information, and storage is readily available, so why are you still using an update-in-place database? In this talk we'll deconstruct the traditional monolithic database with an eye towards leveraging the scaling properties of distributed architectures, while meeting the business needs for complete historical information. Read more.
Hadoop & Beyond, Grand West (NY Hilton)
Mike Driscoll (Metamarkets), Eric Tschetter (Metamarkets)
Average rating: ****.
(4.00, 9 ratings)
Hadoop is considered THE technology for addressing Big Data. While it shines as a processing platform, it does not respond anywhere close to "human time". In developing our solution, we needed the ability to query across billions of rows in seconds. Hear how and why we developed Druid, our distributed, in-memory OLAP data store after investigating various commercial and open source alternatives. Read more.
Hadoop & Beyond, Grand West (NY Hilton)
David Blair (Akamai Technologies)
Average rating: ****.
(4.00, 2 ratings)
Trecul is a dataflow system that powers Akamai's Online Adversting business, processing billions of events hourly. Trecul is built on top of HDFS & Hadoop Pipes to achieve fantastic runtime performance. We'll talk about it's use of LLVM-based JIT compilation so everything runs as native C++ code, no Java and no runtime interpreter. Akamai has open-sourced Trecul and it is available on Github. Read more.
Nilesh Jain (Intel Corp)
Average rating: ****.
(4.50, 2 ratings)
The exponential growth of graph-based data analysis is fueling the need for machine learning. Recently, frameworks have emerged to perform these computations at large scale. But, feeding data to these frameworks is a challenge in itself. This talk introduces the GraphBuilder library for Hadoop, which makes the job easier for programmers. Several case studies showacse the utility of library. Read more.
Paul Kent (SAS)
Average rating: ****.
(4.33, 3 ratings)
To unlock the value of Big Data, analytics must be applied. Some enterprises hire platoons of data analysts but many others can't afford to pring on such skilled and expensive resources. How do those businesses uncover opportunity and insight within Big Data assets? They use analytic tools that offload some data discovery to business professionals or deploy intelligent analytic appications. Read more.
Hadoop & Beyond, Grand West (NY Hilton)
Kurt Brown (Netflix)
Average rating: ****.
(4.29, 7 ratings)
Our Data Science tech stack has shifted from best-of-breed, "classic" business intelligence technologies to a hybrid environment, fully leveraging Hadoop and other Big Data solutions. Our philosophy has also evolved, now distilled in thinking and practice into "data science as a service". Why did we do it? What does it look like? What are the benefits? Come find out. Read more.
Hadoop & Beyond, Gramercy West (NY Hilton)
Kenneth Duda (Arista Networks), Amr Awadallah (Cloudera, Inc.)
Explore the network capabilities and architecture necessary to build multi-petabyte clusters. Compare and contrast different networking architectures for Big Data. Use real-world case studies from many of the largest HDFS deployments. Explain how topology aware file systems interact with the network substrate. Discuss differences in architecture based on workload profile and data set size Read more.
Hadoop & Beyond, Gramercy West (NY Hilton)
Matt Wood (Amazon Web Services)
Average rating: ****.
(4.00, 3 ratings)
In this talk we will explore how businesses are marrying human judgment with large scale processing, improving the accuracy of Big Data analytics without sacrificing efficiency or scalability. Real-world examples will be discussed in which Hadoop and crowdsourcing are combined through the Amazon Web Services technologies Elastic MapReduce and Mechanical Turk. Read more.
Hadoop & Beyond, Beekman / Sutton North (NY Hilton)
Gabriel Eisbruch (Mercadolibre.Com), Luis Darío Simonassi (MercadoLibre.Com), Jonathan Leibiusky (MercadoLibre.com)
Average rating: ***..
(3.33, 3 ratings)
The quantity of digital information collected and processed every day is growing at an exponential rate. To make sense of this mountain of data we can no longer afford the delays of batch processing systems. In this track we'll introduce Storm, a new, real-time analytic framework, and show how to use it to massively parallelize information analysis, to get instant results from your data. Read more.
Hadoop & Beyond Hadoop: Tools & Technology, Gramercy West (NY Hilton)
Avi Bryant (Stripe)
Average rating: *****
(5.00, 3 ratings)
Start on low heat with a base of Hadoop; map, then reduce. Flavor, to taste, with Scala's concise, functional syntax and collections library. Simmer with some Pig bones: a tuple model and high-level join and aggregation operators. Mix in Cascading to hold everything together and boil until it's very, very hot, and you get Scalding, a new API for MapReduce out of Twitter. Read more.

Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com.

Media Partner Opportunities

For information on trade opportunities contact Kathy Yu at mediapartners
@oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata contacts.