Strata Conference + Hadoop World Tutorials

Please note: there is an additional fee required to attend tutorials at Strata + Hadoop World.

Visualization & Interface, Beekman / Sutton North (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Average rating: ****.
(4.14, 7 ratings)
Communicating Data Clearly describes how to draw clear, concise, accurate graphs that are easier to understand than many of the graphs one sees today. The tutorial emphasizes how to avoid common mistakes that produce confusing or even misleading graphs. Graphs for one, two, three, and many variables are covered as well as general principles for creating effective graphs. Read more.
Hadoop: Tools & Technology, Grand East (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Amandeep Khurana (Cloudera), Matteo Bertozzi (Cloudera)
Average rating: **...
(2.50, 10 ratings)
HBase is one of the more popular open source NoSQL databases that have cropped up over the last few years. Building applications that use HBase effectively is challenging. This tutorial is geared towards teaching the basics of building applications using HBase and covers concepts that a developer should know while using HBase as a backend store for their application. Read more.
Hadoop: Tools & Technology, Gramercy Suite (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Tom Wheeler (Cloudera, Inc.)
Average rating: ****.
(4.62, 8 ratings)
This tutorial will explore the tools and techniques you need to ensure that your MapReduce applications are both correct and efficient. You'll learn how to do unit testing, integration testing and performance testing for your Hadoop jobs, as well as how to intepret diagnostic information to isolate and solve problems in your code. Read more.
Hadoop: Tools & Technology, Murray Hill (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Mark Fei (Cloudera)
Average rating: ****.
(4.17, 18 ratings)
Apache Hadoop is enabling companies across many different industries that need to process and analyze large data sets. In this tutorial you will learn why and how people are using Hadoop and related technologies like Hive, Pig and HBase. Read more.
Data Science, Regent Parlor (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Susan E. McGregor (Columbia University), Alice Brennan (The New York World), Michael Sullivan (The New York World)
Average rating: ***..
(3.14, 7 ratings)
This tutorial will provide novice users with an overview of a range of common tools use for data cleaning and analysis - including Microsoft Excel, Google Refine, Python and R - along with their relative strengths and weaknesses. Attendees will not only learn useful new skills, and they will know what kind of expertise they need to seek out for help with more complex tasks. Read more.
Data Science, Sutton Center / Sutton South (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Roy Hyunjin Han (CrossCompute)
Average rating: ***..
(3.62, 8 ratings)
Python is the language of choice when it comes to integrating analytical components. We will present a series of concepts and walkthroughs that illustrate how easy scientific computing is in Python, from machine learning and time series to spatial relationships and network analysis. Read more.
Business & Industry Data Driven Business Day, Grand West (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
For business strategists, marketers, product managers, and entrepreneurs, Data Driven Business Day looks at how to use data to make better business decisions faster. Packed with case studies, panels, and eye-opening presentations, this fast-paced day focuses on how to solve today's thorniest business problems with Big Data. It's the missing MBA for a data-driven, always-on business world. Read more.
Bridge to Big Data, Nassau (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Average rating: ****.
(4.33, 6 ratings)
For CIOs, IT executives, and technology professionals, Strata's Bridge to Big Data lays out the roadmap to get your organization up to speed on big data. In this all-day event, learn how to create big data strategy, manage your first pilot project, demystify vendor solutions and understand how big data differs from BI. Read more.
Hadoop: Tools & Technology, Gramercy Suite (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Dean Wampler (Typesafe)
Average rating: ***..
(3.75, 4 ratings)
This hands-on tutorial teaches you how to setup and use Hive, a high-level, data warehouse tool for Hadoop. Hive provides a SQL-like query language, HiveQL, that is easy to learn for people with prior SQL experience, making Hive attractive for data warehousing teams. Hive leverages the power of Hadoop for working with massive data sets without requiring expertise in MapReduce programming. Read more.
Hadoop: Tools & Technology, Sutton Center / Sutton South (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Ed Kohlwey (Booz Allen Hamilton), Stephanie Beben (Booz Allen Hamilton)
Average rating: ***..
(3.43, 7 ratings)
In this tutorial, we’ll provide an introduction to an open source Map/Reduce library for R called RHadoop that makes Map/Reduce programming convenient and easy to understand for statistical modeling users. The session will cover the basics of RHadoop, common techniques and best practices, and some interactive real-world examples. Read more.
Hadoop: Tools & Technology, Regent Parlor (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Hari Shreedharan (Cloudera Inc.), Will McQueen (Cloudera Inc.), Arvind Prabhakar (Cloudera), Prasad Mujumdar (Cloudera Inc.), Mike Percy (Cloudera)
Average rating: ***..
(3.00, 4 ratings)
Apache Flume (incubating) is a scalable, reliable, fault-tolerant, distributed system designed to collect and transfer massive amounts of event data from disparate systems into some storage tier such as Hadoop HDFS. In this tutorial we show how to easily build a large-scale data collection and transfer system in a scalable way using Flume NG, the next generation of Flume. Read more.
Business & Industry Data Science, Grand East (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Robert Grossman (Open Data Group), Collin Bennett (Open Data Group)
Average rating: ****.
(4.25, 4 ratings)
A successful big data analytic project is not just about selecting the right algorithm for building a predictive model, but also about how to deploy the model efficiently into operational systems, how to evaluate the effectiveness of the model, and how to continuously improve it. In this tutorial we cover best practices for each of these phases in the life cycle of a predictive model. Read more.
Hadoop: Case Studies Hadoop: Tools & Technology, Murray Hill (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Sewook Wee (Accenture), Ryan Tabora (Think Big Analytics), Jason Rutherglen (Datastax)
Average rating: *....
(1.80, 5 ratings)
This tutorial will help participants understand why distributed search is important and teach them how to use the landscape of tools available. Based on our hands-on experience at NetApp, we will lead a tutorial session that will teach participants how to setup and use search technologies such as Apache Solr and Lucene to enable real-time Big Data analytics with Hadoop, HBase, and other NoSQL. Read more.
Visualization & Interface, Beekman / Sutton North (NY Hilton)
Tutorial Please note: to attend, your registration must include Tutorials.
Average rating: ****.
(4.33, 6 ratings)
This workshop is a jumpstart lesson on how to get from a blank page and a pile of data to a useful data visualization. We'll focus on the design process, not specific tools. Bring your sample data and paper or a laptop; leave with new visualization ideas. Read more.

Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com.

Media Partner Opportunities

For information on trade opportunities contact Kathy Yu at mediapartners
@oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata contacts.