Schedule: Tutorial sessions

Half-day tutorials dive deep into necessary skills and tools. Please note: there is an additional fee required to attend tutorials at Strata 2012.

Ballroom CD
Please note: to attend, your registration must include Tutorials.
Sarah Sproehnle (Cloudera, Inc.)
Average rating: ****.
(4.83, 6 ratings)
This tutorial provides a solid foundation for those seeking to understand large scale data processing with MapReduce and Hadoop, plus its associated ecosystem. This session is intended for those who are new to Hadoop and are seeking to understand where Hadoop is appropriate and how it fits with existing systems. No programming experience is required. Read more.
Ballroom E
Please note: to attend, your registration must include Tutorials.
Ken Krugler (Scale Unlimited)
Average rating: **...
(2.75, 4 ratings)
Want to extract and process Big Data from the web? This tutorial will show you how to use key open source technologies such as Hadoop, Cascading, Bixo, Tika, Mahout and Solr to create scalable, reliable web mining solutions. Read more.
Ballroom H
Please note: to attend, your registration must include Tutorials.
Dean Wampler (Typesafe), Jason Rutherglen (Datastax)
Average rating: ***..
(3.00, 1 rating)
This hands-on tutorial teaches you how to setup and use Hive, a high-level, data warehouse tool for Hadoop. Hive provides a SQL-like query language, HiveQL, that is easy to learn for people with prior SQL experience, making Hive attractive for data warehousing teams. Hive leverages the power of Hadoop for working with massive data sets without requiring expertise in MapReduce programming. Read more.
GA J
Please note: to attend, your registration must include Tutorials.
Average rating: ****.
(4.50, 2 ratings)
This workshop is a jumpstart lesson on how to get from a blank page and a pile of data to a useful data visualization. We'll focus on the design process, not specific tools. Bring your sample data and paper or a laptop; leave with new visualization ideas. Read more.
Ballroom G
Please note: to attend, your registration must include Tutorials.
Joseph Rickert (Revolution Analytics)
Average rating: ****.
(4.50, 4 ratings)
This tutorial will enable anyone with some programming experience to begin analyzing data with the R programming language Read more.
Ballroom F
Please note: to attend, your registration must include Tutorials.
James Dixon (Pentaho), Chris Deptula (OpenBI)
The big data world is extremely chaotic based on technology in its infancy. Learn how to tame this chaos, integrate it within your existing data environments (RDBMS, analytic databases, applications), manage the workflow, orchestrate jobs, improve productivity and make using big data technologies accessible to a much wider spectrum of developers, analysts and data scientists. Read more.
Ballroom H
Please note: to attend, your registration must include Tutorials.
Jock Mackinlay (Tableau Software), Ross Perez (Tableau Software)
Average rating: ****.
(4.00, 2 ratings)
In this hands-on class, learn how to turn data into effective, interactive visualizations. You do not require a Tableau license to participate, but must bring a Windows laptop or virtual machine. Read more.
Ballroom G
Please note: to attend, your registration must include Tutorials.
Nate McCall (Apigee)
This presentation goes beyond the hype, buzzwords, and rehashed slides and actually presents the attendees with a hands-on, step-by-step tutorial on how to write a Java application on top of Apache Cassandra. It focuses on concepts such as idempotence, tunable consistency, and shared-nothing clusters to help attendees get started with Apache Cassandra quickly while avoiding common pitfalls. Read more.
Ballroom E
Please note: to attend, your registration must include Tutorials.
Simon Rogers (Guardian), Michael Brunton-Spall (Guardian News and Media)
Average rating: ****.
(4.00, 1 rating)
Learn first hand from award-winning Guardian journalists how they mix data, journalism and visualization to break and tell compelling stories: all at newsroom speeds. Read more.
Ballroom CD
Please note: to attend, your registration must include Tutorials.
Jeremy Howard (Kaggle), Mike Bowles (Biomatica)
Average rating: ****.
(4.44, 9 ratings)
Wouldn't it be great if there were just use two algorithms which could handle most of your predictive modeling needs? It turns out that actually this is the case. Noted machine learning instructor Dr Mike Bowles and champion data miner Jeremy Howard will teach you everything you need to know to apply them successfully. Read more.
GA J
Please note: to attend, your registration must include Tutorials.
Sarah Sproehnle (Cloudera, Inc.)
Average rating: ****.
(4.25, 4 ratings)
Learn now how to use a Hadoop cluster for data analysis using Java MapReduce, Apache Hive and Apache Pig, and get an overview of using the HBase Hadoop database. Some programming experience is strongly recommended for this session. Read more.
Ballroom F
Please note: to attend, your registration must include Tutorials.
Richard Taylor (HPCC Systems from LexisNexis Risk Solutions)
While extracting entities from massive amounts of text is a major problem, a proven solution exists. This tutorial will demonstrate a natural language parsing technology to extract entities from all kinds of text using massively parallel clusters. Read more.

Sponsors

  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com.

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
@oreilly.com

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

View a complete list of Strata contacts