
Building a Data Platform

John Akred (Silicon Valley Data Science), Richard Williamson (Silicon Valley Data Science), Stephen OSullivan (Silicon Valley Data Science)
Hadoop Platform Grand Ballroom West
Tutorial. Please note: to attend, your registration must include Tutorials on Monday.
Average rating: 3.71 (17 ratings)

What are the essential components of a data platform? This tutorial will explain how the various parts of the Hadoop and big data ecosystem fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads.

By tracing the flow of data from source to output, we’ll explore the options and considerations for components, including:

  • Acquisition: from internal and external data sources
  • Ingestion: offline and real-time processing
  • Storage
  • Providing data services: exposing data to applications
  • Analytics: batch and interactive
  • Data management: data security, lineage, metadata and quality

We’ll also give advice on:

  • tool selection
  • the function of the major Hadoop components and other big data technologies
  • hardware sizing and cloud provisioning
  • integration with legacy systems

John Akred

Silicon Valley Data Science

With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.


Richard Williamson

Silicon Valley Data Science

Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms and maximizing business value by combining data science with big data. He has extensive experience creating advanced analytic systems using data warehousing and data mining technologies.


Stephen OSullivan

Silicon Valley Data Science

A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability data and application solutions. A veteran of WalmartLabs, Sun, and Yahoo!, Stephen leads data architecture and infrastructure.


Comments

10/26/2013 2:17am EDT

Does this session require any pre-reads?

Stephen OSullivan
10/25/2013 7:35pm EDT

Prashant, no software needs to be installed.

Thanks

10/25/2013 6:32pm EDT

Hi, in order to participate, do you want us to install any software beforehand? Thanks, Prashant
