Skip to main content
Related Resources

Big Data Now 2012

Free Report The latest anthology of O'Reilly Radar's data coverage, released October 2012.

Planning for Big Data

Free Report A handbook for CIOs & managers, this reports outlines big data technologies and creating data-driven products.

More Resources »

About Strata + Hadoop World

Why You Should Attend | Experience Strata | Kudos | Who Should Attend | Program Committee | Track Hosts

Strata + Hadoop World is where big data's most influential decision makers, architects, developers, and analysts gather to shape the future of their businesses and technologies. Since joining forces last year, Strata + Hadoop World is also one of the largest gatherings of the Apache Hadoop community in the world, with emphasis on hands-on and business sessions on the Hadoop ecosystem. If you want to tap into the opportunities brought by big data, data science, and pervasive computing, you'll want to be there.

The future belongs to those who understand how to collect and use their data successfully. And that future happens at Strata.

Why You Should Attend

  • Be among the first to understand how to take advantage of this change, and survive the resulting disruption
  • Find new ways to leverage your data assets across industries and disciplines
  • Learn how to take big data from science project to real business application
  • Discover training, hiring, and career opportunities for data professionals
  • Meet-face-to face with other innovators and thought leaders

The Strata Experience

  • Three days of inspiring keynotes and intensely practical, information-rich sessions exploring the latest advances, case studies, and best practices
  • A Sponsor Pavilion with key players and latest technologies
  • A vibrant "hallway track" for attendees, speakers, journalists, and vendors to debate and discuss important issues
  • Plenty of events and networking opportunities to meet other business leaders, data professionals, designers, and developers

Kudos from Previous Strata + Hadoop World Attendees

“In a crowded market place of "Big Data" conferences, Strata has firmly established itself as the place where you go to meet people who think and do data science.”

“One of the most valuable events to advance my career.”

“Everyone in big data is there”

“A must for anyone who wants a peek into the future of Big Data."

“At first I was dizzied by the scope of the discussion. The technology gets very deep. The analytical techniques go quite far. You can be anyone—from a person like me who is just learning, to someone who has a PhD in machine learning. The difference in expertise is great, and people are coming from all kinds of different spaces with different research agendas. I found that both dizzying and fascinating. Strata helped me to develop a vocabulary for what I was getting into, and helped me to see where the horizons were and how to map out the space in front of me.”

See more Kudos

Who Attends Strata

Strata attendees have job titles including:

  • Analytics Manager, Analytics Director
  • Big Data Lead / Director of Big Data
  • Big Data Architect / Big Data Systems Engineer
  • Bioinformatics Scientist
  • Business Analyst
  • Business Intelligence Analyst / BI Director
  • Business Systems Director
  • CEO / CIO / COO / CTO
  • Chief Architect, Chief Data Architect
  • Chief Data Officer / Chief Data Scientist
  • Cloud Architect / Cloud Engineer
  • Software Engineer / Lead Engineer
  • Software Architect / Sr. Software Architect
  • Solutions Architect
  • Statistician
  • Systems Analyst / Sr. Systems Analyst
  • Systems Engineer / Sr. Systems Engineer
  • Technology Director
  • Venture Capitalist / Investor
  • Visual Designer
  • Web Analyst
  • Web Applications Developer / Web Developer

See more job titles

Strata attendees are from organizations such as:

  • Accenture
  • Adobe Systems
  • AKQA
  • Allstate
  • Amazon Web Services
  • American Express
  • American Public Media
  • Apple
  • ATT Interactive
  • Bank of America
  • Bed Bath and Beyond
  • DuPont Pioneer
  • eBay
  • Edmunds
  • Etrade
  • Etsy
  • Experian
  • Federal Reserve Board
  • Gartner
  • General Electric
  • Google
  • Lawrence Berkeley Labs
  • Opera Solutions
  • Orbitz Worldwide
  • Paypal
  • Raytheon / Jet Propulsion Lab
  • Salesforce.com
  • Samsung
  • SEOMoz
  • Shopzilla
  • Smithsonian Institution
  • Skype
  • Stanford University

See more organizations

Program Chairs

Edd Dumbill Edd Dumbill
is a founding program chair of Strata, and chaired the O'Reilly Open Source Convention for six years. A technologist and writer, Edd is currently VP Strategy at Silicon Valley Data Science. Edd was the editor of XML.com for many years, and program chair for the XML Europe and XTech conferences.

Alistair Croll Alistair Croll
is the founder of Bitcurrent, a research firm focused on emerging technologies. He's founded a variety of startups, and technology accelerators, including Year One Labs, CloudOps, Rednod, Coradiant (acquired by BMC in 2011) and Networkshop. He's a frequent speaker and writer on subjects such as entrepreneurship, cloud computing, Big Data, Internet performance and web technology, and has helped launch a number of major conferences on these topics.

Alistair has co-authored three books on Internet technology, including Web Operations (2010, O'Reilly), Complete Web Monitoring (2009, O'Reilly), and Managing Bandwidth: Deploying QOS in Enterprise Networks (1999, Prentice-Hall). A product manager by trade, he's also an advisor to a number of venture firms and the founder of the Bitnorth conference.

Most of what he says winds up on his blog, solveforinteresting.com, or on Twitter (@acroll).

Committee Members

  • Michael Abbott, Kleiner Perkins Caufield & Byers
  • Joseph Adler, LinkedIn
  • Amr Awadallah, Cloudera
  • Dave Beckett, Rackspace
  • Matt Biddulph, Product Club
  • Ron Bodkin, Think Big Analytics
  • Carla Borsoi, iMatchative
  • Edward Capriolo, Dstillery
  • Drew Conway, IA Ventures
  • Alistair Croll, Solve for Interesting
  • Paul Dix, Errplane
  • Edd Dumbill, Silicon Valley Data Science
  • Kellan Elliott-McCrea, Etsy
  • DJ Patil, RelateIQ
  • Nick Grossman, Union Square Ventures
  • Martin Hall, Karmasphere
  • Jeff Hammerbacher, Cloudera
  • Mark Hansen, UCLA
  • Mike Hendrickson, O'Reilly Media
  • J. C. Herz, GNS Healthcare
  • Coco Krumme, MIT Media Lab
  • Noah Iliinsky, Complex Diagrams
  • Adam Laiacano, Tumblr Inc.
  • Maribel Lopez, Lopez Research
  • Ben Lorica, O'Reilly Media
  • Michael Loukides, O'Reilly Media
  • Hilary Mason, Bit.ly
  • Sarah Novotny, SarahNovotny.com
  • Claudia Perlich, Dstillery
  • Jake Porway, DataKind
  • John Rauser, Pinterest
  • Toby Segaran, Google
  • Maxwell Shron, Shron LLC
  • Peter Skomoroch, LinkedIn
  • Julie Steele, O'Reilly Media
  • Bradford Stephens, Ping Identity
  • James Stogdill, O'Reilly Media
  • Kaitlin Thaney, Mozilla Foundation
  • Matt Turck, Bloomberg Ventures
  • Joseph Turian, Ph.D., MetaOptimize
  • Andrew Vande Moere, Infosthetics
  • Simon Wardley, LEF (Leading Edge Forum)
  • Chris Wensel, Concurrent
  • John Myles White, MIT
  • Chris Wiggins, Hackny.org & Columbia.edu
  • Chrys Wu, New York Times
  • Charles Zedlewski, Cloudera

Track Hosts

Data, Connectivity, and Society

Renee DiResta is a Principal at O'Reilly AlphaTech Ventures (OATV), where she evaluates seed-stage investments. Prior to joining OATV in June of 2011, Renee spent seven as a trader at Jane Street Capital, a quantitative proprietary trading firm in New York City. She is interested in meeting interesting startups, data science, and improving liquidity and transparency in private markets.

Laurel Ruma is the Director of Talent for O'Reilly Media. Most recently she co-chaired Where 2.0, OSCON Java, and Gov 2.0 Expo. She joined the company in 2005 after working for five years at various IT analyst firms in the Boston area. Laurel is also the co-editor of Open Government, published by O'Reilly.

Data Science

Anna Smith is a resident data scientist at bitly in New York while being in absentia from the University of Oregon physics doctorate program. Recently, she has published in both Forbes and Publications of the Astronomical Society of Australia. Her interests include manipulating data and catching up on the latest celebrity gossip.

Max Shron is a New York-based data strategist. He provides expertise and mentorship ranging from specification design and platform architecture to strategy execution, to organizations across a wide gamut of sizes and industry verticals. This work encompasses a complete data pipeline including definition, collection, analysis, visualization, and insight. Max previously was lead data scientist at New York-based OkCupid, and participated as the big-data side of its successful OkTrends blog. His work has appeared worldwide, in outlets including the New York Times, Chicago Tribune, Huffington Post and WNYC. Max holds a degree in Mathematics from the University of Chicago.

Design

Noah Iliinsky strongly believes in the power of intentionally crafted communication. He has spent the last several years researching, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. He is a frequent speaker in both industry and academic contexts. He has a master’s in Technical Communication from the University of Washington, and a bachelor's in Physics from Reed College. Noah works as a Visualization Expert at IBM's Center for Advanced Visualization.

Julie Steele is the Content Editor for Strata at O'Reilly Media. She is co-author of Beautiful Visualization and Designing Data Visualizations. She finds beauty in exploring complex systems, and thinks in metaphors. She is particularly drawn to the visual medium as a way to understand and transmit information.

Enterprise Data

Mona Vernon leads the Emerging Technologies group at Thomson Reuters. ]Mona is leading the charge on how to create an effective culture of Innovation at Thomson Reuters with a focus of external innovation and reaping the benefits of open innovation. She created and is running several successful initiatives that fall under the broad concept of “intrapreneurship” or applying “lean start-up” principles for the enterprise. She has also been instrumental in creating external innovation partnerships between Thomson Reuters on emerging and disruptive technologies. Mona holds a B.S. and M.S. in Mechanical Engineering from Tufts University and a S.M. from M.I.T. where her research focused on the role of customer experience in digital business strategy. Prior to joining Thomson Reuters, Mona worked in Technology start-ups in product development and management roles.

Mark Madsen designs and builds analysis and decision support systems, and building data management and access infrastructure. Research focus these days is on analysis techniques, emerging technology and practices in analytics, BI, information management, user experience for data access & delivery applications. I speak at a lot of conferences on anything data, with a bunch of history of science and technology mixed in.

Hadoop Platform

Eric Baldeschwieler served as VP Hadoop Software Engineering for Yahoo!, where he led the evolution of Apache Hadoop from a 20 node prototype to a 42,000 node service that is behind every click at Yahoo!. Eric also served as a technology leader for Inktomi’s web service engine, which Yahoo! acquired in 2003. Prior to Inktomi, Eric developed software for video games, video post production systems and 3D modeling systems. Eric has a Master’s degree in Computer Science from the University of California, Berkeley and a Bachelor’s degree in Mathematics and Computer Science from Carnegie Mellon University.

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of "Hadoop: The Definitive Guide" for O'Reilly. Previously he worked as an independent consultant specializing in Hadoop, and before that was co-founder and Lead Developer at Kizoom, a UK mobile application startup. Tom has a Bachelor's degree in Mathematics from the University of Cambridge, and a Master's degree in History and Philosophy of Science from the Universities of Leeds, UK, and Florence, Italy.

Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He's been involved in the open source community and has contributed to a large number of projects over the last decade.

Hadoop in Action

Martin Hall is co-founder, Chairman & Executive Vice President of Corporate Development at Karmasphere. He brings a strong entrepreneurial track record and a history of pioneering new Internet technologies and markets. Prior to founding Karmasphere, Martin was a founder of Aventail, a leading computer security company acquired by SonicWall. Prior to that, he was the founding CEO of Stardust, an Internet technology services company sold to Penton Media. Martin has chaired and participated in a number of industry groups including WinSock, Quality of Service, Internet Multicast and Wireless Multimedia Forums. He holds a Masters of Computer Science from Staffordshire University in Stafford, England.

Andrew Musselman is Chief Data Scientist in the global big data practice at Accenture. His background is in math, front- and back-end web, recommenders, and other large-scale modeling and prediction systems. In addition to building systems for clients, Andrew builds internal tools for performing data science and engineering quicker and more rigorously, and does recruitment and training in a growing team. He is a big fan of Hadoop, Pig, and Mahout, and actively promotes new tools within the firm and with clients.

Hadoop & Beyond

Justin Borgman is Co-Founder and CEO of Hadapt. Prior to Hadapt, Justin led product development for COVECTRA, an anti-counterfeit technology firm. Before that, Justin founded an online social media company and spent the first six years of his career as a software developer at MIT Lincoln Laboratory and Raytheon.

Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.

Reynold Xin is an Apache Spark committer and the lead developer for Shark and GraphX, two computation frameworks built on top of Spark. He is also a co-founder of Databricks. Before Databricks, he was pursuing a PhD focusing on large scale data systems in the UC Berkeley AMPLab.

Hardcore Data Science

Lynn Cherny has a Ph.D. from Stanford and an M.Phil. from Cambridge University. She has 19 years of experience managing and doing interface design and data analysis at companies including TiVo, Excite, Adobe, The MathWorks, and AT&T Labs. Her consulting work now focuses on data analysis and visualization.

Ben Lorica is the Senior Analyst in the Market Research Group at O'Reilly Media, Inc.. He has applied Business Intelligence, Data Mining and Statistical Analysis in a variety of settings including Direct Marketing, Consumer and Market Research, Targeted Advertising, Text Mining, and Financial Engineering. His background includes stints with an investment management company, internet startups, and financial services. At O'Reilly, Ben works on custom research and consulting projects, open source data warehousing and analytics.

Sponsors

Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners
@oreilly.com

Press & Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts