Hadoop Plugin for MongoDB: The Elephant in the Room

Steve Francia (10gen)

Learn how to integrate MongoDB with Hadoop for large-scale distributed data processing. Using tools like MapReduce, Pig and Streaming you will learn how to do analytics and ETL on large datasets with the ability to load and save data against MongoDB. With Hadoop MapReduce, Java and Scala programmers will find a native solution for using MapReduce to process their data with MongoDB. Programmers of all kinds will find a new way to work with ETL using Pig to extract and analyze large datasets and persist the results to MongoDB. Python and Ruby Programmers can rejoice as well in a new way to write native Mongo MapReduce using the Hadoop Streaming interfaces.

Photo of Steve Francia

Steve Francia

10gen

Steve Francia leads the public side of the engineering organization at 10gen including integration, evangelism, support and consulting. Steve brings to this role his experience as VP of engineering at OpenSky where he build the worlds first ecommerce site powered by MongoDB and one of the first PHP sites backed by MongoDB. Steve has been an engineer, entrepreneur and executive since 1995 when he built one of the first ecommerce sites while working for American Telecom. His previous roles include CIO/COO at Portero, VP of Development at Takkle and Founder & CTO of Supernerd. Steve loves open source. He has contributed to dozens of open source projects including MongoDB, Doctrine, Symfony2 and Zoop and has started a few of his own. Steve holds a BA from Brigham Young University in Philosophy, where among other things, he created and taught a course on dynamic web development.

Sponsors

  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com.

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
@oreilly.com

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

View a complete list of Strata contacts