Strata Conference + Hadoop World Keynotes

Alistair Croll

Alistair Croll, Solve For Interesting (Founder)

Alistair has been an entrepreneur, author, and public speaker for nearly 20 years. In that time he has worked on a variety of topics, from web performance to big data, cloud computing, and startups. In 2001, he co-founded web performance startup Coradiant (acquired by BMC in 2011), and since then has also launched Rednod, CloudOps, Bitcurrent, Year One Labs, the Bitnorth conference, the International Startup Festival and several other early-stage companies.

Alistair is the chair of O’Reilly’s Strata conference, Techweb’s Cloud Connect, and the International Startup Festival. He’s written four books on analytics, technology, and entrepreneurship, including the best-selling Lean Analytics, which is being translated into eight languages. He lives in Montreal, Canada and tries to mitigate chronic ADD by writing...

9:00am Thursday, 10/25/2012
We’re excited to launch the Strata Data Innovation Awards to recognize disruptive, innovative technologies in big data and data science, emphasize the increasing importance of data science for companies, and showcase the highlights of the growing data community.
Location: Grand Ballroom (NY Hilton)
8:45am Thursday, 10/25/2012
Opening remarks by the Strata program chairs, Edd Dumbill and Alistair Croll.
Location: Grand Ballroom (NY Hilton)
8:45am Wednesday, 10/24/2012
Opening remarks by the Strata program chairs, Edd Dumbill and Alistair Croll.
Location: Grand Ballroom (NY Hilton)
Doug Cutting

Doug Cutting, Cloudera (Architect)

Doug (@cutting) is the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera in 2009 from Yahoo!, where he was a key member of the team that built and deployed a production Hadoop storage and analysis cluster for mission-critical business analytics. Doug holds a Bachelor’s degree from Stanford University and sits on the Board of the Apache Software Foundation.

9:20am Thursday, 10/25/2012
Hadoop started as an offline, batch-processing system. It made it practical to store and process much larger datasets than before. Subsequently, more interactive, online systems emerged, integrating with Hadoop.
Location: Grand Ballroom (NY Hilton)
Edd Dumbill

Edd Dumbill, Silicon Valley Data Science (VP Strategy)

Edd Dumbill is a technologist, writer and programmer based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.

Edd was the founding program chair for the O’Reilly Strata conference, and chaired the Open Source Convention for six years. He is Editor in Chief of the journal Big Data.

Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the Pharmalicensing.com online intellectual property exchange.

A veteran of open source, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.

Edd has written four books, including O’Reilly’s “Learning Rails”. He writes regularly on Google+ and on his...

9:00am Thursday, 10/25/2012
We’re excited to launch the Strata Data Innovation Awards to recognize disruptive, innovative technologies in big data and data science, emphasize the increasing importance of data science for companies, and showcase the highlights of the growing data community.
Location: Grand Ballroom (NY Hilton)
8:45am Thursday, 10/25/2012
Opening remarks by the Strata program chairs, Edd Dumbill and Alistair Croll.
Location: Grand Ballroom (NY Hilton)
8:45am Wednesday, 10/24/2012
Opening remarks by the Strata program chairs, Edd Dumbill and Alistair Croll.
Location: Grand Ballroom (NY Hilton)
Tim Estes

Tim Estes, Digital Reasoning (CEO)

Tim Estes is the Chairman, CEO, and founder of Digital Reasoning. Tim’s academic work at the University of Virginia focused on Philosophy of Language, Mathematical Logic, Semiotics, Epistemology, and Phenomenology. It was that eclectic academic background, coupled with the belief that in the future all software would learn from data as a core capability, that gave rise to Digital Reasoning. Tim and his team at Digital Reasoning work closely with leaders in government and industry to solve extraordinarily valuable and morally compelling problems in National Security, Finance, Legal, and Health Care by automating the understanding of unstructured data.

10:00am Wednesday, 10/24/2012
The onset of the Big Data phenomenon has created a unique opportunity, but the challenge ahead of us is to move beyond Big Data infrastructure to morally and practically useful applications. This requires new technologies that close the "Understanding Gap" and, by doing so, can make great strides to prevent evil, reduce suffering, and create more actualized human potential.
Location: Grand Ballroom (NY Hilton)

Michael Flowers

Michael Flowers, NYC Mayor's Office of Policy and Strategic Planning (Analytics Director)

Michael P. Flowers is Analytics Director for the Mayor’s Office of Policy and Strategic Planning and Director of the Financial Crime Task Force of the City of New York. Prior to joining the Bloomberg Administration, Mr. Flowers was Counsel to the U.S. Senate Permanent Subcommittee on Investigations for the 110th and 111th Congresses, where he led bipartisan investigations into off-shore tax haven abuses; failures in the mortgage-backed securitization market by U.S. investment and commercial banks and government agencies; and deceptive financial transactions by the North Korean government. From March 2005 to December 2006, Mr. Flowers was Deputy Director of DOJ’s Regime Crimes Liaison Office in Baghdad, Iraq, supporting the investigations and trials of Saddam Hussein and other high-ranking members of his regime. From 2003...

9:15am Wednesday, 10/24/2012
New York City is a complex, thriving organism. Hear how data science has played a surprising and effective role in helping the city government provide services to over 8 million people, from preventing public safety catastrophes to improving New Yorkers' quality of life.
Location: Grand Ballroom (NY Hilton)
Anthony Goldbloom

Anthony Goldbloom, Kaggle (Founder)

Anthony is the Founder and CEO of Kaggle. He assists companies with framing modeling tasks as data prediction competitions, ensuring that competitions reflect real-life projects. Before founding Kaggle, Anthony worked in the macroeconomic modelling areas of the Reserve Bank of Australia and before that the Australian Treasury. In these roles, Anthony was responsible for building macroeconomic models, generating economic forecasts and simulating the impact of changes in interest rates and fiscal policy on the Australian economy. Anthony holds a first class honours degree in economics and econometrics from the University of Melbourne and has published in The Economist magazine and the Australian Economic Review.

9:25am Wednesday, 10/24/2012
Data science is a team sport. Collaboration inside and outside your organization is the ultimate Big Data technique. Success depends on having a collaboration platform and solving the number one problem of the Big Data era: the supply and demand for data scientists. Learn how you can take action today to accelerate the success of your data science efforts.
Location: Grand Ballroom (NY Hilton)
Rich Hickey

Rich Hickey, Datomic (Author)

Rich Hickey, the author of the Clojure programming language and designer of the Datomic database system, is a software developer with over 20 years of experience in various domains. Rich has worked on scheduling systems, broadcast automation, audio analysis and fingerprinting, database design, yield management, exit poll systems, and machine listening, in a variety of languages.

9:35am Wednesday, 10/24/2012
While moving away from single powerful servers, distributed databases still tend to be monolithic solutions. But key-value storage, for example, is rapidly becoming a commodity service on which richer databases might be built. What are the implications?
Location: Grand Ballroom (NY Hilton)
Annika Jimenez

Annika Jimenez, Pivotal (Global Head of Data Science Services)

Annika is a seasoned leader of analytics initiatives, and came from Greenplum where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain and horizontal analytical expertise – who are facilitating Data Science-driven transformations for Greenplum customers.

Before Greenplum, Annika worked for Yahoo! for six years in data leadership roles: She led Audience and International data solutions for Yahoo!’s central data organization, Strategic Data Solutions, and led Insights Services – comprised of a team of 40 researchers covering Web analytics, satisfaction/brand health metrics, and audience/ad measurement. Annika is a recognized evangelist for “applied data” and well known for her acute focus on action-enablement.

9:25am Wednesday, 10/24/2012
Data science is a team sport. Collaboration inside and outside your organization is the ultimate Big Data technique. Success depends on having a collaboration platform and solving the number one problem of the Big Data era: the supply and demand for data scientists. Learn how you can take action today to accelerate the success of your data science efforts.
Location: Grand Ballroom (NY Hilton)
Paul Kent

Paul Kent, SAS (VP, Platform R&D)

Paul Kent is Vice President of Big Data initiatives at SAS. He spends his time discussing, evangelizing and developing software at the confluence of big data and high performance computing.

Paul was previously Vice President of the Platform R&D division at SAS and led groups responsible for the SAS foundation and mid-tier technologies, teams that develop, maintain and test Base SAS®, as well as related data access, storage, management, presentation, connectivity and middleware software products.

Paul joined SAS in 1984 and contributed to the development of SAS software components including PROC SQL, TCP/IP connectivity, the Output Delivery System (ODS) and high-performance initiatives.

A strong customer advocate, Paul is widely recognized within the SAS community for his active...

9:30am Thursday, 10/25/2012
In this rapid-fire keynote, we’ll introduce how virtually every new technology trend is inextricably linked, or should be, to attain maximum leverage. We’ll discuss how you can use technologies such as cloud and mobility to spread the value of analytics pervasively across your virtual organization, and how that positively impacts your employees, customers and partners.
Location: Grand Ballroom (NY Hilton)
Irfan Khan

Irfan Khan, SAP (VP & CTO, Database & Technology)

Mr. Irfan Khan joined Sybase in 1992. As Senior Vice President and Chief Technology Officer, Mr. Khan oversees all technology offices in each of Sybase’s business units, ensuring market needs and customer aspirations are reflected within the company’s innovation and product development. Mr. Khan is also responsible for setting the architecture and technology direction for the worldwide technical sales organization. As part of CTO responsibilities, Mr. Khan oversees the technology evangelism efforts for Sybase, responsible for seeding new innovation and driving new technologies within Sybase’s most strategic accounts. Mr. Khan also is in charge of the Sybase Developer Network. In 2010, Mr. Khan received the InfoWorld CTO Top 25 Award and was named to the International Advisory Board of Cloud Expo. Mr. Khan...

9:45am Thursday, 10/25/2012
You need more than a database 'hammer' for today's Big Data projects. Organizations need a 'data platform' providing integrated tools to capture, store, process and present data. Without it, companies can achieve volume, velocity, or variety, but not all three. Join us to learn the extreme capabilities needed to distill new business signals from big data.
Location: Grand Ballroom (NY Hilton)
James Markarian

James Markarian, Informatica (CTO & EVP Platform & Corporate Development)

James Markarian serves as executive vice president and chief technology officer at Informatica with responsibility for the strategic direction of Informatica products, including the delivery of the Informatica Platform. He also runs the corporate development group, including acquisitions. Previously, Markarian held the role of general manager of the Data Integration Business, where he led the delivery of both the flagship PowerCenter product and the Informatica Platform. Prior to joining Informatica in 1998, Markarian spent 10 years at Oracle Corporation, where he was architect for the Oracle Tools Division, responsible for cross-product architectural planning and integration decisions. He also served on Oracle’s Architecture Review Board. Markarian earned a bachelor’s degree in Computer Science and Economics and a master’s degree in Economics from Boston University.

9:45am Wednesday, 10/24/2012
Data integration for Big Data projects can consume up to 80% of the development effort, and yet too many developers reinvent the wheel by hand-coding custom connectors, data parsers, and data integration transformations. A metadata-driven, codeless IDE with pre-built transformations and data quality rules has proven to be up to 10X more productive than hand coding and easier to maintain.
Location: Grand Ballroom (NY Hilton)
Cathy O'Neil

Cathy O'Neil, Intent Media (Lead Data Scientist)

Cathy O’Neil earned a Ph.D. in math from Harvard, was a postdoc at MIT in the math department, and a professor at Barnard College where she published a number of research papers in arithmetic algebraic geometry. She then chucked it and switched over to the private sector. She worked as a quant for the hedge fund D.E. Shaw in the middle of the credit crisis, and then for RiskMetrics, a risk software company that assesses risk for the holdings of hedge funds and banks. Since this spring she’s been a data scientist for the startup media company Intent Media (http://www.intentmedia.com/).

9:35am Thursday, 10/25/2012
A fireside chat with Cathy O'Neil about why universities can't make data scientists. Lots of companies want to hire data scientists, and there aren't enough to go around. Some universities are adding data science graduate departments, but they're facing an uphill battle, thanks to a lack of good data for academics, political infighting, and scalability issues.
Location: Grand Ballroom (NY Hilton)
Mike Olson

Mike Olson, Cloudera (CSO and Chairman)

Mike (@mikeolson) co-founded Cloudera in 2008 and served as its CEO until 2013, when he took on his current role of chief strategy officer (CSO). As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment and direct engagement with customers. Prior to Cloudera, Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine. Mike spent two years at Oracle Corporation as vice president for Embedded Technologies after Oracle’s acquisition of Sleepycat in 2006. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies and Informix Software. Mike has a Bachelor’s and a Master’s Degree in Computer Science from the University of California, Berkeley.

8:55am Wednesday, 10/24/2012
Society confronts enormous challenges today: How will we feed nine billion people? How can we diagnose and treat diseases better, and more cheaply? How will we produce more energy, more cleanly, than ever before? Big questions like these demand new approaches, and "Big Data" is a crucial part of the toolkit we will use over the coming years to answer them.
Location: Grand Ballroom (NY Hilton)
Samantha Ravich

Samantha Ravich, National Commission for the Review of R&D Programs in the Intelligence Community (Co-Chair)

Dr. Samantha Ravich consults with both private industry and Federal and State governments on international security, financial, and political risk. She is also a Senior Advisor to The Chertoff Group and Co-Chairs the National Commission for Review of Research and Development Programs in the IC. From 2009-2011, Ravich was Senior Vice President at IPS, a software and analysis firm.

Ravich was Deputy National Security Advisor to Vice President Cheney and served in the White House for 5½ years during the 2001-2008 time period. Ravich was the Vice President’s representative on Asian and Middle East Affairs as well as on Counter-Terrorism and Counter-Proliferation. During her tenure in OVP, she also worked on Free Market and Democracy Promotion, spearheading a number of government wide initiatives.

10:00am Thursday, 10/25/2012
Samantha Ravich, former Deputy National Security Advisor to Vice President Richard Cheney, will discuss the challenges that face strategic decision makers from the wealth of data now provided by advances in technology.
Location: Grand Ballroom (NY Hilton)
John Schroeder

John Schroeder, MapR Technologies (CEO and Co-Founder)

John Schroeder founded MapR to address core limitations that were holding back the adoption of Hadoop and to expand Hadoop capabilities to serve new business-critical and real-time applications. John has led companies creating innovative and disruptive business intelligence, database management, storage and virtualization technologies at early stage ventures through success as large public companies. John was previously CEO of Calista Technologies (Microsoft), CEO of Rainfinity (EMC) and Senior Vice President of products and marketing at Brio Technologies (BRYO).

9:10am Thursday, 10/25/2012
This session will provide insights into how the combination of scale, efficiency, and analytic flexibility creates the power to expand the applications for Hadoop to transform companies as well as entire industries.
Location: Grand Ballroom (NY Hilton)
Sharmila Shahani-Mulligan

Sharmila Shahani-Mulligan, ClearStory Data (CEO & Founder)

Sharmila has spent 18+ years building game-changing software companies in a variety of markets. She has been EVP & CMO at numerous software companies, including Netscape, Kiva Software, AOL, Opsware, and Aster Data. She drove the creation of several multi-billion dollar market categories, including application servers, data center automation and big data analytics. She is on the board of Hadapt and Lattice Engines, advisor to numerous companies, large and small, and an active investor in early stage companies.

9:50am Wednesday, 10/24/2012
In recent years, "Big Data" has matured from a vague description of massive corporate data to a household term that refers not just to volume but also to the diversity of data and velocity of change. Today, there's a wealth of data trapped in corporate data repositories, new platforms like Hadoop, a new generation of data marketplaces and volumes generated hourly on the Web.
Location: Grand Ballroom (NY Hilton)
Rick Smolan

Rick Smolan, Against All Odds Productions (CEO)

Rick Smolan, a former Time, Life, and National Geographic photographer, is best known as the co-creator of the “Day in the Life” book series.

Today Smolan is the CEO of Against All Odds Productions, which orchestrates global photography projects that combine creative storytelling with state-of-the-art technology. Pulling together teams of leading journalists and photographers (plus hundreds of thousands of members of the general public) to focus on emerging topics such as the global water crisis, the effect of the Internet on civilization or how the human race is learning to heal itself, Smolan and his team produce bestselling illustrated books, TV specials, apps, and exhibits.

Many of Smolan’s books have appeared on the New York Times best-seller lists and have been featured on the covers...

8:50am Thursday, 10/25/2012
Over the past two decades, Rick Smolan, creator of the best-selling "Day in the Life" books, has produced a series of ambitious global projects in collaboration with hundreds of the world’s leading photographers, writers, and graphic designers. This year Smolan invited more than 100 journalists around the globe to explore the world of Big Data.
Location: Grand Ballroom (NY Hilton)
Julie Steele

Julie Steele, O'Reilly Media, Inc. (Acquisitions Editor)

Julie Steele is the Content Editor for Strata at O’Reilly Media. She is co-author of Beautiful Visualization and Designing Data Visualizations. She finds beauty in exploring complex systems, and thinks in metaphors. She is particularly drawn to the visual medium as a way to understand and transmit information.

Julie holds a Master’s degree in Political Science (International Relations) from Rutgers University in Newark. She lives in New York City, where she cooks, reads, designs, and practices yoga. You can find her blogging occasionally for O’Reilly Radar, or on Twitter.

9:35am Thursday, 10/25/2012
A fireside chat with Cathy O'Neil about why universities can't make data scientists. Lots of companies want to hire data scientists, and there aren't enough to go around. Some universities are adding data science graduate departments, but they're facing an uphill battle, thanks to a lack of good data for academics, political infighting, and scalability issues.
Location: Grand Ballroom (NY Hilton)
Ben Werther

Ben Werther, Platfora (Founder and CEO)

Ben Werther is the Founder & CEO of Platfora. He founded the company in 2011 to realize his vision of how Hadoop and Big Data Analytics will transform the way every business user uses data and move beyond the fiction, feeling and faith that underlie most business decisions.

Under Werther’s direction, Platfora has grown from an idea sketched on a napkin to one of the hottest enterprise startups in Silicon Valley and a leader of the Big Data Analytics category. Platfora’s mission is to empower customers to leverage Big Data Analytics to transform their businesses into Fact-Based Enterprises. Designed for business users, the company’s product is the first visual self-service platform for interactively and iteratively interrogating enormous amounts of data, and masking the complexity...

9:10am Wednesday, 10/24/2012
Hadoop is scalable, inexpensive and can store near-infinite amounts of data. But driving it requires exotic skills and hours of batch processing to answer straightforward questions. Learn how everything is about to change.
Location: Grand Ballroom (NY Hilton)

Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com.

Media Partner Opportunities

For information on trade opportunities, contact Kathy Yu at mediapartners@oreilly.com.

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com.
