Skip to main content

Strata + Hadoop World Keynotes

New Keynotes are added continuously. Please check back to see the latest updates to the program.

John Choi

John Choi, IBMDirector of Product Management

John Choi is Director of Product Management at IBM Software Group where he is responsible for product direction and strategy for the Big Data software portfolio. Prior to this role, he has held various product/portfolio management and strategy responsibilities in IBM including Information Management and WebSphere with a focusing on portfolio strategy and emerging technologies. John received his BA and MBA from Yale University.

9:40am Wednesday, 10/30/2013
What is Big Data? What will it mean for my organization? What technologies do I need? In this session, we will provide a view of what Big Data really means for organizations and how people, processes, and technologies, when brought together, can catalyze a transformational journey.
Full Details
Location: Grand Ballroom
Michael Chui

Michael Chui, McKinsey Global InstituteSenior Fellow

Michael Chui is a Senior Fellow of the McKinsey Global Institute. He is based in San Francisco, CA, where he directs research on the impact of information technologies, such as Big Data, Web 2.0 and the Internet of Things, on business and the economy. He co-authored the MGI report entitled “Big data: The next frontier for innovation, competition and productivity.” He has served clients in the High Tech, Media and Telecom industries on strategy, innovation and product development, IT, sales & marketing, M&A and organization. His research has been cited globally in publications such as the Wall Street Journal, New York Times, Financial Times, Fast Company, Forbes, The Economist, The Times of London, and Les Échos.

Michael holds a B.S. in Symbolic Systems from... Read More.

10:05am Tuesday, 10/29/2013
Michael Chui, Senior Fellow, McKinsey Global Institute
Full Details
Location: Grand Ballroom
Quentin Clark

Quentin Clark, MicrosoftCVP

As corporate vice president of program management for the Microsoft Data Platform Group, Quentin Clark oversees the design and delivery of the entire family of SQL Server products as well as the Azure Data Platform services. The Azure Data Platform is a complete end-to-end platform serving data management and processing capability, data integration and refinement, and business analytics as Microsoft Azure services and Microsoft Office and Office 365 offerings. Leading a team of technical engineers, his responsibilities include product direction and definition through program management, user experience and design, and customer engagement programs. This spans SQL Server’s work in all workloads – databases, integration and business intelligence, as well as the release forms of the product – software, appliances and the cloud services.

... Read More.
9:35am Tuesday, 10/29/2013
The idea that big data will transform businesses and the world is indisputable, but are there enough resources to fully embrace this opportunity? Join Quentin Clark, Microsoft Corporate Vice President, who will share Microsoft’s bold goal to consumerize big data - simplifying the data science process and providing easy access to data with everyday tools.
Full Details
Location: Grand Ballroom

Peta Clarke, Black Girls Code - NYTechnical Lead

10:00am Wednesday, 10/30/2013
Details to come..
Full Details
Location: Grand Ballroom
Alistair Croll

Alistair Croll, Solve For InterestingFounder

Alistair has been an entrepreneur, author, and public speaker for nearly 20 years. He’s worked on a variety of topics, from web performance, to big data, to cloud computing, to startups, in that time. In 2001, he co-founded web performance startup Coradiant (acquired by BMC in 2011), and since that time has also launched Rednod, CloudOps, Bitcurrent, Year One Labs, the Bitnorth conference, the International Startup Festival and several other early-stage companies.

Alistair is the chair of O’Reilly’s Strata conference; Techweb’s Cloud Connect; and the International Startup Festival. He’s written four books on analytics, technology, and entrepreneurship, including the best-selling Lean Analytics which is being translated into eight languages. He lives in Montreal, Canada and tries to mitigate chronic ADD by writing... Read More.

10:00am Tuesday, 10/29/2013
A presentation of the winners from the Strata New York + Hadoop World 2013 Startup Showcase.
Full Details
Location: Grand Ballroom
8:45am Tuesday, 10/29/2013
Program Chairs, Edd Dumbill and Alistair Croll, welcome you to the first day of keynotes.
Full Details
Location: Grand Ballroom
8:45am Wednesday, 10/30/2013
Program Chairs, Edd Dumbill and Alistair Croll, welcome you to the second day of keynotes.
Full Details
Location: Grand Ballroom
Doug Cutting

Doug Cutting, ClouderaChief Architect

Doug (@cutting) is the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera in 2009 from Yahoo!, where he was a key member of the team that built and deployed a production Hadoop storage and analysis cluster for mission-critical business analytics. Doug holds a Bachelor’s degree from Stanford University and sits on the Board of the Apache Software Foundation.

8:50am Wednesday, 10/30/2013
Doug will talk broadly about the future capability of Hadoop in the context of the road traveled so far. What are the limits of Hadoop? How should you think about workloads like SQL and Search? What's next?
Full Details
Location: Grand Ballroom
Edd Dumbill

Edd Dumbill, Silicon Valley Data ScienceVP Strategy

Edd Dumbill is a technology analyst, writer and entrepreneur based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.

Edd was the founding program chair for the O’Reilly Strata, and chaired the Open Source Convention for six years. He was the Founding Editor of the journal Big Data.

A startup veteran, Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the Pharmalicensing.com online intellectual property exchange.

An advocate and contributor to open source software, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.

Edd has written four books, including O’Reilly’s “Learning Rails”. He writes... Read More.

10:00am Tuesday, 10/29/2013
A presentation of the winners from the Strata New York + Hadoop World 2013 Startup Showcase.
Full Details
Location: Grand Ballroom
8:45am Tuesday, 10/29/2013
Program Chairs, Edd Dumbill and Alistair Croll, welcome you to the first day of keynotes.
Full Details
Location: Grand Ballroom
8:45am Wednesday, 10/30/2013
Program Chairs, Edd Dumbill and Alistair Croll, welcome you to the second day of keynotes.
Full Details
Location: Grand Ballroom
Shawndra Hill

Shawndra Hill, University of PennsylvaniaAssistant Professor

Shawndra Hill is an Assistant Professor in Operations and Information Management at the Wharton School of the University of Pennsylvania. Generally, she studies data mining, machine learning and statistical relational learning and their alignment with business problems. Specifically, she researches the value to companies of mining data on how consumers interact with each other — for targeted marketing, advertising and fraud detection. Here current research focusses on the interactions between TV content and Social Media. Her past and present industry partners include AT&T Labs Research, ClearForest, and Siemens Energy & Automation. Her recent work appears in IEEE Transactions on Data and Knowledge Engineering, Journal of Computational and Graphical Statistics, SIGKDD Explorations, and Statistical Science. Her research is funded in part by the Office... Read More.

9:20am Wednesday, 10/30/2013
In this keynote I will discuss how TV networks and advertisers can derive value from all of the online social activity about TV.
Full Details
Location: Grand Ballroom
Jim Kaskade

Jim Kaskade, InfochimpsCEO

Jim Kaskade is CEO of Infochimps, a Big Data subsidiary of CSC. The Infochimps Big Data Platform is an open-standards based analytics platform for private cloud deployments used by enterprise global 2000. It is recognized as the fastest way to deploy big data analytic environments.

Prior to Infochimps Jim was an Entrepreneur-in-Residence at PARC, a Xerox company, where he established PARC’s Big Data program. His work helped PARC understand how to best integrate its in-memory data processing technologies and high-performance data graph analytics to the burgeoning online services ecosystem, with a focus on predictive analytics for the retail sector. Jim also helped build PARC’s Private Cloud platform.

Jim also served as the SVP, General Manager and Chief of Cloud at... Read More.

9:50am Wednesday, 10/30/2013
Data and analytics is a means to an end. Jim highlights a new revolution of analytic applications with some touching examples in the healthcare industry with cancer research and medication therapy management.
Full Details
Location: Grand Ballroom
Josh Klahr

Josh Klahr, PivotalVP Data Platform Product Management

Josh has been working with data and analytics since 2000, including being the product manager for the first “Datamart in a Box” (Broadbase) and running product management for one of the largest Data and Analytics operations in the world (Yahoo!). Josh is now applying these learnings at Pivotal, where he is building the industry’s first unified Big Data and Analytics Platform.

9:05am Wednesday, 10/30/2013
Data is coming at us from everywhere – in small quantities, large magnitudes, and in almost every format. As Pivotal’s Vice President of Data Platform Product Management, Josh Klahr has the know-how to provide insights on how to build an organization that strategically manages this data in today’s modern and complex enterprise environments.
Full Details
Location: Grand Ballroom

Donna Knutt, Black Girls CodeTechnical Lead

Donna Knutt is a mom, host, and serial entrepreneur. She is the founder of LuxieLab.com, a Marketing & Web Design studio that specializes in creating professional websites for companies looking to plan, launch, or grow their business. She is also the Co-Tech Lead of the NY Chapter of Back Girls Code, a nonprofit dedicated to empowering young girls by teaching them to be innovators and leaders in STEM fields. When Donna isn’t brainstorming with other passionate techies and entrepreneurs, she’s usually travelling, exercising, or spending time with family and friends. You can find Donna on twitter (@donnaknutt) where she shares motivational messages, tips, & advice on living a full life through entrepreneurship & service.

10:00am Wednesday, 10/30/2013
Details to come..
Full Details
Location: Grand Ballroom
Roger Magoulas

Roger Magoulas, O'Reilly MediaResearch Director

Roger Magoulas is the research director at O’Reilly Media and chair of the Strata conferences. Roger and his team build the analysis infrastructure, and provide analytic services and insights on technology adoption trends to business decision-makers at O’Reilly and beyond. We find what excites key innovators and use those insights to gather and analyze faint signals from various sources to make sense of what others may adopt and why.​

9:55am Tuesday, 10/29/2013
Roger Magoulas, incoming Strata chair and Director of Research at O'Reilly, will share insights into the state of data science as a profession and preview Strata in 2014.
Full Details
Location: Grand Ballroom
Will Marshall

Will Marshall, Planet LabsCEO

Will is responsible for setting the company’s vision and and for architecting the company strategy. Previously, Will was a Scientist at NASA/USRA where he served as Co-Investigator for PhoneSat, Science Team member on the LCROSS and LADEE lunar missions. He led research projects in orbital space debris remediation. Will has published over 30 articles in scientific publications. Will received his Ph.D. in Physics from the University of Oxford and was a Postdoctoral Fellow at Harvard University.

9:30am Wednesday, 10/30/2013
Planet Labs is launching the largest ever fleet of Earth-imaging satellites in December. These will enable high resolution imagery of the entire planet to be taken on a more frequent basis. The data is of large potential value: humanitarian applications range from monitoring deforestation and the ice caps to disaster relief and improving agriculture yields in developing nations.
Full Details
Location: Grand Ballroom
Douglas Merrill

Douglas Merrill, ZestFinanceCEO and Founder

Dr. Douglas C. Merrill is the founder and CEO of ZestFinance, a financial services technology startup dedicated to serving the needs of the underbanked. He is also the author of Getting Organized in the Google Era, a book on personal and workplace organization published by Random House. Previously, Merrill was CIO and VP of Engineering of Google Inc. where he oversaw all aspects of internal engineering, including Google’s 2004 IPO. He most recently served as COO of New Music and President of Digital Business at EMI Music. Merrill holds an MA and Ph.D. in Psychology from Princeton University, and a BA from the University of Tulsa in Social and Political Organization.

10:05am Wednesday, 10/30/2013
Most people think success in big data analysis is about the right mix of vast amounts of data, mathematics and Ph.D.’s (oh my!). Those people are wrong. You need artistry too. This talk will provide some examples of "pure" ML failures and give suggestions on how to build an appropriately artistic team.
Full Details
Location: Grand Ballroom
Jack Norris

Jack Norris, MapR TechnologiesCMO

Jack leads worldwide marketing efforts for MapR. Jack has over 20 years of enterprise software marketing and product management experience in defining and delivering analytics, storage, and information delivery products. Jack has also held senior executive roles with EMC, Rainfinity, Brio Technology, SQRIBE, and Bain and Company. Jack earned an MBA from UCLA Anderson and a BA from Stanford University.

9:10am Tuesday, 10/29/2013
According to Gartner, Hadoop is near the top of the Hype Cycle. While some customers have questions about the enterprise capabilities of Hadoop, the answers are clear as production deployments continue to expand. This session will use successful customer experiences to highlight the power of Hadoop and separate the myths from reality.
Full Details
Location: Grand Ballroom
Mike Olson

Mike Olson, ClouderaCSO and Chairman

Mike (@mikeolson) co-founded Cloudera in 2008 and served as its CEO until 2013 when he took on his current role of chief strategy officer (CSO.) As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment and direct engagement with customers. Prior to Cloudera Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine. Mike spent two years at Oracle Corporation as vice president for Embedded Technologies after Oracle’s acquisition of Sleepycat in 2006. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies and Informix Software. Mike has a Bachelor’s and a Master’s Degree in Computer Science from the University of California, Berkeley.

... Read More.
8:55am Tuesday, 10/29/2013
As Hadoop and the surrounding projects & vendors mature, their impact on the data management sector is growing. Mike will talk about his views on how that impact will change over the next five years. How central will Hadoop be to the data center of 2020? What industries will benefit most? Which technologies are at risk of displacement or encroachment?
Full Details
Location: Grand Ballroom
David Parker

David Parker, SAPVice President, SAP Big Data

David is responsible for the solution packaging and pricing, definition and execution of the Big Data organization go-to-market strategy, education and enablement. David is a seasoned IT professional with over 28 years within the finance and banking industry, covering both business and technical areas.

David has also held several senior management positions in other industries including retail, telecommunications and academia, providing architectural solutions and consultative services for real-time analytics and data warehouse projects.

In early 2000, David joined Aleri as vice president of services to drive its Complex Event Processing (CEP) product into new markets in the UK, and subsequently moving to New York to help grow the company. David was responsible for customer adoption and success, the internal IT infrastructure and management of... Read More.

9:15am Wednesday, 10/30/2013
Big Data is impacting society in ways never possible before – enabling us all to gain insights that can transform the way we do business, work with others, and live our lives. SAP recognizes that this transformation needs grassroots support...
Full Details
Location: Grand Ballroom
Claudia Perlich

Claudia Perlich, DstilleryChief Scientist

Claudia Perlich serves as Chief Scientist at m6d and in this role designs, develops, analyzes and optimizes the machine learning that drives digital advertising to prospective customers of brands. An active industry speaker and frequent contributor to industry publications, Claudia enjoys acting as a guide in world of data and was recently named winner of the Advertising Research Foundation’s (ARF) Grand Innovation Award and was selected as member of the Crain’s NY annual 40 Under 40 list. She has published numerous scientific articles, and holds multiple patents in machine learning and won many data mining competitions. Prior to joining m6d in February 2010, Claudia worked in Data Analytics Research at IBM’s Watson Research Center, concentrating on data analytics and machine learning for complex real-world... Read More.

9:40am Tuesday, 10/29/2013
Coverage of online advertising fraud finally hit the newsstand a few months ago. But the story really started much earlier. Somewhat surprisingly it was predictive modeling on large data streams from real time bid environment that was the first to pick up symptoms of the yet largest online advertising scam. We tell the tale where models “too good to be true” lead to quite a sinister discovery.
Full Details
Location: Grand Ballroom
Foster Provost

Foster Provost, NYU | Stern Professor | NEC Faculty Fellow

Foster Provost is coauthor of the O’Reilly best-selling book, Data Science for Business (http://data-science-for-biz.com). He has designed data science solutions for businesses for over two decades, and has co-founded several successful companies focusing on data science for advertising (incl., Dstillery & Integral Ad Science). In his current job as Professor and NEC Faculty Fellow at the NYU Stern School of Business, Foster teaches in the MS in Data Science, MS in Business Analytics, MBA, and PhD programs. His data science research has won many awards and is broadly cited. He served as Program Chair for the ACM SIGKDD Conference and for many years as Editor-in-Chief for the journal Machine Learning.

10:15am Wednesday, 10/30/2013
Predictive analytics is one of the most mature areas of data science and an area where "big data" often is associated with competitive advantage. However, concrete results supporting the advantage conferred by big data are few and far between.
Full Details
Location: Grand Ballroom

Ken Rudin, FacebookDirector of Analytics

9:20am Tuesday, 10/29/2013
In this talk, Ken will discuss several best practices focused on getting the biggest impact from big data and driving a proactive, data-driven culture.
Full Details
Location: Grand Ballroom
Tony Salvador

Tony Salvador, Intel Corporation Director of Experience Insights Research Interaction & Experience Research Lab

Dr. Tony Salvador, Senior Principal Engineer, currently directs research in the Experience Insights Lab within Intel Corporation. His team’s role is to identify new, strategic opportunities for technology based on an understanding of fluctuating, global socio-cultural values. Tony leads a team of social scientists and business analysts to look for, find and develop viable opportunities to create local, sustainable value with new high tech products, services and infrastructures. His ongoing research interests concern disruptive innovation practice, development and new market creation with an ethnographic perspective.

Previously, he directed research for the Emerging Markets Platforms Group and was instrumental in the research and design of the Intel powered classmate PC. Prior to that he was a research scientist and co-founder of Intel’s People & Practices Group.

... Read More.
9:30am Tuesday, 10/29/2013
This talk will cover five major mobile trajectories for the next 10 years creating a brand new world : Seven billion futures, Hyper Digitization, Hyper Individualism, Hyper Collectivity & Hyper Differentiation.
Full Details
Location: Grand Ballroom
Sharmila Shahani-Mulligan

Sharmila Shahani-Mulligan, ClearStory DataCEO & Founder

Sharmila is CEO and founder of ClearStory Data. She has spent 18+ years building game-changing software companies in a variety of markets. Sharmila has been EVP & CMO at numerous software companies, including Netscape, Kiva Software, AOL, Opsware, and Aster Data. She drove the creation of several multi-billion dollar market categories, including application servers, data center automation and big data analytics. She is on the board of Hadapt and Lattice Engines, advisor to numerous companies, large and small, and an active investor in early stage companies.

9:45am Wednesday, 10/30/2013
Is your big data analysis constrained by slow cycles, specialist-only access, and a process of one-shot, big data analysis? Traditional approaches are painful, costly and tedious. See a whole new way to speed the cycle, converge and analyze diverse data, and interact on insights.
Full Details
Location: Grand Ballroom
Ben Werther

Ben Werther, PlatforaFounder and CEO

Ben Werther is the Founder & CEO of Platfora. He founded the company in 2011 to realize his vision of how Hadoop and Big Data Analytics will transform the way every business user uses data and move beyond the fiction, feeling and faith that underlies most business decisions.

Under Werther’s direction, Platfora has grown from an idea sketched on a napkin to one of the hottest enterprise startups in Silicon Valley and a leader of the Big Data Analytics category. Platfora’s mission is to empower customers to leverage Big Data Analytics to transform their businesses into Fact-Based Enterprises. Designed for business users, the company’s product is the first visual self-service platform for interactively and iteratively interrogating enormous amounts of data, and masking the complexity... Read More.

9:50am Tuesday, 10/29/2013
During the session attendees will learn how Big Data Analytics is the difference between fact-based enterprises and those focused on the shallow BI beauty contest.
Full Details
Location: Grand Ballroom

Sponsors

Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners
@oreilly.com

Press & Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts