New speakers are added continuously. Please check back to see the latest updates to the Strata Conference + Hadoop World program.
Joseph Adler has many years of experience in data mining and data analysis at companies including DoubleClick, American Express, and VeriSign. He graduated from MIT with an B.Sc. and M.Eng in Computer Science and Electrical Engineering. He is the inventor of several patents for computer security and cryptography, and the author of “Baseball Hacks” and “R in a Nutshell”. Currently, he is a senior data scientist at LinkedIn.
Jim Adler is the Chief Privacy Officer and General Manager of Data Systems at Intelius. As part of the executive leadership team, Jim is the chief consumer advocate for the company as well as responsible for the data systems that power Intelius’ products. Jim is a national leader on privacy, cryptography, and security issues having testified before the Federal Trade Commission, U.S. Congress, National Institute of Standards and Technology and leading academic institutions. Jim also serves on the advisory boards of the Future of Privacy Forum and Helios Voting. Jim blogs and tweets about privacy and data at jimadler.me and @jim_adler.
Prior to Intelius, Jim served as president and chief technology officer at Identity.net, an Internet company giving consumers control of... Read More.
Kiyan joined WibiData in 2011. He holds a BS in Computer Science from Penn State and a MS in Computer Science from Cornell. His graduate research focused on applying large scale data mining and machine learning to the areas of optimization and multi-agent systems. Kiyan enjoys writing, baking, comic books and video gaming when he gets the chance.
Jonathan Alexander is VP Engineering at Vocalocity, a leader in hosted VoIP for small businesses. Previously he built and managed software teams at vmSight which was acquired by Liquidware Labs, and Radnet which was acquired by Epiphany. He studied computer science at UCLA, and began his career writing software for author Michael Crichton. He is the author of Codermetrics: Analytics for Improving Software Teams, 2011 from O’Reilly.
He is the author of a number of books, and from time to time he also stands in front of cameras. You can often find him at conferences talking about interesting things, or deploying sensors to measure them. He recently rolled out a mesh network of five hundred sensors motes covering the entire of Moscone West during Google I/O. He’s still recovering.
Jesper develops experimental online services designed to introduce emotional contexts into online relationships, creating more authentic experiences. He is the co-founder of Bloom Studios, developing novel data interface applications for web and tablet platforms. He is also an accomplished data scientist, working on problems including home valuations for Trulia, credit card fraud for Visa, and social network analysis for Visible Path. Jesper speaks frequently at international technology and design conferences and has appeared in print and broadcast media for projects like Avoidr, Freerisk, and his Foursquare privacy hack. He holds a B.Sc. in Physics from Haverford College and an M.B.A. in Econometrics from University of Chicago.
I am a Curriculum Developer and Instructor with Cloudera. I run our local developers group. I like to find creative ways of using technology to solve practical problems.
Dr. Amr Awadallah
Amr is Co-Founder and CTO of Cloudera. Prior to Cloudera Amr was an Entrepreneur in Residence at Accel Partners. Before that he served as Vice President of Engineering at Yahoo!, and led a team that used Apache Hadoop extensively for data analysis and business intelligence across the Yahoo! online services. Amr joined Yahoo! after they acquired his first startup, VivaSmart, in mid-2000. Amr holds a Bachelor’s and Master’s degrees in Electrical Engineering from Cairo University, Egypt, and a Doctorate in Electrical Engineering from Stanford University.
Architect in the eXtreme Computing Group of Microsoft Research developing tools and cloud services for the exploration of data.
Dr. David W. Bauer Jr. has over 20 years of experience in the field of distributed computing, specializing in HPC, large-scale data analysis and experiment design. Dr. Bauer is a pioneer in the field of HPC, BigData and Cloud Computing; he developed a cloud computing platform for the Defense Information Systems Agency (DISA) in 2006, and currently is the Chief Architect and Technical Director for the DCGS-Army Standard Cloud. Dr. Bauer is only one of a handful of people in the world who has built applications harnessing the computing power of hundreds of thousands of processing elements, and regularly works with peta-scale data sets.
Stephanie Beben is an analytics engineer and developer at Booz Allen Hamilton with two years experience designing and implementing solutions to big data problems using cloud technologies for U.S. government clients.
Prior to joining Booz Allen Hamilton, Stephanie received a M.S. in Mathematics from Texas A&M University.
Collin Bennett is a principal at Open Data
Group. In three and a half years with the company, Collin has worked on the open source Augustus scoring engine and a cloud-based environment for rapid analytic prototyping called RAP. Additionally, he has released open source projects for the Open Cloud Consortium. One of these, MalGen, has been used to benchmark several parallel computation frameworks. Previously, he led software development for the Product Development Team at Acquity Group, an IT consulting firm head-quartered in Chicago. He also worked at startups Orbitz (when it was still was one) and Business Logic Corporation. He has co-authored papers on Weyl tensors, large data clouds, and high performance wide area cloud testbeds. He holds degrees in English,... Read More.
As the Director of Data Visualization at GfK Custom Research (a global market research firm), Bitsy designs data visualization applications to tell compelling stories about research data. In addition to developing new methodologies and templates for current GfK design processes, she also educates colleagues on current and emergent visualization tools, techniques and best practices.
Prior to joining GfK she worked as a freelance consultant, designing and hand coding interactive data displays for technology companies as well as traditional market research firms.
Bitsy has six years of experience in the market research industry, and holds a B.F.A. in Industrial Design from the University of Wisconsin Stout.
Val joined NetApp in 1998 and leads the Strategic Planning Team within the office of the CTO. Working with customers, analysts, and alliance partners, Val focuses on next-generation research projects and is responsible for NetApp’s product vision.
As an early Big Data adopter, Val chairs the Enterprise Data Architecture Track of the Hadoop Summit. Val also introduced the first Cloud Standard to the industry as chairman of SNIA’s Cloud Storage Initiative, whose mission is to foster the growth and success of the cloud storage market. Previously, Val served as the vice-chair of SNIA’s Solid-State Storage Initiative.
Val has over 25 years of IT industry experience spanning NCR (AT&T) and Cognos (IBM), with 12 years in storage at EMC and... Read More.
Software Engineer at Cloudera, currently focused on the Apache HBase project.
Kevin Beyer is the Principal Architect at Platfora with 20 years of experience in building database systems. As a Research Staff Member at IBM, he created Jaql, a scripting language for large-scale, semi-structured data processing on Hadoop. Prior to the Jaql project, he added XML indexing support to IBM DB2. His Ph.D. dissertation at the University of Wisconsin focused on analytical query processing.
Before the Julia effort began, Jeff Bezanson worked as a software engineer at Interactive Supercomputing, which developed the Star-P parallel extension to MATLAB. At the company, Jeff was a principal developer of M#, an implementation of the MATLAB language running on .NET. He is now a second-year graduate student at MIT. Jeff received an A.B. in Computer Science from Harvard University in 2004, and has experience with applications of technical computing in medical imaging.
As Director of Program Management of the Database Systems Group at Microsoft, Shawn Bice has responsibility for Azure SQL Database, Big Data & SQL Server RDBMS, including guiding the vision, strategy, engineering and market delivery.
As part of his charter, Shawn is focused on delivering Microsoft’s core data platform solutions and services that allow IT Professionals, DBAs, and Developers to successfully Develop, Deploy, and Manage data applications across the SQL Azure platform and SQL Server workload.
Since joining Microsoft in 1997, Shawn has served in engineering leadership positions across the Server and Tools Business (STB) including Windows Server, System Center, SQL Server, Azure SQL Database and spent a tour in field sales, affording him the opportunity to... Read More.
David Blair is a Principal Software Engineer at Akamai where he works on the Akamai Data Platform. He has been working with scalable data processing applications for over 10 years both at Akamai and in his previous roles of Director of Product Architecture at MetraTech, Inc. and Director of Engineering at Torrent Systems. He has a PhD in Mathematics from Brandeis University and B.S. in Mathematics from University of California, Berkeley.
Ron founded Think Big Analytics to help customers leverage new data processing technologies like Hadoop and NoSQL databases and R for statistical analysis. Works with customers to identify opportunities and rapidly develop solutions that integrate data and extract information.
Previously Ron was the VP of Engineering for Quantcast. Each day Quantcast ingests 10 billion events and processes two petabytes of data using Hadoop. The Quantcast map reduce stack handles production data processing, ad hoc analysis, data mining and machine learning. Prior to that Ron was a founder of enterprise consulting companies C-bridge and New Aspects.
David is passionate about helping people make better decisions using data. He is SVP of insight for the EMI Music Group where he setup and now oversees the team that delivers regular, large scale research to EMI in 25 countries. The insight from more than 750,000 interviews in the last couple of years has reached the heart of decision-making at EMI and helps—alongside skills and judgement—with everything from signing new artists to product development and marketing plans for EMI’s biggest artists like The Beatles and Pink Floyd. He also runs insight at zeebox which is using rich consumer surveys and detailed user information to drive product development and product / relationship personalisation.
He has worked in London for New Philanthropy Capital... Read More.
I’ve been designing and building innovative information visualizations for 20 years, ranging from one of the first interactive 3D financial visualizations on the web in 1995 to visualizations embedded in financial data systems used every day by thousands of market professionals.
Alice Brennan is an award winning radio and television journalist with ten years experience at Australia’s public broadcaster, the ABC. She has hosted, produced and researched her own radio shows and news items on issues from indigenous housing to politics to drug taking. Alice has trained journalists from all over Asia and she’s freelanced for NPR, Deutsche Welle and the BBC. A graduate MA in political journalism at Columbia University, Alice has been working with The New York World, a digital government accountability website based out of the school. She’s enamored with the journalistic power of data and is scrambling madly to learn how to code!
Kurt leads the Data Science & Engineering (DSE) Platform team at Netflix. His group architects and manages the technical infrastructure that enables Netflix’s data-centric decision making. The Netflix DSE infrastructure includes both traditional BI tools (e.g. Teradata and MicroStrategy) and various Big Data technologies (e.g. Hadoop, Hive, and Pig).
Ryan is a Distinguished Engineer with Cerner Corporation, one of the leading healthcare IT companies worldwide. He has built infrastructure for healthcare systems over the past decade, and currently is leading the design of Cerner’s big data infrastructure. Ryan also dabbles in writing, such as contributing a couple chapters to the book 97 Things Every Programmer Should Know.
Avi Bryant founded the company behind Dabble DB, where he also hacked on Seaside, MagLev, and other tasty Smalltalk treats. After that company’s acquisition by Twitter, he spent a while building data tools and products for their ads team. He’s now an engineering manager at Etsy.
Jim Caputo is the technical lead for Google’s BigQuery team, and heads the engineering efforts to externalize Google’s large scale data processing for developers and enterprise customers. Prior to his tenure at Google, Jim worked on product teams at Expedia and Microsoft.
Ms. Casey is Principal at CDO, LLC, a boutique consultancy supporting the development of large-scale enterprise information management, data governance, and data security strategic plans and implementation efforts. CDO, LLC helps both private and public sector organizations create customer-centric approaches to information and service delivery; improve information accessibility; streamline and link cross-departmental business processes; reduce information fragmentation; maximize the return on funding investments in information architecture and application development; produce accurate and actionable business intelligence and performance metrics; and, improve risk management and cyber security postures. Ms. Casey also authors the dataTrending blog, at www.dataTrending.wordpress.com .
Prior to CDO, LLC, Ms. Casey was the first state Chief Data Officer in the country, and part of the Governor’s Office in... Read More.
Scott Chastain, Engineering Manager, Information Management and Delivery, SAS
Scott empowers the SAS Americas sales and technical groups with the architecture, strategy and implementation of SAS’ business analytics infrastructure. He has direct responsibilities for information management, visualization and business intelligence.
Nagaraju Chayapathi is a Data Integration Architect at Paypal. He has 18+ years’ data management experience, creating and implementing data architectures, leading Integration Competency Centers (ICCs), and developing strategies for data integration, data governance, data profiling, data masking, metadata management, and other data related initiatives. At Paypal he architected a unified, scalable data acquisition and delivery platform feeding an enterprise data warehouse and a wide array of data marts in both real-time and batch. He also spearheaded a transition from several legacy technologies to Informatica for a single data integration and ETL platform. Prior to working at Paypal, Nagaraju held various BI and data warehouse manager and architect positions at industry leading companies including CSC, Cisco Systems, Perot Systems, Wells Fargo, and AT&T.... Read More.
Lynn Cherny has a Ph.D. from Stanford and an M.Phil. from Cambridge University. She has 19 years of experience managing and doing interface design and data analysis at companies including TiVo, Excite, Adobe, The MathWorks, and AT&T Labs. Her consulting work now focuses on data analysis and visualization.
Stephen Coller is a Senior Program officer in the Next Generation Models team. In this role Stephen sponsors current and emerging technologies and platforms that help children to attain college readiness and enable system transformation in the process.
Stephen joined the Foundation in July 2010 from Microsoft, where he worked as a Senior Director of Technology Policy and Strategy. In this role Stephen helped develop Microsoft’s vision for the connection between technology and core societal issues and convert that vision into a set of policy recommendations for senior government leaders and regulatory bodies around the world. In the U.S., Stephen served on Secretary Duncan’s National Education Technology Plan working group. Earlier in his career at Microsoft, he worked on virtual server, cloud platform, and application... Read More.
Rob Coneybeer brings to Shasta Ventures deep experience in building early-stage companies. Prior to co-founding Shasta Ventures, he was a general partner at New Enterprise Associates (NEA) where he led 15 investments in core infrastructure technologies spanning semiconductors, software and networking equipment.
Prior to joining the venture capital industry, Rob served as a lead integration and test engineer in the Astro Space division of Martin Marietta. While at Martin Marietta, Rob helped build the first EchoStar spacecraft.
Deborah M. Cooper possesses over 15 years of industry experience in successfully applying market strategy and customer analytics within financial services, insurance, and life sciences. At Fidelity Investments, she leveraged a customer-focused data architecture strategy to support product growth, advanced analytics to increase retirement season sales by 40%, and KPI development to align the highly-matrixed Retail organization to acquisition goals. With Liberty Mutual Group, she developed a competitive intelligence function and market-sizing strategy for a national and middle market client base.
Most recently, Ms. Cooper served as Vice President of Marketing Applications & Analytics for Putnam Investments. Prior to her work with financial services, she authored and presented research findings in molecular genetics and medical demand modeling. Ms. Cooper holds a Masters degree in... Read More.
Terence Craig is CEO and CTO of PatternBuilders, a big data analytics companies that produces advanced applications for financial services, retail and other data intensive industries.
Terence has an extensive background in building, implementing, and selling analytically-driven enterprise applications across such diverse domains as enterprise resource planning (ERP), retail sales channel optimization, professional services automation (PSA), and semi-conductor process control and analytics in both public and private companies. He has been part of the ERP/SCM industry as it has evolved, from the VAX and HP 3000 to its current heyday of client-server, GUIs, and relational databases and is looking forward to exploring what the next generation of solutions, powered by the Internet of Things and big... Read More.
Alistair has been an entrepreneur, author, and public speaker for nearly 20 years. He’s worked on a variety of topics, from web performance, to big data, to cloud computing, to startups, in that time. In 2001, he co-founded web performance startup Coradiant (acquired by BMC in 2011), and since that time has also launched Rednod, CloudOps, Bitcurrent, Year One Labs, the Bitnorth conference, the International Startup Festival and several other early-stage companies.
Alistair is a chair for Strata + Hadoop World conferences; Techweb’s Cloud Connect; and the International Startup Festival. He’s written four books on analytics, technology, and entrepreneurship, including the best-selling Lean Analytics which is being translated into eight languages. He lives in Montreal, Canada and tries to mitigate chronic ADD... Read More.
Doug (@cutting) is the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera in 2009 from Yahoo!, where he was a key member of the team that built and deployed a production Hadoop storage and analysis cluster for mission-critical business analytics. Doug holds a Bachelor’s degree from Stanford University and sits on the Board of the Apache Software Foundation.
Raanan is responsible for Big Data Product Marketing at Splunk. He is focused on helping customers harness their big machine data to gain operational intelligence and driving an effective market-driven product strategy. Raanan has spent the past 10 years focused on data, information and distributed systems. Prior to Splunk, Raanan worked at Cloudera where he was a Hadoop sales engineer and before that, spent 9 years as an engineer and sales engineer at Oracle. Raanan is a certified Hadoop developer and administrator and holds a BS in Computer Science and MS in Management.
Jianyong Dai is a Apache Pig PMC member/committer and worked on Pig for almost 3 years at Yahoo and later at Hortonworks. I received my PhD in computer science specialize in computer security, data mining and distributed computing from University of Central Florida. I am interested in data science, large scale processing, Hadoop, Pig, HCatalog, Hive, and more.
Richard co-founded Pentaho in 2004 and is responsible for strategic initiatives, customer and partner relationships and leading product strategies including Big Data, customer adoption and Cloud Analytics. Richard has held key executive management positions in the business intelligence software market for over 20 years, starting his career at IBM. Richard was a Co-Founder at AppSource Corporation (acquired by Arbor Software which later merged into Hyperion Solutions) and Keyola (acquired by Lawson Software) and is an avid water skier.
Danielle is currently a Data Scientist at Nokia, and a doctoral student in Quantitative Psychology at the University of North Carolina at Chapel Hill. Danielle enjoys working to advance and apply statistical methods to address complex business questions, especially using random effects models and mixture models. Danielle is specifically interested in studying issues that arise when modeling developmental phenomenon over time and in the integration of different modeling frameworks. At Nokia, she teams with other data scientists in Nokia’s Insights group to deliver descriptive and predictive insights into Nokia’s new centralized data asset and into business analytic projects and to deliver visualizations of the data and their predictions.
Marc Demarest is CEO and a principal in Noumenal, Inc., an international management consulting firm based in the Pacific Northwest and the UK that provides a range of management and technical consulting services to high-tech, biotech, nanotech, and greentech firms. Widely known as an early proponent of data marting and tiered enterprise data warehousing models, Demarest is currently writing a book on nontechnical aspects of enterprise data warehousing.
Brian is a developer that runs marketing at GitHub, which often involves analyzing a lot of interesting data.
Michael is a serial entrepreneur who has built data platforms in life sciences, online retail, and digital media over the past decade. Michael began his career as a software engineer for Human Genome Project, and later founded CustomInk.com, an early pioneer in customized apparel. More recently, Michael conceived and led Dataspora, where he remains Chairman, helping deliver data science to telcos, insurance firms, and retail banks. Michael holds an A.B. from Harvard College and a Ph.D. in Bioinformatics from Boston University. He enjoys speaking and writing about big data and analytics.
Founder, CTO and SVP, Software Engineering
Kenneth Duda is a pioneer in high-performance networking software and lead architect of Arista Networks EOS, a stateful modular operating system for all Arista Networks products. He is also the co-author of network virtualization specifications including VXLAN with VMware and NVGRE with Microsoft. From 2005 to 2008, Ken was also the Acting President of Arista Networks.
Prior to joining Arista Networks, Ken was the CTO at There.com, where he played a lead role in designing a real-time 3-D distributed system that scaled to thousands of simultaneous users. Ken was also the first employee of Granite Systems and led the software development effort for the Catalyst 4000 product line after the acquisition by... Read More.
Kathleen P. Duff is an attorney licensed to practice in New York State. She holds an A.B. in History from Dartmouth College (‘02), cum laude, as well as a J.D. from Vanderbilt University Law School (’06). During law school, Ms. Duff was awarded the Vanderbilt Scholastic Excellence Award three separate times for earning the highest marks in each of Corporations & Business Entities, Professional Responsibility, and Legal Writing I. Ms. Duff began her career as an associate in the New York office of Skadden, Arps, Slate, Meagher & Flom LLP, where her practice focused on antitrust and commercial litigations. She also served as an associate at Richards Kibbe & Orbe LLP, where she worked on complex civil litigations. Ms. Duff joins the panel... Read More.
Edd Dumbill is a technology analyst, writer and entrepreneur based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.
A startup veteran, Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the Pharmalicensing.com online intellectual property exchange.
An advocate and contributor to open source software, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.
Edd has written four books, including O’Reilly’s “Learning Rails”. He writes... Read More.
Serial startup and artist and open-source innovator, particularly interested in large data systems and statistical modeling.
An Apache Cassandra committer, Gary Dusbabek is a life-long programmer specializing in distributed systems. His past experience includes working with large-scale text and image indexes in the newspaper industry and high-volume advertisement booking software. He currently works on the Cloud Monitoring team at Rackspace.
Gabriel is a computer science student and works as an Software Architect in Mercadolibre (NASDAQ MELI) since 2007.
He is tasked with researching technologies and developing projects.
In the last year he has specialized in big data analysis, implementing Mercadolibre’s hadoop cluster.
Dan is the Managing Partner for Corporate Strategy, Innovation and Technology at Accenture and an Executive on Grounds at the McIntire School of Commerce of the University of Virginia.
Most recently, he has focused on the impact of cloud technologies on the firm’s businesses. Previously, he was the Managing Partner for the Communications Industry in Accenture’s Communications & High Tech Global Operating Group. In this role, he helped drive Accenture’s business in the wireline, wireless, and cable segments.
Mr. Elron is a member of the IEEE and ACM. He also serves as the adviser for the information technology industries for the World Economic Forum and as the Associate Director of the Center for the Management of Information Technology at the University of Virginia.... Read More.
Product manager at Cloudera, the standard for Hadoop, for HDFS, HBase, and part of Hive. Previously lead development of the new high availability and disaster recovery solution for Microsoft SQL Server 2012 and a Stanford University graduate.
Tim Estes is the Chairman, CEO, and founder for Digital Reasoning. Tim’s academic work at the University of Virginia focused in the areas of Philosophy of Language, Mathematical Logic, Semiotics, Epistemology, and Phenomenology. It was that eclectic academic background, coupled with the belief that in the future all software would learn from data as a core capability that gave rise to Digital Reasoning.
Tim and his team at Digital Reasoning work closely with leaders in government and industry to solve extraordinarily valuable and morally compelling problems in National Security, Finance, Legal, and Health Care by automating the understanding of unstructured data.
Robert Faludi is the Collaborative Strategy Leader in R&D for Digi International, with a mandate to forge stronger connections with the community of innovators, discover outstanding new work, contribute to outside projects, and support the people making that work. Faludi is also a professor in the MFA program at the School of Visual Arts in Manhattan and in the Interactive Telecommunications program at NYU. He specializes in behavioral interactions through physical computing and networked objects. Rob is the author of Building Wireless Sensor Networks, with ZigBee, XBee, Arduino and Processing published by O’Reilly Media, 2011. His work has appeared in The New York Times, Wired Magazine, Good Morning America, BBC World, the Chicago Museum of Science & Industry and MoMA... Read More.
Lee Feinberg is the founder of DecisionViz, a company designed to helped organizations escape the legacy of reporting data by transforming complex data into simple pictures for making decisions. Industry leaders frequently ask Lee to address domestic and international audiences, most recently including O’Reilly Media Strata, Tableau Software European and North American Customer Conferences, CBS Interactive, and WARC. Lee is also the author of the BeyeNETWORK expert channel, HD Visualizations, Driving BI and Analytics. Lee most recently worked at Nokia where he introduced Tableau Software into the company and led the effort to launch Tableau across the enterprise. He served on Tableau’s customer advisory board and is now a Tableau Partner, as well as the founder of the NJ and NY Tableau User... Read More.
Michael P. Flowers is Analytics Director for the Mayor’s Office of Policy and Strategic Planning and Director of the Financial Crime Task Force of the City of New York. Prior to joining the Bloomberg Administration, Mr. Flowers was Counsel to the U.S. Senate Permanent Subcommittee on Investigations for the 110th and 111th Congress, where he led bipartisan investigations into off-shore tax haven abuses; failures in the mortgage-backed securitization market by U.S. investment and commercial banks and government agencies; and deceptive financial transactions by the North Korean government. From March 2005 to December 2006, Mr. Flowers was Deputy Director of DOJ’s Regime Crimes Liaison’s Office in Baghdad, Iraq, supporting the investigations and trials of Saddam Hussein and other high-ranking members of his regime. From 2003 to... Read More.
Kevin Foster is a Big Data Solution Architect at IBM. His focus is in helping IBM customers and partners to survive and prosper from the large amounts of “big data” generated by mobile phones, smart grids, city traffic, hospitals and other increasingly computerized aspects of our modern lives. Kevin has a Bachelors degree in Mathematics from California State University Stanislaus and a Masters degree in Computer Science from Stanford University.
Currently the Director of Infrastructure Engineering and Lead Architect at Rent the Runway. Previously a Vice President at Goldman Sachs. Apache ZooKeeper committer and PMC member.
Steve Francia leads the developer experience of MongoDB at 10gen. This includes drivers, web, technical writing, integration and evangelism. Steve brings to this role his experience as VP of engineering at OpenSky where he build the worlds first e-commerce site powered by MongoDB and one of the first PHP sites backed by MongoDB. Steve has been an engineer, entrepreneur and executive since 1995 when he was responsible for one of the first ecommerce sites while working for American Telecom.
Steve loves building things, teaching and speaking. He has been involved in open source for over 15 years and prefers to code in Vim. He is the author and contributing author of two O’Reilly books on databases (and development). He has spoken at dozens of... Read More.
Hjalmar is a serial entrepreneur, founder of four startups in the gaming, mobile and web sectors since 1996. His company, DataMarket, provides information companies with tools to effectively publish their data and reach new audiences. Their data portal, DataMarket.com, may be the largest collection of open statistics and numerical data available online. DataMarket is based largely on Hjalmar’s vision of the need for a global exchange for such data.
Michael, co-founder of farsite, directs farsite’s overall strategy and manages business development.
Michael has a bachelor’s degree in classics from the University of Michigan and a master’s degree in history from the University of Minnesota. Michael also has an MBA from Fisher College of Business at The Ohio State University. He has worked as a White House Intern in the Office of Public Liaison Business Affairs Desk, and also for SmartBrief, a Washington-DC based start-up that publishes daily B2B news briefs.
Anthony is the Founder and CEO of Kaggle. He assists companies with framing modeling tasks as data prediction
competitions, ensuring that competitions reflect real-life projects. Before founding Kaggle, Anthony worked in the
macroeconomic modelling areas of the Reserve Bank of Australia and before that the Australian Treasury. In these
roles, Anthony was responsible for building macroeconomic models, generating economic forecasts and simulating
the impact of changes in interest rates and fiscal policy on the Australian economy. Anthony holds a first class
honours degree in economics and econometrics from the University of Melbourne and has published in The Economist
magazine and the Australian Economic Review.
As Chicago’s first Chief Data Officer, Brett Goldstein’s role is to lead the city’s approach to and use of data to help improve the way the government works for its residents. Thus, the responsibilities of the Chief Data Officer are threefold: promoting transparency through open data, constructing a comprehensive strategy for citywide data usage and storage, and facilitating data-driven decision-making through data analytics, including groundbreaking predictive analytics.
Before coming to City Hall as Chief Data Officer, Brett Goldstein founded and directed the Chicago Police Department’s Predictive Analytics Group, which aims to predict when and where crime will happen. Goldstein is a former Commander in the Chicago Police Department. Previously, Goldstein was an early employee with OpenTable. He earned his Bachelor’s degree from Connecticut College, his... Read More.
Daniel L. Goroff is a Program Director at the Alfred P. Sloan Foundation who is especially interested in economics, finance, mathematics, the scientific and technical work force, and education. He is also Professor Emeritus of Mathematics and Economics at Harvey Mudd College in Claremont, California, where he previously served as Vice President for Academic Affairs and Dean of the Faculty.
Goroff earned his B.A.-M.A. degree in mathematics summa cum laude at Harvard as a Borden Scholar, an M.Phil. in economics at Cambridge University as a Churchill Scholar, a Masters in mathematical finance at Boston University, and a Ph.D. in mathematics at Princeton University as a Danforth Fellow. His first faculty appointment was at Harvard University in 1983. During over two decades there, he rose to... Read More.
Sean Gourley, Quid co-founder and CTO, did research into the
mathematics of war for his PhD thesis at Balliol College, Oxford. His findings appeared as the featured article in “Nature” (December 2009) and were the subject of a popular TED talk (2009). His work on statistical analysis, probability, and algorithm development applied to complex systems and large datasets inspired the creation of Quid. Sean is a Rhodes Scholar PhD in Physics (Complexity) from the University of Oxford; his is undergraduate degree in Physics is from the University of Canterbury, Christchurch, New Zealand.
Ilya Grigorik is a web peformance engineer and advocate at Google, an open-source evangelist, an analytics geek, and a proverbial early adopter of all things digital. Prior to focusing on web performance Ilya was the founder and CTO of PostRank, a social analytics company which became the core of social analytics within Google Analytics.
Mr. Groom is responsible for the Kognitio Analytics Center of Excellence, a group that includes some of the world’s finest experts in business analytics, data warehousing and business intelligence. In this role, Paul leads the pre-sales technical consulting team and is crucial to the development of Kognitio as a fully-integrated analytical platform. Paul has over twenty years’ experience working with Very Large Databases (VLDBs) with in depth knowledge of databases and data warehouse technologies. He began his career on a database research project at the UK national mapping agency, Ordnance Survey, followed by five years as a Lead Systems Engineer providing pre- and post-sales technical support in the UK and Europe within the then-leading vendors of database machines: Britton Lee (ShareBase), and Teradata/NCR.
In... Read More.
Robert Grossman (@bobgrossman) is the Founder and a Partner of Open Data Group, which specializes in building predictive models over big data. He is a Core Faculty and Senior Fellow at the Institute for Genomics and Systems Biology (IGSB) and the Computation Institute (CI) at the University of Chicago. He has led the development of new open source software tools for analyzing big data, cloud computing, data mining, distributed computing and high performance networking. Prior to starting Open Data Group, he founded Magnify, Inc. in 1996, which provides data mining solutions to the insurance industry. Grossman was Magnify’s CEO until 2001 and its Chairman until it was sold to ChoicePoint in 2005. He blogs about big data, data science, and... Read More.
Kristian Hammond is CTO of Narrative Science, a Chicago start up focused on the generation of narratives from data. He is also a professor of Computer Science and Journalism at Northwestern University and a researcher in the areas of human-machine interaction, context-driven information systems and artificial intelligence. In 1998, Kris founded Northwestern University’s Intelligent Information Laboratory (InfoLab) where his team is creating technology that bridges the gap between people and the information they need. From 2000 to 2001, Dr. Hammond also enjoyed a run as the weekly technology correspondent for WTTW’s Chicago Tomorrow. Kris received his PhD from Yale University.
Roy Hyunjin Han is a Computational Engineer at CrossCompute, working on mathematical models to help cities run effectively. Previously, he worked with the Modi Research Group at the Earth Institute of Columbia University to develop an electricity infrastructure planning system currently used by countries in Africa and a remote sensing system that finds buildings in satellite images using convolutional neural networks. He holds a M.S. in machine learning and a B.S. in mathematics and neuroscience.
Joseph M. Hellerstein is a Chief Strategy Officer at Trifacta and Chancellor’s Professor of Computer Science at UC Berkeley. His work focuses on data-centric systems and the way they drive computing. He is an ACM Fellow, an Alfred P. Sloan Fellow and the recipient of three ACM-SIGMOD Test of Time awards for his research. He has been listed by Fortune Magazine among the 50 smartest people in technology, and MIT Technology Review included his work on their TR10 list of the 10 technologies most likely to change our world.
Bjoern is the founder of the Startup Genome and Startup Compass. In early 2011 he & his team set out to decrease the massive failure rate of startups and SMEs to accelerate the pace of innovation by transforming the business world from gut driven to data driven. Before Bjoern founded and led 4 for and non profit ventures in Germany, Bangladesh & the US. He also worked as an executive in Russia for one year for a mid sized marketing company. His personal passion or “leitmotif” is: unleashing human potential. His higher purpose is to strive to maximize the capitalization of every single individual. In his free time he speaks at conferences to pass on his lessons learned.
Rich Hickey, the author of the Clojure programming language and designer of the Datomic database system, is a software developer with over 20 years of experience in various domains. Rich has worked on scheduling systems, broadcast automation, audio analysis and fingerprinting, database design, yield management, exit poll systems, and machine listening, in a variety of languages.
Steven Hillion has been leading large engineering and analytics projects for fifteen years. Before joining Alpine Data Labs, he founded the analytics group at Greenplum, leading a team of data scientists and also designing and developing new open-source and enterprise analytics software. Before that, he was Vice President of Engineering at M-Factor, Inc. (acquired by DemandTec) where he built analytical applications that became a global standard for demand modeling. Earlier, at Kana Communications, Steven led the engineering group during the two largest releases of its flagship product. At Scopus Technology (later Siebel Systems) he co-founded development groups for finance, telecom and other verticals. He received his Ph.D. in mathematics from the University of California, Berkeley, and was a King Charles I Scholar at Oxford University.... Read More.
Sheridan has more than 15 years experience in Business Intelligence, Analytics, Web, Product Management, Business Strategy and Organization Effectiveness. He is currently VP of Data Products at Auction.com, the leading online real estate auctioneers in the United States.
He has also held positions at two leading online ganmes companies, Kabam and Playfirst, where he has built out Analytics and Big Data groups from inception. Earlier on in his career, Sheridan worked at Procter and Gamble in the areas of Decision Support and Executive Information Systems. He also worked as a Managing Consultant at Towers Perrin where he led many business strategy and organizational effectiveness engagements for large companies.
Sheridan holds a Bachelor of Arts degree in mathematics from Cambridge University and an MBA from... Read More.
High Performance Analytics Team Lead for Monsanto’s R&D IT
Born in England, Nigel Holmes studied illustration at the Royal College of Art in London and then freelanced for magazines and newspapers for 12 years in London before coming to New York in 1978 to work for Time Magazine. He became graphics director and stayed there for 16 years.
At Time, his pictorial explanations of complex subjects gained him many imitators and a few academic enemies who thought he was trivializing information. But he remains committed to the power of pictures and humor to help readers understand otherwise abstract numbers and difficult scientific concepts.
Since 1994 he has run his own business, Explanation Graphics, explaining all sorts of things for a variety of clients. These have included American Express, The Smithsonian Institution and United Healthcare.... Read More.
Jonathan is a Software Engineer with Cloudera, currently focused on the Apache HBase project. He is an Apache HBase committer and PMC member, as well as a committer on the Apache Sqoop project, and a committer and founder of the Apache Flume (incubating) project. Jonathan has an M.S. in Computer Science from University of Washington and also has an M.S. and a B.S. in Electrical and Computer Engineering from Carnegie Mellon University.
Dr. Stan Humphries is the Chief Economist of Zillow Inc. (NASDAQ: Z), the leading real estate information marketplace. Stan joined the company as one of its earliest employees in 2005 and created the Zestimate and its first algorithm. Since that time, Stan has built out the industry-leading economics and analytic team at Zillow. Prior to joining Zillow, Stan spent five years at Expedia where he ran the advanced analytics team. Before Expedia, Stan served as a researcher and faculty member at the University of Virginia, and was previously a Presidential Management Fellow where he served at NASA, the Office of Science and Technology Policy in the Executive Office of the President, and the Technology Administration within the Department of Commerce. Stan has also... Read More.
Noah Iliinsky strongly believes in the power of intentionally crafted communication. He has spent the last several years researching, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. He is a frequent speaker in both industry and academic contexts. He has a master’s in Technical Communication from the University of Washington, and a bachelor’s in Physics from Reed College. Noah works as a Visualization Expert at IBM’s Center for Advanced Visualization.
Nilesh Jain is Sr. Research Scientist with the Cluster Computing Architecture team in Intel Labs. His current research focus is on emerging frameworks for large-scale machine learning and big data analytics. His other research interests include systems architectures and technologies that improve scaling, performance, and power consumption of distributed parallel computing. Before joining Intel Labs in 2007, Nilesh spent 11 years working on various telecom and I/O technologies within Intel product groups and at a premier telecom research organization (C-DOT) in India. Nilesh was an open source contributor to Linux Standard Base (LSB).
Brian Jepson is an editor for O’Reilly Media; he covers a number of areas, including Arduino, wireless sensor networks, mobile devices, as well as some Microsoft and Apple topics.
He likes to hack on gadgets such as Arduino and the Netduino in his spare time, and he is also the co-founder and co-host of Providence Geeks, a monthly gathering in Providence, RI.
Annika is a seasoned leader of analytics initiatives, and came from Greenplum where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain and horizontal analytical expertise – who are facilitating Data Science-driven transformations for Greenplum customers.
Before Greenplum, Annika worked for Yahoo! for six years in data leadership roles: She led Audience and International data solutions for Yahoo!’s central data organization, Strategic Data Solutions, and led Insights Services – comprised of a team of 40 researchers covering Web analytics, satisfaction/brand health metrics, and audience/ad measurement. Annika is a recognized evangelist for “applied data” and well known for her acute focus on action-enablement.
David Jonker leads Big Data product marketing at SAP, and is helping to drive Big Data go-to-market initiatives across SAP’s data management and analytics platforms.
Previously, David Jonker lead product marketing teams for Sybase’s data management & analytics product lines offering solutions for Big Data, data warehousing, cloud, mobility, and embedded systems. David’s career also includes more than 14 years in software engineering and product management roles.
David has a degree in Systems Design engineering from the University of Waterloo.
PV Kannan co-founded 247 in April 2000. Under PV’s leadership, 247 has become the leader in intuitive customer service software and services. The company was founded based on PV’s vision to improve the consumer’s experience by leveraging big data, intuitive design and a learning platform to simplify customer service and make it easier for consumers to get things done.
PV has been featured in various publications as a thought leader in the field of global customer service, including Tom Friedman’s recent book “That Used to Be Us,” and in his earlier work “The World is Flat.” He is also a regular speaker at industry events and has been a panelist in Forbes Conference 2006 and Academy of 2006 Annual... Read More.
Gary Kearns is Group Executive, Information Services for MasterCard Advisors. He has global responsibility for leading efforts to develop differentiated data assets, drive growth within core target areas and identify new revenue opportunities in emerging segments at MasterCard. MasterCard data assets encompass 1.7 billion cards representing more than 30 million merchants and 65 billion transactions annually. Prior to joining MasterCard, Gary spent five years as president, Decision Analytics North America at Experian. In that capacity, he was responsible for leading B2B software and predictive analytics businesses serving financial institutions, government agencies, asset managers, retailers, utilities and telecom companies. He also was managing director and global leader for Risk Solutions at Standard & Poor’s. Prior to that, Gary spent twenty years in the banking industry. Gary... Read More.
Russ Kennedy brings more than 20 years experience in the storage industry to Cleversafe as the company’s Vice President of Product Strategy, Marketing and Customer Solutions. Having rolled up his sleeves working on automated tape libraries, Russ is still attracted to the technological challenges that have shaped the industry and particularly to the innovative approach that Cleversafe delivers to storage.
Russ joined the company initially in 2007 and left in 2009, staying on in an advisory role. In 2011, Russ rejoined the company seeing a clear opportunity to solve the storage needs surrounding the exponential growth of big data and the unique impact that Cleversafe delivers over traditional systems.
Previously, Russ served as the Vice President of Competitive Intelligence at CA Technologies,... Read More.
Paul Kent is Vice President of Big Data initiatives at SAS. He spends his time discussing, evangelizing and developing software at the confluence of big data and high performance computing.
Paul was previously Vice President of the Platform R&D division at SAS and led groups responsible for the SAS foundation and mid-tier technologies, teams that develop, maintain and test Base SAS®, as well as related data access, storage, management, presentation, connectivity and middleware software products.
Paul joined SAS in 1984 and contributed to the development of SAS software components including PROC SQL, TCP/IP connectivity, the Output Delivery System (ODS) and high-performance initiatives.
A strong customer advocate, Paul is widely recognized within the SAS community for... Read More.
Greg is the Business Development Manager for Big Data at Intel Corporation. He has over 10 years of in-depth experience in business intelligence and data warehousing at Oracle and Business Objects (now SAP). A former U.S. Army officer, Greg holds a master’s degree in Information Systems Management from Carnegie Mellon University.
Siraj founded The Climate Corporation (formerly WeatherBill) in 2006, having previously worked at Google in multiple technical lead roles, from the company’s distributed computing infrastructure to the high-profile Google Book Search project and other offline content search initiatives. Siraj obtained an M.S. degree in Computer Science from Stanford University, and a B.A. (Hons.) in Computer Science from the University of Cambridge, England. While at Stanford, he was also a lead software architect for the popular Folding@Home distributed computing project.
Mr. Irfan Khan joined Sybase in 1992. As Senior Vice President and Chief Technology Officer, Mr. Khan oversees all technology offices in each of Sybase’s business units, ensuring market needs and customer aspirations are reflected within the company’s innovation and product development. Mr. Khan is also responsible for setting the architecture and technology direction for the worldwide technical sales organization.
As part of CTO responsibilities, Mr. Khan oversees the technology evangelism efforts for Sybase, responsible for seeding new innovation and driving new technologies within Sybase’s most strategic accounts. Mr. Khan also is in charge of the Sybase Developer Network.
In 2010, Mr. Khan received the InfoWorld CTO Top 25 Award and was named to the International Advisory Board of Cloud Expo.
Amandeep is a Solutions Architect at Cloudera where he’s involved in the entire lifecycle of Hadoop adoption for customers – from use case discovery to taking systems to production. Amandeep is also a co-author of HBase In Action, a book geared towards building applications using HBase. Prior to Cloudera, Amandeep was at Amazon Web Services, where he was a part of the Elastic MapReduce team and built the first version of EMR’s HBase offering.
Aaron is the Founder and CTO of WibiData, Inc., a software company that engineers solutions for the large-scale user-centric data challenges that face today’s enterprises. He is a committer on the Apache Hadoop project and has been working with Hadoop since 2007. Aaron previously worked at Cloudera, a company which provides an enterprise platform, support and services built around Hadoop. Aaron founded the open source Apache Sqoop data import tool and Apache MRUnit Hadoop testing library projects. Aaron holds a B.S. in Computer Science from Cornell University and a M.S. in Computer Science from the University of Washington.
Robert Kirkpatrick is Director of the Global Pulse initiative of the United Nations Secretary-General, which aims to harness big data for a real-time understanding of human well-being.
Robert’s activities in government, academia, UN agencies, NGOs and the private sector have focused on developing innovative solutions at the intersection of technology, policy and social change. Robert has more than 15 years of experience in the design and use of advanced technology tools in business, public health, disaster relief, security coordination, citizen journalism, telemedicine, crisis monitoring, conflict mediation, and peace building activities. His work has focused on strengthening public policy, enhancing crisis resilience and catalyzing organizational change. Robert advocates open data, open standards, open source software, and participatory development.
He co-founded and led software development for two... Read More.
Edmund Kohlwey is a developer and data scientist at Booz Allen Hamilton. For the last three years, he has helped government clients adopt and develop their big data capabilities across many different problem domains.
Tech lead at Cloudera for new products. Graduated in 2000 with a PhD in databases from UC Berkeley, followed by engineering jobs at a few database-related startup companies. Marcel joined Google in 2003, where he worked on several ads serving and storage infrastructure projects. His last engagement was as the tech lead for the distributed query engine component of Google’s F1 project.
Yekesa Kosuru is a Distinguished Architect at Nokia. He leads the Data & Analytics engineering team. The team is responsible for building and managing the NoSQL platform and the Big Data Analytics Platform. The NoSQL platform executes hundreds of millions of key-value queries a day and the Big Data Analytics platform hosts multiple petabyte and executes over 10,000 jobs a day, with more than 600 users.
Prior to Nokia, Yekesa worked at Oracle, where he gained deep expertise in relational databases and OLAP technology.
Mr. Krishnan is a recognized expert worldwide in the strategy, architecture and implementation of high performance data warehousing and big data solutions. He is a visionary data warehouse thought leader, ranked as one of the top data warehouse consultants in the world, and an independent analyst, writing and speaking at industry leading conferences, user groups and trade publications. He has authored eBooks, over 100 plus articles, viewpoints and case studies in Business Intelligence, Data Warehousing and Data Warehouse Appliances and Architectures.
A recognized authority on Unstructured data integration, text mining and text analytics. Along with Bill Inmon, he is promoting the next generation of data warehousing and business intelligence around BIG Data,Hadoop, NoSQL, Unstructured Data Integration and Social Intelligence.
Krish presents and speaks at... Read More.
Joe Lamantia is currently the User Experience Lead and Architect for Oracle’s Discovery products. He has spent more than 15 years at the forefront of design and technology, working in Europe, the U.S. and Asia as an entrepreneur, management and strategy consultant, and design leader for organizations ranging from Fortune 100 companies to boutique startups. His passion is understanding and bettering how people engage and interact with natural and human-created experiences, systems and services.
Joe builds products and services, design and development teams, and research programs that address emerging and rapidly evolving interaction spaces through deep insight and innovation; speaks frequently for the international design and technology community on leading edge practices; publishes extensively in professional design and technology journals; and creates and shares original... Read More.
Robert (a/k/a r0ml) Lefkowitz is a writer, consultant, and amateur philosopher. He has spent his life working on technology adoption — involved in analyzing and addressing both the technical and social issues. His interests include semasiology and medieval history. He also juggles clubs.
Jonathan has been working in software development for more than 10 years. In the last 3 years he has developed and contributed to several new and existing open source projects, including Jedis, which is being used actively by VMWare, SpringSource, etc.. Today, Jonathan is the Head of Research and Development and is tasked with researching new technologies and opening up MercadoLibre: open source, and open to the community.
Jerome Levadoux is Senior Vice President, Solutions and Alliances for Autonomy, an HP Company. Prior to joining Autonomy, Jerome held positions in the areas of Corporate Strategy and Product Management at SAP, and most recently served as SVP, IT Management line of Business at SAP Labs.
Jerome holds a BS in Engineering from Ecole Centrale Paris, an MS in Operations Research from UC Berkeley, and an MBA from Harvard Business School.
Todd holds a Sc.B in Computer Science from Brown University, where he completed an honors thesis developing a new collaborative filtering algorithm for the Netflix Prize Competition. Todd interned at Google developing machine learning methods to detect credit card fraud on AdWords and Google Checkout. Currently Todd works at Cloudera, Inc on bringing Map/Reduce technology to enterprises.
Mary Ludloff is Vice President of Marketing at PatternBuilders, a big data analytics solution provider. Through her work at Brio Technology (now Hyperion) and NONSTOP Solutions (now Manhattan Associates), Mary has a deep understanding of supply chain management issues as well as the use of business intelligence tools in data warehousing and analytic application efforts. Mary is co-author of Informix Guide to SQL: Tutorial and Privacy and Big Data, a frequent white paper contributor, an editor of Object Request Broker (ORB) programmer reference manuals and tutorials, a speaker, and a blogger on diverse topics such as big data and analytics from a technology and business perspective, data security, data privacy, and marketing in the digital age.
Kevin visualizes data and makes statistical interfaces on the web.
Before Clojure, he wrote machine learning and analytics tools in R and Scala.
In 2010 he wrote a thesis on protein structure, for which Reed College inexplicably awarded him a physics degree.
Kevin lives in Portland, Oregon, and spends as much time rock climbing as he does in the REPL.
I design and build analysis and decision support systems, and building data management and access infrastructure. Research focus these days is on analysis techniques, emerging technology and practices in analytics, BI, information management, user experience for data access & delivery applications. I speak at a lot of conferences on anything data, with a bunch of history of science and technology mixed in.
I focus on two types of work: using data to make decisions and manage organizations, and building data technology infrastructure. A big part of making decisions and using data in a corporate setting is ensuring that the right data capture and data delivery infrastructure is in place to manage the business. As a result, I do as much information strategy and IT architecture... Read More.
Michael is a Developer Programs Engineer supporting developers who work with Google’s Cloud and Data platforms. With many years of experience working for research and non-profit organizations, he is interested in making data analysis on large scale computing infrastructure more accessible and affordable. Michael has written for tech blog ProgrammableWeb.com, has spent time in rural Uganda researching mobile phone use, and has a Masters degree in Information Management and Systems from UC Berkeley’s School of Information.
Steve Mardenfeld is a Data Engineer for the Big Data Team at Etsy, where he uses Hadoop and other tools to process and analyze large data sets. Previously to Etsy, he worked as a Senior Data Analyst for a criminal justice agency, where he focused on utilizing data to help inform criminal justice policy issues.
James Markarian serves as executive vice president and chief technology officer at Informatica with responsibility for the strategic direction of Informatica products, including the delivery of the Informatica Platform. He also runs the corporate development group, including acquisitions. Previously, Markarian held the role of general manager of the Data Integration Business, where he led the delivery of both the flagship PowerCenter product and the Informatica Platform. Prior to joining Informatica in 1998, Markarian spent 10 years at Oracle Corporation, where he was architect for the Oracle Tools Division, responsible for cross-product architectural planning and integration decisions. He also served on Oracle’s Architecture Review Board. Markarian earned a bachelor’s degree in Computer Science and Economics and a master’s degree in Economics from Boston University.
Taylor Martin received her Ph. D. in educational psychology from Stanford. She also holds an M.S. in cognitive psychology, and a B.A. in linguistics. She has worked in research and development in curriculum and design of instructional systems on projects such as the Adventures of Jasper Woodbury and the Algebra Project. She has also worked as an elementary school teacher,particularly in the area of mathematics. She was in the Department of Curriculum and Instruction and is affiliate faculty in Developmental Psychology and the Learning Technology Center at the University of Texas, Austin. She collaborates extensively with partners in the College of Engineering, the Physics Department, and the Texas Advanced Computing Center at the University of Texas, Austin, in Computer Science and Learning Science at the... Read More.
George Mathew oversees product development, strategy and marketing for Alteryx. He brings over 16 years of executive leadership experience in enterprise technology, software-as-a-service, and e-commerce. Previously, George was the group vice-president & general manager for Business Intelligence (BI) at SAP BusinessObjects, and before that managed SAP’s portfolio strategy in the Office of the CEO. Prior to SAP, George was an early member of salesforce.com. He was also a founding member of Fort Point Partners, a management consultancy that delivered large-scale e-commerce systems.
Mike has the responsibility of outbound marketing for the Greenplum product portfolio, a comprehensive platform that is driving the future of Big Data Analytics. Previous to Greenplum, Mike was the Senior Director of Product Management for ParaScale, a parallel distributed file system company now owned by Hitachi Data Systems. Prior toParaScale, Mike held product management roles at EMC Rainfinity and McDATA.
Q Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. Most recently he was seen working on the upcoming book Bad Data Handbook (O’Reilly).
Ryan’s role with farsite is to take the totality of data science and statistics and guide our analysts to create insight for your high-consequence decisions. Part of this job requires in-depth expertise, both to know how to produce results as well as to know what is possible. The other part is listening to you to understand how our analysis will have the most impact.
Originally in high-performance computation for nuclear engineering application, he earned his undergraduate degree and PhD from the University of Michigan. Due to the stochastic nature of many nuclear processes, Ryan naturally began working in statistics, at first in Monte Carlo simulations. While at Los Alamos National Laboratory and as a faculty member at Texas A&M University, Ryan became a recognized expert... Read More.
Richard McDougall is the Application Infrastructure CTO and Principal Engineer in the Office of the CTO at VMware. He is responsible for driving advanced development and strategy for VMware’s application platform architecture – including the performance and integration of applications, runtimes, middleware, and application encapsulation technologies.
Richard’s is known as an expert in the areas of performance measurement and optimization, and in application deployment architectures.
Before the CTO office, as the Chief Performance architect Richard drove the performance strategy and initiatives to enable virtualization of high-end mission critical applications on VMware products.
Prior to joining VMware, Richard was a Distinguished Engineer at Sun Microsystems. During his 14 years at Sun, he was responsible for driving high performance and scalability initiatives for Solaris... Read More.
Susan E. McGregor is an Assistant Professor with the Tow Center for Digital Journalism at Columbia Journalism School, where she specializes in data journalism and information visualization. She joined Columbia in 2011 following four years as the Senior Programmer on the Online News Graphics team at the Wall Street Journal, where she was a recipient of the 2010 Gerald Loeb Award in the “Online Enterprise” category as part of the team that developed the Journal’s “What They Know” series. She is also the co-author of a chapter on data journalism in the upcoming collection “Computation for Humanity: Information Technology to Advance Society,” due out in 2012. She holds a B.A. from Harvard in Interactive Information Design and an M.A. from NYU in Educational Technology.... Read More.
Building analytics libraries and research tools for quantitative finance and other fields. Actively involved in data analysis and statistics applications in the scientific Python community. Author of pandas library, contributor to statsmodels. Upcoming author of “Python for Data Analysis” from O’Reilly Media. CEO of Lambda Foundry, Inc.
Will is a software engineer at Cloudera.
Roberto Medri leads the Business Analytics team at Etsy, the world’s leading marketplace for unique goods.
At Etsy, Roberto works closely with product, marketing, and engineering to scout, build, instrument and improve Etsy’s product portfolio, spanning from revenue insights and data patterns to financial modeling to A/B experiments.
Roberto joined Etsy in 2010 as an intern while at the Wharton School, from which he graduated in 2011 as a Palmer Scholar. Before that, he worked as a consultant for Bain & Company in Italy, Switzerland, and Ukraine.
Roberto was born and raised in Lake Como, Italy.
Who am I?
In short, Nilofer Merchant is the (female) James Bond for Innovation.
As someone who has grown businesses — from Fortune 500s and silicon valley web startups — for 20 years, Nilofer is like a secret agent in knowing how to piece together the parts that matter (frameworks, strategies, and cultural values) — to get what everyone needs… results.
Rob Metcalf serves as President and COO of Digital Reasoning and is responsible for the business and sales strategy of the company. Rob Metcalf brings over a decade of experience in technology and information services businesses around the world. Prior to joining Digital Reasoning, Rob helped build software and information services businesses for LexisNexis in Mexico City, Toronto and Raleigh. As Vice President of Operations for Global Practice Management, Rob oversaw numerous acquisitions and key investments at the forefront of the company’s solution strategy. Prior to LexisNexis, Rob worked in Boston and San Francisco with The Parthenon Group, a strategy consultancy. Rob earned his MBA from Harvard Business School and his AB, Summa Cum Laude, from Princeton University.
Prior to joining the Laura and John Arnold Foundation, Anne served as New Jersey’s Attorney General where she headed the 9,000-person Department of Law and Public Safety. As New Jersey’s chief law enforcement officer, she oversaw 21 prosecutors and approximately 30,000 law enforcement officers. Anne implemented a statewide program to improve public safety through prevention of crime, law enforcement reform, and re-entry initiatives. Anne also served as a member of the United States’ Attorney General’s Executive Working Group on Criminal Justice and as a co-chair of the National Association of Attorneys General Criminal Law Committee. Prior to becoming Attorney General, Anne served as First Assistant Attorney General and, before that, was Counsel to a United States Senator. Anne also previously served as a federal prosecutor... Read More.
Dr. Donald Miner serves as a Solutions Architect at EMC Greenplum, advising and helping customers implement and use Greenplum’s big data systems. Prior to working with Greenplum, Dr. Miner architected several large-scale and mission-critical Hadoop deployments in the U.S. Intelligence Community. He is the author of the upcoming book “MapReduce Design Patterns”, which will be published by O’Reilly in the Fall of 2012. He is also involved in teaching, having previously instructed industry classes on Hadoop and a variety of artificial intelligence courses at the University of Maryland, BC. Dr. Miner received his PhD from the University of Maryland, BC in Computer Science, where he focused on Machine Learning and Multi-Agent Systems in his dissertation.
Romy Misra is the Director of Analytics at Visual.ly and is responsible for the company’s analytics and operations. She is passionate about startup analytics and building data communities to facilitate data related connections.
Romy has a graduate degree in Industrial engineering and is currently the co-organizer of the Data Mining meetup group in the Bay Area, where she regularly holds workshops on education on data science.
Justin is a member of the Entities/Data Science team at Facebook where he helps curate and build from their rich structured object and social graphs, with a focus on location. Before joining Facebook, Justin ran the Data team at foursquare. In addition to building their core data-driven products Explore and Radar, he built a team from the ground up that consisted of Engineers and Data Scientists to solve large scale data problems as foursquare’s dataset grew from half a million check-ins to over 1.5 billion. Before that, Justin worked at a hedge fund as a quantitative analyst, building custom portfolios for their asset management division and doing modeling and analysis for their risk team, specializing in high-frequency, derivatives, and commodities trading. Prior to that, he... Read More.
Prasad is a Software Engineer at Cloudera. He is also a committer on Apache Flume.
Arun C Murthy is Founder and Architect at Hortonworks, a software company focussed on focused on accelerating the development and adoption of Apache Hadoop. He also leads next generation MapReduce project in Apache Hadoop and has contributed full-time to Hadoop for 6 years now. In the community, Arun is the VP, Apache Hadoop, at the ASF i.e. the Chair of the Hadoop PMC. Prior to co-founding Hortonworks, Arun was responsible for all Hadoop MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop’s MapReduce as a service for Yahoo. He jointly holds the current world sorting record using Apache Hadoop.
Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering at Amie Street, where he worked on all components of the software stack, including operations, infrastructure, and customer-facing feature development. Aaron holds both an Sc.B. and Sc.M. in Computer Science from Brown University.
Thejas Nair is a software engineer working on Apache pig, hcatalog and hive projects at Hortoworks. He is a committer and PMC member of Apache Pig project. Previously, he worked at Yahoo for 9 years, developing solutions for large scale distributed data processing.
Moisés J. Nascimento is the Director of Data Architecture at Paypal where he has worked since 2008. Moisés developed a comprehensive data integration platform leveraging Cloud computing principles to build a real time data engine for Paypal. Prior to Paypal, Moisés architected data products and systems for several high tech companies including BMC Software and Salesforce.com.
Jack has over 20 years of enterprise software marketing experience. He has demonstrated success from defining new markets for small companies to increasing sales of new products for large public companies. Jack’s broad experience includes launching and establishing analytic, virtualization, and storage companies and leading marketing and business development for an early-stage cloud storage software provider. Jack has also held senior executive roles with EMC, Rainfinity, Brio Technology, SQRIBE, and Bain and Company. Jack earned an MBA from UCLA Anderson and a BA in economics with honors and distinction from Stanford University.
Cathy O’Neil earned a Ph.D. in math from Harvard, was postdoc at MIT in the math department, and a professor at Barnard College where she published a number of research papers in arithmetic algebraic geometry. She then chucked it and switched over to the private sector. She worked as a quant for the hedge fund D.E. Shaw in the middle of the credit crisis, and then for RiskMetrics, a risk software company that assesses risk for the holdings of hedge funds and banks. Since this spring she’s been a data scientist for the startup media company “Intent Media”;http://www.intentmedia.com/.
Amy is a business and technical executive with extensive experience driving growth and change. Amy leads Nokia’s Analytics team and is committed to enabling Nokia to be a data-driven business, ultimately delivering the best experience for consumers. Her charter includes the development of Nokia’s analytics platform, developing and managing Nokia’s data asset, and leading a team of data scientists to deliver descriptive and predictive insights. Prior to joining Nokia, Amy was vice president of services marketing for Sun Microsystems.
Mike (@mikeolson) co-founded Cloudera in 2008 and served as its CEO until 2013 when he took on his current role of chief strategy officer (CSO.) As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment and direct engagement with customers. Prior to Cloudera Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine. Mike spent two years at Oracle Corporation as vice president for Embedded Technologies after Oracle’s acquisition of Sleepycat in 2006. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies and Informix Software. Mike has a Bachelor’s and a Master’s Degree in Computer Science from the University of California, Berkeley.... Read More.
Oscar Padilla leads the strategic services for Entravision’s Data and Analytics Services division. He is accountable for driving the formulation of new product and service offerings, as well as helping clients achieve higher efficiency from their data and analytics marketing programs. Most recently, Oscar served as senior director of Interactive Strategy at Vertis Communications, a $1.4 billion communications and marketing company. As a direct report to Vertis’ CMO, Oscar oversaw the strategic direction of the division, leading client digital strategies in the areas of social media, mobile, online advertising and web development, and managed partners across all media channels.
In addition, Oscar has worked with Hitachi Consulting, Blackboard Inc., RepEquity (previously Mindshare Interactive) and Bridgeline Digital. At these companies, Oscar led engagement for numerous... Read More.
Josh Patterson is a Principal Solution Architect at Cloudera. Prior to joining Cloudera, he was responsible for bringing Hadoop into the smartgrid during his involvement in the openPDC project. His focus in the smartgrid realm with Hadoop and HBase was using machine learning to discover and index anomalies in time series data. Josh is a graduate of the University of Tennessee at Chattanooga with a Bachelors in Business Management and a Masters of Computer Science with a thesis titled “TinyTermite: A Secure Routing Algorithm” where he worked in mesh networks and social insect swarm algorithms. Josh has over 15 years in software development and continues to contribute to projects such as Apache Mahout, openPDC, and JMotif in the open source community.
Roy Pea is David Jacks Professor of Education and the Learning Sciences at Stanford University, Director of Stanford’s H-STAR Institute (Human Sciences and Technologies Advanced Research), and Founder & Director of Stanford’s PhD program in Learning Sciences and Technology Design. Roy publishes widely on K-12 learning and education, especially in science, math and technology, fostered by advanced technologies including scientific visualization, on-line communities, digital video collaboratories and mobile computers. He is co-author of the 2010 National Education Technology Plan for the US Department of Education, co-editor of Video Research in the Learning Sciences (2007), and co-author of the National Academy of Sciences book: How People Learn (2000). He is a Fellow of the National Academy of Education, Association for Psychological Science, the American Educational... Read More.
Jon Peltier is owner of Peltier Technical Services, Inc. He has been a full time Excel developer since 2004, when he founded Peltier Tech, and part-time since about 1995. Peltier Tech provides data and graphic solutions for users of Microsoft Excel, combining Excel’s worksheet functionality and charting capabilities with its powerful VBA programming environment to produce solutions in engineering, finance, and marketing. Peltier Tech has developed a line of Excel add-ins which extend Excel’s graphical capabilities. The Peltier Tech website is an encyclopedia of Excel charting and programming techniques.
Jon is a metallurgist by training, with a Doctor of Science degree from MIT. Prior to creating Peltier Tech, Jon spent over 20 years in research and development, and more recently in manufacturing and... Read More.
Mike Percy is a Software Engineer at Cloudera. Previously, he worked on Yahoo!’s C.O.R.E team.
Prior to joining Dstillery (former Media6Degrees), Claudia spent five years working at the Data Analytics Research group at the IBM T.J. Watson Research Center, concentrating on research in data analytics and machine learning for complex real-world domains and applications. She has been published in over 30 scientific publications and holds multiple patents in the area of machine learning. Claudia has won many data mining competitions, including the prestigious 2007 KDD CUP on movie ratings, the 2008 KDD CUP on breast-cancer detection, and the 2009 KDD CUP on churn and propensity predictions for telecommunication customers. Claudia received her Ph.D. in Information Systems from Stern School of Business, New York University in 2005 and holds a Master of Computer Science... Read More.
Tom Phillips is CEO of Media6Degrees (m6d), a marketing technology company that captures brand signals from major marketers to deliver high-performing ad campaigns.
Phillips was founding publisher of Spy magazine in 1986, the original media chief for Starwave Corporation in 1993 (creators of ESPN.com, NFL.com, ABCNews.com and NBA.com), and CEO of Deja.com in 1998. After a stint as a venture partner at Insight Venture Partners, he joined Google and managed media platforms and the DoubleClick integration before establishing the Search & Analytics team to pioneer new uses of Google data on behalf of major advertising clients.
Phillips holds a BA in applied mathematics from Harvard and a MBA from Stanford.
Arvind is the PMC Chair for Apache Sqoop and a committer and PPMC member of Apache Flume. A seasoned enterprise software developer, Arvind has worked at Netscape, Sun Microsystems, Informatica and currently at Cloudera.
Sanjay is an Apache Hadoop committer and member of the Apache Hadoop PMC.
Prior to co-founding Hortonworks, Sanjay was the chief architect of core-Hadoop at Yahoo focusing mostly on HDFS, MapReduce schedulers, compatibility, etc.
He has also held senior engineering positions at Sun Microsystems and INRIA, where he developed software for distributed systems and grid/utility computing infrastructures.
Sanjay has a PhD in Computer Science from the University of Waterloo in Canada.
Michael J. Radwin leads Intuit’s Data Services & Solutions team. His team develops new and innovative ways to put the collective data of 50 million customers to work for them, providing unique insights and experiences they can’t get anywhere else. With Radwin’s leadership, Intuit has adopted text analytics, recommendation services, and data-driven algorithms that create experiences that delight customers and impact the bottom line across product lines such as personal finance, tax, and accounting.
Prior to Intuit, Radwin was VP Engineering of Anchor Intelligence, which provided real-time scoring based on predictive models for advertisers and ad networks. Earlier, Radwin was Director of Engineering at Yahoo! where he developed and supported web and data platform technologies Apache, PHP, MySQL, and proprietary TCP/IP middleware systems.... Read More.
Aparna is Director of Engineering at Cloudera, where she leads the development of Cloudera Manager – the industry’s first end-to-end management application for Apache Hadoop. Prior to Cloudera, she worked on search monetization products at Yahoo! and on contextual advertising systems at Microsoft. Aparna holds a Master’s degree in Computer Science from Arizona State University.
Dr. Samantha Ravich consults with both private industry and Federal and State governments on international security, financial, and political risk. She is also a Senior Advisor to The Chertoff Group and Co-Chairs the National Commission for Review of Research and Development Programs in the IC. From 2009-2011, Ravich was Senior Vice President at IPS, a software and analysis firm.
Ravich was Deputy National Security Advisor to Vice President Cheney and served in the White House for 5½ years during the 2001-2008 time period. Ravich was the Vice President’s representative on Asian and Middle East Affairs as well as on Counter-Terrorism and Counter-Proliferation. During her tenure in OVP, she also worked on Free Market and Democracy Promotion, spearheading a number of government wide initiatives.... Read More.
Kim Rees is a founding partner of Periscopic, an award-winning information visualization firm. Their work has been featured in the MoMA as well as several publications, including Print Magazine, CommArts’ Interactive Annual, The Information Design Sourcebook, Adobe Success Stories, Infosthetics.com, FlowingData.com, and others. Periscopic’s body of work was recently nominated for the Cooper-Hewitt National Design Awards.
Kim is a prominent individual in the data visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is a guest blogger for Infosthetics.com. Kim has been featured on CommArts Insights and has presented at several industry events including the Wolfram Data Summit, Strata, VisWeek, Tableau Software Conference, WebVisions, and Portland Data Visualization.
... Read More.
I am a marketing manager at Revolution Analytics with a passion for analyzing data. I have worked a number of successful Silicon Valley start-ups including Sytek, Alantec, Parallan Computer and Scotts-Valley Instruments. I have graduate degrees in both the Humanities and Statistics. I taught statistics briefly at SJSU and I blog at blog.revolutionanalytics.com
As president of Luminar, Franklin Rios brings more than 20 years of management, business development, sales and marketing experience in the interactive, e-commerce, telecomm and media industries.
Prior to Luminar, Franklin was president of the Enterprise Solutions Group at Infogroup, and vice president of Interactive Sales and Marketing at Vertis Communications, Inc., a $1.4 billion communications and marketing company.
Franklin ran his own full-service Hispanic advertising agency, Rios Group, from 2003 to 2006. He was also director of sales at CSG Systems from 2001 to 2003 and a sales manager at Columbine JDS (Harris Broadcasting Systems) from 1994 to 2001.
Naomi B. Robbins is the author of Creating More Effective Graphs, published by John Wiley (2005). She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice.
Jason is a Sr. Architect at Think Big Analytics. He has many years of experience writing Java application software, most recently for Hadoop-based applications.
Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He’s been involved in the open source community and has contributed to a large number of projects over the last decade.
Peter is passionate about designing products that change the way users live, work, and interact with their world. He experienced first-hand the utility and complexity of big data while building products to secure enterprise networks. Peter has led Product and Marketing teams at Solera Networks, IronPort Systems, and Cisco Systems.
Bill Schmarzo, author of the upcoming “Big Data: Understanding How Data Powers Big Business” to be published by Wiley, is responsible for setting the strategy and defining the service line offerings and capabilities for the EMC Consulting Enterprise Information Management and Analytics service line. He’s written several white papers and is a frequent speaker on the use of Big Data and advanced analytics to power organization’s key business initiatives.
Bill has more than two decades of experience in data warehousing, BI and analytic applications. Bill authored the Business Benefits Analysis methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse... Read More.
Dr. Charles Schmitt is the Director of Data Sciences at the Renaissance Computing Institute (RENCI), a research computing center at the University of North Carolina at Chapel Hill. As director, Dr. Schmitt is responsible for exploring and advancing the application of novel data technologies for national research agendas. This includes work in areas such as high throughput genomic sequencing, management of distributed research data, medical decision support, and data security.
Prior to joining RENCI, Dr. Schmitt worked as a Computer Scientist in industry in areas including data-mining, bioinformatics, and software engineering. His Ph.D. is in Computer Science where he focused on developing neural network models of the human visual system.
John Schroeder founded MapR to address core limitations that were limiting the adoption of Hadoop and expand Hadoop capabilities to serve new business critical and real-time applications.
John has led companies creating innovative and disruptive business intelligence, database management, storage and virtualization technologies at early stage ventures through success as large public companies. John was previously CEO of Calista Technologies (Microsoft), CEO of Rainfinity (EMC) and Senior Vice President of products and marketing at Brio Technologies (BRYO).
Noah Schwartz is technology professional with a passion for sports. Noah is a member of a group at Bloomberg that recognized an business opportunity for a sports analytics and has been instrumental in creating the platform to deliver the applications to the end users.
Michael has been working w Hadoop since 2009 and is the founder of the Chicago area Hadoop User Group (CHUG)
He is currently a Principal Architect with Think Big Analytics helping customers solve their Big Data problems.
Chris Selland recently joined HP Vertica as VP of Marketing and brings 20+ years of success driving demand and revenue via innovative online, search & inbound marketing programs as well as leading key strategic alliance & corporate development initiatives for entrepreneurial, high-growth companies including SoundBite Communications (NASDAQ:SDBT), Constant Contact (NASDAQ:CTCT) and Lumigent Technologies (acquired by BeyondTrust).
Chris is an established thought leader, speaker and author on customer strategy-related topics including CRM, Customer Metrics & Loyalty, Customer Support and Social Media Marketing, and is an Expert Advisor at Focus.com as well as a founding member of the Enterprise Irregulars. Earlier in his career he was VP of CRM and Internet Research at the Yankee Group, and later founded Reservoir Partners,... Read More.
Sam Shah is a principal engineer on the LinkedIn data team. He leads many of the site’s large-scale recommendation and analytics systems, which analyze hundreds of terabytes of data daily to produce products and insights that serve LinkedIn’s members. His work involves pure research, product-focused features, and infrastructure development, including social network analysis, recommendation engines, distributed systems, and grid computing. Some of the products under his purview include “People You May Know”, “Who’s Viewed My Profile?”, Skills, related searches, job recommendations, and more. Sam holds a Ph.D. in Computer Science from the University of Michigan.
Sharmila has spent 18+ years building game-changing software companies in a variety of markets. She has been EVP & CMO at numerous software companies, including Netscape, Kiva Software, AOL, Opsware, and Aster Data. She drove the creation of several multi-billion dollar market categories, including application servers, data center automation and big data analytics. She is on the board of Hadapt and Lattice Engines, advisor to numerous companies, large and small, and an active investor in early stage companies.
At Intel, Vin Sharma is responsible for strategic ecosystem initiatives driving adoption of end-to-end analytics solutions based on Intel data center platforms. In this role, Vin spearheads technical and marketing engagements partners working with open source projects such as Hadoop, OpenStack, KVM, and Linux. Before Intel, Vin worked at HP for 15 years, most recently as the business strategist for open source and Linux on HP servers.
Blake Shaw is currently a Data Scientist at foursquare, a location-based service that helps people keep up with friends and discover new places. At this NYC startup, Shaw applies machine-learning algorithms to large spatiotemporal datasets in order to better understand patterns of human mobility. Shaw holds a Ph.D. (also a B.S. and M.S.) degree in Computer Science from Columbia University. His research papers have been accepted at a variety of conferences including NIPS, ICML and AISTAT. Shaw’s “Structure Preserving Embedding” won the Best Paper Award at the 26th International Conference for Machine Learning in 2009. Shaw was also the lead developer of CabSense, a mobile app for predicting the best street corners in New York City for catching taxicabs.
Chang She is CTO and cofounder of Lambda Foundry. From 2011 to 2012, he served as Assistant Vice President at Barclays Capital researching quantitative FX strategies and building research infrastructure. From 2006-2011, he worked at AQR Capital Management in global equities research and algorithm execution. He graduated from MIT with an M.Eng in Computer Science and S.B. degrees in Computer Science and Political Science.
Opower – Analytics – Senior Quantitative Analyst
Freddie Mac – Mission Division – Quantitative Researcher
Johns Hopkins – M.A. Applied Economics
Virginia Tech – B.A. Economics
Tomer Shiran heads the product management team at MapR and is responsible for product strategy, roadmap and requirements. Prior to MapR, Tomer held numerous product management and engineering roles at Microsoft, most recently as the product manager for Microsoft Internet Security & Acceleration Server (now Microsoft Forefront). He is the founder of two websites that have served tens of millions of users, and received coverage in prestigious publications such as The New York Times, USA Today and The Times of London. Tomer is also the author of a 900-page programming book. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion – Israel Institute of Technology.
Hari is a Software Engineer at Cloudera, where he is working on building Apache Flume. Previously, Hari was a software engineer on Yahoo! Mail’s metadata indexing and query team. He holds a Masters from Cornell University in Computer Science.
Patrick Shumate is currently busy developing and deploying the next generation of content delivery networks for Comcast Cable. Prior to Comcast he provided security consulting to US Government Agencies and was the Senior Architect for RSA Consumer Solutions devision, bring you such hits as the Go ID – federated two factor authentication and RSA eFraudNetwork.
Darío has been working in software development for more than 10 years.
Since 2004 he has specialized in large website, operations and
performance. Today, Darío is the Chief Architect of MercadoLibre (NASDAQ MELI) where he leads the architecture team.
Rick Smolan, a former Time, Life, and National Geographic photographer, is best known as the co-creator of the “Day in the Life” book series.
Today Smolan is the CEO of Against All Odds Productions which orchestrates global photography projects that combine creative storytelling with state-of-the-art technology. Pulling together teams of leading journalists and photographers (plus hundreds of thousands of members of the general public) to focus on emerging topics such as the global water crisis, the effect of the Internet on civilization or how the human race is learning to heal itself, Smolan and his team produce bestselling-illustrated books, TV specials, apps, and exhibits.
Many of Smolan’s books have appeared on the New York Times best-seller lists and have been featured on the covers... Read More.
Juhan Sonin, Creative Director of Involution Studios, is an emeritus of some of the finest software organizations in the world: Apple, the National Center for Supercomputing Applications (NCSA) and the Massachusetts Institute of Technology (MIT). He has been a creative director for almost two decades with his work being featured in the New York Times, Newsweek, BBC International, Billboard Magazine and National Public Radio (NPR). His designs have enjoyed installations throughout Europe and the United States.
Juhan is a recognized expert in design for health, process management, and security, providing consultation to the U.S. Department of Health and Human Services and the Department of Defense.
Juhan was a featured speaker on the topic of software design at the TTI Vanguard... Read More.
Marshall Sponder is an independent Web analytics and
SEO/SEM specialist working in the field of market research,
social media, networking, and PR. He provides digital data
convergence generating ROI and develops data metrics, KPIs,
and dashboards that drive businesses by setting and evaluating
For nearly a decade now, Marshall has been influencing
the development of the industry, lately focusing on social
media metrics. He also possesses considerable in-house corporate
experience as a group leader at IBM and Monster,
combined with contract work at Porter Novelli PR, and is
currently a Senior Analytics Manager at WCG (Wiesscom
Group); Marshall also has deep connections to the NYC startup
... Read More.
Laks has over 20 years of experience in various capacities in marketing, credit risk management, and customer portfolio management, with a focus on analytics and decision automation technologies. He has worked with a number of clients in the retail banking, mortgage, and retail industries addressing and solving various business problems, such as customer acquisition, originations, account management, attrition, collection, and fraud management. Before coming to Opera Solutions, Laks was with Fair Isaac as Senior Director of Innovation Management. There, he managed all aspects of new product development by identifying, evaluating, and developing new business opportunities in the Financial Services and Retail sectors. Laks also served as the Director of Product Management and the Director of Go-To-Market Management at Fair Isaac. In these roles, he led... Read More.
Raymie Stata is CEO of VertiCloud, a startup building a hosted Big Data platform for data-driven applications. Stata comes to VertiCloud from Yahoo!, Inc., where he most recently held the position of Chief Technical Officer. At Yahoo, he played an instrumental role in the development of Yahoo!’s infrastructures for algorithmic search, display advertising, and cloud computing. He also helped set Yahoo’s Open Source strategy and initiate its participation in the Hadoop project. Prior to joining Yahoo!, Stata founded Stata Laboratories, maker of the Bloomba search-based e-mail client, which Yahoo! bought in 2004. He has also worked for Digital Equipment’s Systems Research Center, where he contributed to the AltaVista search engine.
Julie thinks in metaphors and finds beauty in the clear communication of ideas. She is particularly drawn to visual media as a way to understand and transmit information, and is co-author of Beautiful Visualization (O’Reilly 2010) and Designing Data Visualizations (O’Reilly 2012).
Tom Strachan is the Regional Vice President at Gooddata where he is responsible for customer acquisition and retention for the Eastern Division. Prior to GoodData, Tom held a leadership position at WebTrend’s, a leader in the web analytics space where he helped customers measure and understand their return on the digital initiatives. In addition, Tom has been an entrepreneur in which he ran a Yoga eCommerce store called, Yogaduzit.com and has previously held a leadership position at Akamai Technologies where he participated in the growth of the organization from 20M to 400M in revenue. . He loves watching his customers go from being data bankrupt to monetizing that data in a very rapid time frame on the GoodData platform.
Mike Stringer is co-founder and managing partner of Datascope Analytics. A physicist gone wild, Mike is passionate about realizing the potential for big data analytics to make a positive impact on business and society. Despite wearing a suit occasionally, he still finds the time to argue with others about Bayesian statistics, explore the amazing food in Chicago, and has recently become fascinated with typography. Mike received a BS in Engineering Physics from the University of Colorado and a PhD in physics from Northwestern University.
Michael Sullivan is Deputy Editor at The New York World, Columbia Journalism School’s accountability journalism website covering New York State and City government, where he is helping to bring more data-driven journalism to the site. Before joining The New York World in 2012, Michael worked for The Wall Street Journal Online for eight years as an Interactive Designer and Developer. With a background in graphic design and a love for computer programming, Michael found that data visualization/journalism was the perfect niche to quench his creative thirsts. He knows all too well the overwhelming amount and ever-changing set of tools out there for anyone wanting to build information graphics—whether novice or expert—and continues to both learn from and teach others on how best to approach data... Read More.
Ryan is a data engineer at Think Big Analytics. He leads technical consulting projects for big data implementations at Fortune 500 clients. He has in depth experience working with Solr/Lucene and the Hadoop stack.
Liyin Tang is a software engineer at Facebook and a HBase Committer at the Apache Software Foundation. At Facebook, he works on building the data storage system based on HBase for various applications. Liyin holds a bachelor degree in Software Engineering from Shanghai Jiao Tong University, China and a master degree in Computer Science from University of Southern California, US.
Anoush currently works to implement the partnerships and communications strategy for Global Pulse, an initiative of the United Nations focused on applying innovations in big data and real-time analytics to global development.
Charles Thayer is a Software Engineer at Facebook, where he works on the Monitoring Systems. Before Facebook, he worked at Yahoo on search technology including the Web Crawler and Hosted Vertical Crawler. His focus has been scaling both storage and compute resources across thousands of nodes and tens of thousands of disks. He’s been involved with many startups in NYC including Metrobeat/Citysearch and CityRealty. He graduated with a BS EE from Columbia University’s School of Engineering and Applied Science, before becoming CEO of his first startup, Mediabridge Infosystems.
Jim has extensive experience in leading the development, management, and performance for enterprise data architectures, including clustered, large SMP, and distributed systems for the retail, web, and telecom industries. He is responsible for the architecture, vision, direction, and technical evangelization of InfiniDB. Jim holds a BBA from Texas A&M and a Masters in Management Information Systems from the University of Texas at Dallas.
Eric Tschetter is the lead architect of Druid, Metamarkets’ distributed, in-memory database. He held senior engineering positions at Ning and LinkedIn before joining Metamarkets. At LinkedIn, Eric productized LinkedIn’s PYMK with Hadoop. He holds bachelors degrees in Computer Science and Japanese from the University of Texas at Austin, and a M.S. from the University of Tokyo in Computer Science.
Vineet Tyagi has over 17 years of experience on product development, managing innovation and creating enterprise grade solutions for Healthcare, Transportation, Insurance and Financial sectors. His expertise includes Big Data, Hadoop, Cloud Computing and Agile/Lean software development. Vineet has spearheaded the design & architecture of several large & cloud scale solutions for data lifecycle management at petabyte scale. Vineet also leads many Open source contributions that have received global recognitions and is a noted speaker on Big Data Technologies.
Rohit is an experienced technology and marketing executive with over 15 years of experience in enterprise datacenter technologies including hands-on software development, product management and marketing experience in virtualization, cloud computing, security and Java EE middleware. Before joining IBM Corporation through its Platform Computing acquisition, he was the Director of Sun Microsystems’ Cloud Services Business Unit and head of Oracle University marketing. He has been a speaker at numerous JavaONE and other technical conferences and published papers in IEEE and other journals. He is also the author of two US Patents for Java and internet technologies.
Dirk Van den Poel is full professor of marketing modeling at the Faculty of Economics and Business Administration of Ghent University in Belgium. He heads a competence center on analytical customer relationship management (aCRM), direct marketing, customer intelligence. He received his degree of management/business engineer as well as his PhD from K.U.Leuven (Belgium). His main fields of interest are studying consumer behavior from a quantitative perspective (CRM), data mining ￼(genetic algorithms, neural networks, random forests), and operations research. He co-authored more than 50 international peer-reviewed articles in leading academic journals including Journal of Applied Econometrics, International Journal of Information Technology and Decision Making.
Vinod Venkataraman is a Software Engineer at Facebook, where he focuses on developing the in-house monitoring systems. Vinod holds a Master’s degree from the University of Texas at Austin, and a Bachelor’s degree from the National Institute of Technology, Trichy, India, both in Computer Science.
Bala Venkatrao is currently Director, Products at Cloudera, a software pioneer in the Big Data space, aimed at bringing Hadoop and other related cloud technologies to enterprises. At Cloudera, Bala is involved in a wide array of activities including Product Management, Marketing and Partnership Development. He also leads the Customer Advocacy function to ensure the needs of customers are addressed most effectively through products and services.
Prior to joining Cloudera, Bala worked at CA Technologies, where he was the founding member of CA’s cloud business unit. He was instrumental in shaping CA’s cloud strategy that led to the formation of the new business unit and several strategic acquisitions. He also conceptualized and led the product management effort for an innovative cloud service management product.... Read More.
Dauwe completed his masters degree in Business Engineering at Ghent University with a master thesis on distribution optimization in the home vending industry. After some time as an analyst at Accenture, Dauwe returned to Ghent to start his Phd at the faculty of economics and business administration. His main focus is on integrating customer variables in distribution optimization problems. He is also a teaching assistant in the Master of Marketing Analysis program. He has worked on different supply chain projects in different industries and has spent over 3 years working on a project in the home vending industry. His main fields of interest are Optimization, Supply Chain Management, Big Data and Analytics.
JD is a User Experience Lead at Salesforce where he specializes in search, online communities and collaboration. Prior work experience includes Director of UX at MarkLogic Inc, and NewCity, Inc. where he was the UX lead for the Virginia Bioinformatics Institute’s most successful project – resulting in a $27 million grant. (largest in the history of Virginia Tech) Awards include multiple IMA, CASE and ADDY’s. A graduate of Carnegie Mellon’s MHCI program, JD is frankly super stoked about how Big Data and design will affect our future.
Jim is a recovering developer, professional marketer and amateur photographer with nearly twenty years experience building products and developing emerging technologies. During his career, he has brought multiple products to market in a variety of fields, including data loss prevention, master data management and now big data. At Hortonworks, Jim is focused on accelerating the development and adoption of Apache Hadoop.
Dean Wampler is Principal Consultant at Think Big Analytics, specialists in “Big Data”, Machine Learning, and the Hadoop ecosystem. He speaks frequently at conferences on various big data and other programming topics.
Dean is the author of Functional Programming for Java Developers (O’Reilly, 2011), the co-author of Programming Scala (O’Reilly, 2009) and the co-author of the forthcoming Programming Hive, also from O’Reilly.
Sewook Wee is an R&D manager at Accenture Technology Labs. His research has been grounded on distributed system with current emphasis on Big Data platform technologies. Recently, he led Hadoop deployment comparison study where he compared bare metal Hadoop cluster with Hadoop services (Amazon EMR) at the total cost of ownership level with three real world workloads. Previously, he has led various R&D projects including hybrid NoSQL approaches that layers graph data management capability on column-oriented datastores; MapReduce-based data transformation framework; next generation software architecture that maximizes the benefits of cloud; MonteCloudo, elastic Monte Carlo simulation architecture using cloud; and web server farm architecture on AWS EC2 environment. Along with leading R&D projects, he publishes academic papers, business white papers, files patents, presents... Read More.
Frank Weigel is Vice President of Products at Couchbase, responsible for product strategy, product management, and community engagement. Frank is passionate about working with customers and users to understand emerging requirements and successfully driving disruptive technology solutions to market. Previous to Couchbase, Frank was with IBM as technical product manager for cross-platform virtualization. Prior to that Frank served in a variety of roles at Transitive, including R&D, product management, and sales engineering. Frank received his undergraduate and graduate degrees in computer engineering from Manchester University, and holds several U.S. patents.
Ben Werther is the Founder & CEO of Platfora. He founded the company in 2011 to realize his vision of how Hadoop and Big Data Analytics will transform the way every business user uses data and move beyond the fiction, feeling and faith that underlies most business decisions.
Under Werther’s direction, Platfora has grown from an idea sketched on a napkin to one of the hottest enterprise startups in Silicon Valley and a leader of the Big Data Analytics category. Platfora’s mission is to empower customers to leverage Big Data Analytics to transform their businesses into Fact-Based Enterprises. Designed for business users, the company’s product is the first visual self-service platform for interactively and iteratively interrogating enormous amounts of data, and masking the complexity... Read More.
Tom Wheeler’s career spans more than fifteen years in the communications, biotech, financial, healthcare, aerospace and defense industries. Before joining Cloudera, he developed engineering software at Boeing, helped to design a high-volume data processing system for WebMD and served as senior programmer/analyst for a brokerage firm. Mr. Wheeler is a frequent presenter at both user groups and software conferences.
Hadley Wickham is an Assistant Professor and the Dobelman Family Junior Chair in Statistics at Rice University. He is an active member of the R community, has written and contributed to over 30 R packages, and won the John Chambers Award for Statistical Computing for his work developing tools for data reshaping and visualisation. His research focusses on how to make data analysis better, faster and easier, with a particular emphasis on the use of visualisation to better understand data and models.
hMatt is a Principal Program Manager on the Big Data team at Microsoft focused on building tools to make developers’ lives easier. Matt has been at Microsoft for 6 years working on developer tooling for distributed systems and cloud applications as well as technical evangelism. Prior to joining Microsoft, Matt was a consultant in the central United States building enterprise integration solutions.
As the Technology Evangelist for Amazon Web Services, Matt Wood discusses the technical and business aspects of cloud computing throughout Europe. With a background in the life sciences, Matt is interested in helping teams of all sizes bring their ideas to life through technology. Before joining Amazon he built web-scale search engines at Cornell University, sequenced DNA in Hinxton and developed analytics software in Cambridge. He is a frequent speaker at international conferences, a blogger, published author and an advocate of developer productivity.
Adrian Woodhead works for Expedia helping tackle their “big data” challenges. This involves matching data sets of hundreds of millions of items across different product areas in order to allow for analysis and inspection. Prior to this he was the lead of the data team at Last.fm, worked in Amsterdam for a fledgling Digital Rights Management startup as well as other companies and startups in the mobile, content management and banking spaces.
For information on exhibition and sponsorship opportunities, contact Susan Stewart at email@example.com.
For information on trade opportunities contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at firstname.lastname@example.org
View a complete list of Strata contacts.