New speakers are added continuously. Please check back to see the latest updates to the agenda.
Jim is a business executive, entrepreneur, and thought leader on big data, privacy, security, and voting systems. Currently, Jim is VP of Products & Chief Privacy Officer at Metanautix. He also currently serves on the The Department of Homeland Security (DHS) Data Privacy and Integrity Advisory Committee (DPIAC) providing advice at the request of the Secretary of Homeland Security and the DHS Chief Privacy Officer.
Most recently, Jim was Vice President, Data Systems at inome and the first Chief Privacy Officer at Intelius. Jim led the big data team that powers the company’s products as well as serving as its chief consumer advocate. Prior to inome and Intelius, Jim served as president and chief technology officer at Identity.net, an Internet... Read More.
Dr. Vijay Srinivas Agneeswaran has a Bachelor’s degree in Computer Science & Engineering from SVCE, Madras University (1998), an MS (By Research) from IIT Madras in 2001 and a PhD from IIT Madras (2008). He was a post-doctoral research fellow in the LSIR Labs, Swiss Federal Institute of Technology, Lausanne (EPFL) for a year. He has done an internship in Siemens Corporate Research in Bangalore and was with another product development company – Oracle for three years, He subsequently spent a year as principal architect position with GTO, the research arm of Cognizant in Chennai, where he led the Extreme Processing group within the High Performance Computing Centre of Excellence and created Intellectual property in the Big-Data space. He... Read More.
Dr. Nadav Aharony is co-founder and CEO at Behavio. He completed his PhD at the MIT Media Lab’s Human Dynamics group, where he investigated the use of mobile phones as social and behavioral sensors, conducted one of the largest mobile data experiments done in academia, and initiated the open source mobile sensing platform that became Funf.org. Nadav was also a Fellow at the MIT Center for Civic Media for 3 years, since its inception, where he worked on topics of mobile and social activism.
Most recently Nadav worked at Google, as a product manager in the Android team. He has over 10 years of industry experience in engineering, product management, and business development roles, in organizations ranging from startups to... Read More.
He is the author of a number of books, and from time to time he also stands in front of cameras. You can often find him at conferences talking about interesting things, or deploying sensors to measure them. He recently rolled out a mesh network of five hundred sensors motes covering the entire of Moscone West during Google I/O. He’s still recovering.
Tasso Argyros is Vice President for Marketing and Product Management within the Aster Data Center of Innovation. Tasso has a background in data management, data mining and large-scale distributed systems. Before co-founding Aster Data, he was in the Ph.D. program at Stanford University.
Tasso is responsible for Aster Data’s marketing and product management strategy. He works with customers, partners, and industry leaders to help define the technology requirements created by the need to address the rapid emergence of diverse new types of data and diverse new types of analytics. He leverages his experience across the full technology solution stack to identify ways in which Aster Data technology can help to deliver solutions for big data analytics that apply to leading challenges across industries.
Bahman did his PhD at Stanford University, supported by William R. Hewlett Stanford Graduate Fellowship, and focused on the topic of algorithms for big data applications, in which he is a well-published author in some of the best conferences and journals, including PVLDB, SIGMOD, WWW, and KDD. He was the last PhD student of the legendary late Rajeev Motwani, and has been also advised and co-advised by Ashish Goel and Prabhakar Raghavan (formerly Yahoo VP of Strategy, currently Google VP of Engineering). His industry experience during his PhD studies spans several internships and collaborations with some of the best researchers and practitioners from Twitter, Microsoft Research, Yahoo Research, AOL, and Google. He is a recipient of the Yahoo Key Scientific... Read More.
Michael Bailey is an Economist and Researcher at Facebook and leads advertising demand forecasting. He joins a small group of economists specializing in Big Data economics and developing distributed economics algorithms. He graduated from Stanford with a PhD in Economics where he studied applied econometrics and internet economics. While a graduate student, he developing forecasting models for businesses including the International Student Forecasting Analytics Model (ISAFM).
Chaitan Baru is Distinguished Scientist and Associate Director Data Initiatives at the San Diego Supercomputer Center, University of California San Diego, where he also directs the Center for Large-scale Data Systems Research (CLDS). Baru’s interests are in research and development in the areas of parallel database systems, scientific data management, data analytics, and the challenges of data-driven science and data-driven enterprises. Baru has played a leadership role in a number of national-scale cyberinfrastructure R&D efforts across a wide range of science disciplines from earth sciences to ecology, biomedical informatics, and healthcare. Prior to joining SDSC in 1996, Baru led one of the development teams at IBM for an early UNIX-based shared-nothing database systems (DB2 Parallel Edition) and also led a team... Read More.
As Chief Product Officer, Sharren Bates is responsible for the vision, strategy, design and development of inBloom products and services.
Prior to joining inBloom in February 2013, Sharren’s excitement about the power of personalized learning and deep understanding of barriers to entry and innovation in the sector led her to her previous role as Senior Program Officer on the Next Generation Models team at the Bill and Melinda Gates Foundation. There she was one of the Foundation’s lead representatives on the Shared Learning Collaborative, and spearheaded the development of the inBloom technology services.
Prior to the Gates Foundation, Sharren worked on education policy at the federal level as an education analyst working on the National Broadband Plan at the Federal Communications Commission (FCC). Previously,... Read More.
Michael Bean is co-founder and president Forio Simulations, a software company specializing in developing web simulations that improve decision-making. Michael has consulted with corporations and government agencies and conducted scenario planning and computer simulation seminars to corporations and government agencies worldwide.
Dr Tyler Bell is Director of Product for Factual, an LA-based startup that has created a data platform powering local commerce – now with data on 63 million businesses and landmarks, and 500 thousand products. He previously did post-doctoral work on landscape analysis and the semantic integration of heterogeneous data at the University of Oxford. More recently and topically, he led Product for Yahoo’s Geo Technologies Group. He writes about data-, semantic- and geo-technologies for O’Reilly Radar at http://radar.oreilly.com/tylerb/
Milind Bhandarkar was the founding member of the team at Yahoo! that took Apache Hadoop from 20-node prototype to datacenter-scale production system, and has been contributing and working with Hadoop since version 0.1.0. He started the Yahoo! Grid solutions team focused on training, consulting, and supporting hundreds of new migrants to Hadoop. Parallel programming languages and paradigms has been his area of focus for over 20 years. He worked at the Center for Development of Advanced Computing (C-DAC), National Center for Supercomputing Applications (NCSA), Center for Simulation of Advanced Rockets, Siebel Systems, Pathscale Inc. (acquired by QLogic), Yahoo! and Linkedin. Currently, he is the Chief Scientist at Greenplum, a division of EMC.
Dr. Bhatia is a Big Data and HPC specialist at Amazon Web Services working with customers to identify, architect and deploy big data solutions. Prior to Amazon, Dr. Bhatia worked at a number of leading government and university research labs focused on building novel distributed systems primarily for scientific domains.
Marie Bienkowski is the Deputy Director of the Center for Technology in Learning, at the nonprofit research organization, SRI International. She works with educational researchers to develop and evaluate technology in K-12 education, and to contribute research data to education policy discussions. Many of her projects involve efforts to interest underrepresented groups in science, technology, engineering, and mathematics careers with a focus on computer science. She leads software projects in the areas of learning resource analytics and intelligent information management. Dr. Bienkowski co-authored the report “Enhancing Teaching and Learning through Educational Data Mining and Learning Analytics” for the U.S. Department of Education (released in October 2012). She is the co-PI of an NSF-funded grant on assessing computational thinking for high school students and... Read More.
Lynwood Bishop is President of Map Large, Inc.
Dynamic Visualization of Big Data: The MapLarge API can render millions of interactive points and shapes with no pre-caching or client side processing.
MapLarge maintains a cutting edge data visualization API for dynamic maps, charts and interactive query interfaces with machine assisted pattern recognition and data mining analytics. Our distributed platform has utilized thousands of servers to process large jobs. We can provide both existing product and R&D services.
Arrange to meet at Conference:
Billy is responsible for the day-to-day operations of DataStax. He has 20 years of experience in the database industry in roles ranging from DBA to senior executive. Prior to DataStax, Billy spent 6 years at Quest Software, a provider of systems management software, where his most recent role was VP and GM of the database business unit. Under his leadership, the industry-leading Quest database business grew from supporting traditional relational databases to a portfolio that now includes tools for cloud, NoSQL, columnar, and Hadoop databases, as well as business intelligence offerings. Prior to Quest, Billy led product teams for Embarcadero Technologies’ database productivity solutions. Billy holds a bachelor of science in computer science from the University of Louisville.
Cecilia Stewart is Vice President, Global Business Intelligence, Market & Customer Insights at The Western Union Company (NYSE: WU), a leader in global payment services.
Western Union provides consumers and businesses with fast, reliable and convenient ways to send and receive money around the world, to send payments and to purchase money orders. Western Union services are offered through a combined network of more than 500,000 agent locations in 200 countries and territories. In 2011, Western Union completed 226 million consumer-to-consumer transactions worldwide, moving US$73 billion of principal between consumers.
As leader of the global business intelligence function at Western Union, Cecilia is responsible for the design and delivery of market and competitive intelligence, leveraging big data capabilities. She leads the charge in how... Read More.
Ryan Boyd is a developer advocate at Google focused on enabling developers to extend Google Apps and build businesses on top of Google technology. He previously worked on OpenSocial and led the developer relations team for Google’s AtomPub APIs. Prior to joining Google, Ryan worked in higher education as a web architect for RIT’s central web hosting environment and as web app developer building admissions and student systems.
Kipp Bradford is an educator, technology consultant, and entrepreneur with a passion for making things. He is one of the USA Science and Engineering Festival’s Nifty Fifty. He is also the Demo Chair of the "Open Hardware Summit":http://2013.oshwa.org/participate/ and a featured innovator at "Frost & Sullivan’s GIL 2013":http://frostevents.wordpress.com/. As the former "Senior Design Engineer and Lecturer":http://research.brown.edu/research/profile.php?id=1296749291 at the Brown University School of Engineering, Kipp taught several engineering design and entrepreneurship courses. He has founded startups in the fields of transportation, consumer products, HVAC, and medical devices, including the "Data Sensing Lab":http://datasensinglab.com/ and "Revolution By Design":http://www.revolutionxdesign.org/. Kipp is a Fellow at the College of Design, Engineering and Commerce at Philadelphia University, and an Adjunct Critic at the "Rhode Island School of Design":http://www.risd.edu/Graduate_Studies/Kipp_Bradford/. He coauthored "Distributed Network Data":http://shop.oreilly.com/product/0636920028802.do. He... Read More.
Founder/CEO of Spinn3r, co-inventor of RSS, Apache contributor, and big data geek.
Feilding Cage is an interactive designer for the Guardian’s US in New
York. Recent work includes state-by-state visualizations of US gun
and gay rights (http://www.guardian.co.uk/world/interactive/2012/may/08/gay-rights-united-states),
and live election results from the primaries through the general
Feilding’s work has been honored with a 2012 Online Journalism Award
and is nominated for a Glaad Media Award. Before coming to the
Guardian, Feilding was an interactive designer at Time.com and the
supervising interactive developer at Associated Press. He is a 2006
graduate of UNC’s school of journalism and mass communication and is a
master’s degree candidate at NYU.
David Campbell is a Microsoft Technical Fellow whose present role is Vice President of Product Development for the SQL Server product suite.
David graduated with a Master’s Degree in Mechanical Engineering (Robotics) from Clarkson University in 1984 and began working on robotic workcells for Sanders Associates – later a division of Lockheed Corporation. In 1990 he joined Digital Equipment Corporation where he worked on their Codasyl database product DEC DBMS as well as their relational database product; Rdb.
Upon joining Microsoft in 1994, David was a developer and architect on the SQL Server Storage Engine team that was principally responsible for rewriting the core engine of SQL Server for SQL Server Version 7.0.
At Microsoft, he has held numerous... Read More.
Aaron is a software engineer currently located in Pittsburgh, PA. He received his Ph.D. in 2007, developing algorithms and software for 3D medical image analysis. He currently leads a software development team at Carnegie Mellon University, focusing on web application development and cloud systems.
Aaron is a polyglot programmer, with a keen interest in open source technologies. Some favorite technologies at the moment include Node.js, Python/Django, MongoDB, and Redis.
Mr. Collis is the Chief Technical Officer of AWhere Inc. He has over 15 years’ experience in all phases of design, development and management of software development projects for desktop and web applications. He has an education in Engineering Science and Computer Science with a focus on spatial information systems and remote sensing and applied experience in agricultural research science and natural resource management applications. He has experience developing software for use and distribution in developing countries and has worked on a variety of software and database development projects with various international research organizations, NGO’s and in-country government departments. More recently his focus has been on development of integrated spatial intelligence systems and user interfaces to allow non-technical users access to the enormous power of... Read More.
Eric Colson is Chief Analytics Officer at a Stitch Fix. For more than 18 years, he has led data-oriented teams that span algorithms & machine learning, Big Data & data warehousing, and analytics & business intelligence. Prior to Stitch Fix Colson was Vice President of Data Science & Engineering at Netflix. He holds degrees in Information Systems and Economics.
Shaun has more than 20 years of experience in the software industry, with a track record of building early stage and midsize software companies into successful market leaders. Shaun was previously VP of Product Strategy at VMware where he focused on defining VMware’s cloud application platform vision. Shaun has also held VP and Director level positions at SpringSource, Red Hat, JBoss, Princeton Softech, HP, Bluestone Software, and Primavera Systems. He holds a B.S. in Electrical Engineering from Drexel University. Follow Shaun on Twitter: @shaunconnolly.
Liz is the CTO at Birchbox an ecommere company that is changing the way people discover products. She has published papers on topics ranging from Machine Learning and Natural Language Processing to Intelligent Tutoring Systems and Robotics. Liz has a PhD in Computer Science from Carnegie Mellon and is a co-organizer of CTO School.
For ten years, Kate Crawford has published widely on the social, political and cultural practices that surround and inform media technologies. She has conducted large-scale studies of mobile and social media use at sites around the world, including India, the US and Australia. Her book on technology, culture and generational critique, ‘Adult Themes’, won the Manning Clark Cultural Award. Her current projects include the politics of big data,... Read More.
Alistair has been an entrepreneur, author, and public speaker for nearly 20 years. He’s worked on a variety of topics, from web performance, to big data, to cloud computing, to startups, in that time. In 2001, he co-founded web performance startup Coradiant (acquired by BMC in 2011), and since that time has also launched Rednod, CloudOps, Bitcurrent, Year One Labs, the Bitnorth conference, the International Startup Festival and several other early-stage companies.
Alistair is the chair of O’Reilly’s Strata conference; Techweb’s Cloud Connect; and the International Startup Festival. He’s written four books on analytics, technology, and entrepreneurship, including the best-selling Lean Analytics which is being translated into eight languages. He lives in Montreal, Canada and tries to mitigate chronic ADD by writing... Read More.
Sandra Crucianelli is a Knight International Journalism Fellow who is an investigative journalist and instructor. She specializes in digital resources and data journalism. She is the founder and editor of Sololocal.Info, an online magazine that provides hyper-local news in Bahía Blanca City, Argentina, and has been an instructor and consultant for the Knight Center for Journalism in the Americas at the University of Texas, Austin, since 2004.
Crucianelli also is an instructor for Florida International University’s International Media Center and the author of Digital Tools for Journalists, a Spanish- and Portuguese-language resource. She is on the advisory board of the Digital Journalism Center at the University of Guadalajara, Mexico. She was one of the first journalists working outside Argentina’s capital to focus on investigative... Read More.
I create interactive data visualizations. Prior to that, I have been in charge of storytelling with data and the visualization effort for OECD, and before that, I managed a videogames development studio. I hold an MBA from the University of Texas as Austin and a masters degree from EM Lyon.
Kenneth Neil Cukier the data editor of the The Economist, and
co-author of “Big Data: A Revolution that Will Transform How We Work,
Live and Think” to appear in early 2013. Previously, he was the
paper’s Tokyo correspondent and before that, its technology
correspondent in London. From 2002 to 2004 Mr. Cukier was a research
fellow at Harvard’s Kennedy School of Government, where he worked on
the Internet and international relations. Additionally, Mr. Cukier
serves on the board of directors of International Bridges to Justice,
a Geneva-based NGO promoting legal rights in developing countries.
William Cukierski is a data scientist at Kaggle. He has a bachelor’s degree in physics from Cornell University and a Ph.D. in biomedical engineering from Rutgers University, where he studied applications of machine learning in cancer research. Prior to joining Kaggle, he finished competitively in predictive data competitions on topics ranging from predicting stock movements, to forecasting grocery shopping, to automated essay grading.
Raanan is responsible for Big Data Product Marketing at Splunk. He is focused on helping customers harness their big machine data to gain operational intelligence and driving an effective market-driven product strategy. Raanan has spent the past 10 years focused on data, information and distributed systems. Prior to Splunk, Raanan worked at Cloudera where he was a Hadoop sales engineer and before that, spent 9 years as an engineer and sales engineer at Oracle. Raanan is a certified Hadoop developer and administrator and holds a BS in Computer Science and MS in Management.
Brandon Daniels is the President of Managed Services for Clutch Group. Brandon leads the organization’s strategy for all commercial and operational aspects of our Litigation & Investigation, Compliance & Risk, and Corporate In-house Services. Brandon has an extensive background in the management and application of legal technology. Over the last decade, he has focused on applying technology to legal services in order to create efficiency, increase quality, and reduce risk for corporations. He has managed the development or acquisition and implementation of knowledge management, performance tracking, data room, and litigation support software systems. He has also led the creation of facilities and technology environments that have withstood both risk and security audits from some of the largest investment banks, bioscience companies, and technology corporations in... Read More.
Carson is a mechanical engineer from MIT with additional expertise in computer science and electrical engineering. He worked previously at Cool Earth Solar and Alliance H. Inc., where he created testing and data systems as well as designed and manufactured prototypes for various products. Additionally, he worked for several years on the MIT Formula SAE Race Car Team, where he designed car-specific dual independent suspension systems. In his role at Rest, he leads the team’s electronic and web development.
Tathagata Das is a fourth-year Ph.D. student in the AMP Lab in UC Berkeley, working Scott Shenker and Ion Stoica. He leads the development of the Spark Streaming project. His research interests include datacenter networks and frameworks for large scale data processing. Before graduate school, he has worked as an Assistant Researcher in Microsoft Research Lab India.
Joydeep has worked in the enterprise software industry for over 20 years in leadership roles in engineering and product management. As an engineer, he led several R&D projects building enterprise DBMS products at Sybase and Informix. In his product management role, he currently leads and manages SAP’s HANA In-memory DBMS and Sybase branded data management portfolio with a focus on big data analytics and transaction processing segments. Joydeep is a frequent speaker at tradeshows, user conferences, customer presentations, and webcasts. Joydeep has also authored several articles in leading publications.
John A. De Goes has been working professionally in distributed systems design and development for more than a decade. Author of multiple best-selling technical books, and a major contributor to open source, John has an extensive background in in scientific and distributed computing, and large-scale analytics. John is a frequent and well-received speaker at industry events. Recent engagements include DataWeek Conference, Glue Conference, Frontier Developers and NEScala. As CEO and CTO of Precog, John is responsible for leading the design and development of the company’s data warehousing and analysis platform.
Marc Demarest is CEO and a principal in Noumenal, Inc., an international management consulting firm based in the Pacific Northwest and the UK that provides a range of management and technical consulting services to high-tech, biotech, nanotech, and greentech firms. Widely known as an early proponent of data marting and tiered enterprise data warehousing models, Demarest is currently writing a book on nontechnical aspects of enterprise data warehousing.
Jeff Denworth has over a decade of experience with advanced computation and massively scalable big data storage technologies. Before joining DDN in 2006, Mr. Denworth ran worldwide sales and marketing at Cluster File Systems, Inc. – where he managed all aspects of sales & business development associated with the Lustre® file system. Prior to Cluster File Systems, Jeff managed the HPC and enterprise server memory business development at Dataram Corporation.
Rahul has a unique blend of Web Analytics strategy, Business Intelligence and marketing experience. He is currently Director – Web Intelligence Product Marketing at Splunk Inc. Previously, he was Sr. Manger – Search Analytics at eBay. Rahul led a team of analyst that worked on analytics and optimization for Product Based Experience, Catalogs, Vertical search, Fitment, Classification and Advertising. Prior to eBay, Rahul was an Independent Web Strategy consultant focused on Social Media. His past experience includes Director of Analytics and Optimization for Ask.com and Dell. Rahul has served on the Client Advisory board for Hitwise and has spoken at Google, Strata Velocity, eMetrics and number of industry events.
Parviz Deyhim is an Enterprise Solutions Architect at Amazon Web Services. Parviz focuses on big data and Hadoop related topics and works with different customers to help them identify and architect their big data workload on Amazon’s Elastic Map Reduce.
Edd Dumbill is a technology analyst, writer and entrepreneur based in California. He’s helping drive businesses with data as VP Strategy for Silicon Valley Data Science.
A startup veteran, Edd was the founder and creator of the Expectnation conference management system, and a co-founder of the Pharmalicensing.com online intellectual property exchange.
An advocate and contributor to open source software, Edd has contributed to various projects, such as Debian and GNOME, and created the DOAP Vocabulary for describing software projects.
Edd has written four books, including O’Reilly’s “Learning Rails”. He writes... Read More.
Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.
Dr. Khaled El Emam is the Founder and CEO of Privacy Analytics, Inc. He is also an Associate Professor at the University of Ottawa, Faculty of Medicine, a senior investigator at the Children’s Hospital of Eastern Ontario Research Institute, and a Canada Research Chair in Electronic Health Information at the University of Ottawa. His main area of research is developing techniques for health data de-identification or anonymization and secure disease surveillance for public health purposes. He has made many contributions to the health privacy area. In addition, he has considerable experience de-identifying personal health information under the HIPAA Privacy Rule Statistical Standard.
Previously Khaled was a Senior Research Officer at the National Research Council of Canada, and prior to that he was head... Read More.
Byron joined LivePerson in April as Chief Data Scientist where he works on improving LivePerson’s intelligent targeting solutions and developing new data products. Prior to joining to LivePerson he spent over 4 years at adBrite, an online advertising exchange, where he held a number of roles related to leveraging the large amount of data generated by adBrite’s exchange. He holds a PhD in Statistics from Harvard University where he studied structure learning for Bayesian Networks with applications to the Life Sciences.
Stephan Ellner has been a software engineer in Google’s ads infrastructure team since 2004, focusing on stream processing and low-latency distributed query execution. Stephan currently manages the teams developing the F1 database and related extraction and reporting systems. He holds Master’s degrees in computer science and classical music from Rice University.
Product manager at Cloudera, the standard for Hadoop, for HDFS, HBase, and part of Hive. Previously lead development of the new high availability and disaster recovery solution for Microsoft SQL Server 2012 and a Stanford University graduate.
Tim Estes is the Chairman, CEO, and founder for Digital Reasoning. Tim’s academic work at the University of Virginia focused in the areas of Philosophy of Language, Mathematical Logic, Semiotics, Epistemology, and Phenomenology. It was that eclectic academic background, coupled with the belief that in the future all software would learn from data as a core capability that gave rise to Digital Reasoning.
Tim and his team at Digital Reasoning work closely with leaders in government and industry to solve extraordinarily valuable and morally compelling problems in National Security, Finance, Legal, and Health Care by automating the understanding of unstructured data.
Susan Etlinger is an industry analyst at Altimeter Group, where she works with global companies to develop both social data intelligence strategies that support their business objectives. Susan has a diverse background in marketing and strategic planning within both corporations and agencies. She’s a frequent speaker on social media and analytics and has been extensively quoted in outlets including Fast Company, BBC, New York Times and The Wall Street Journal. Find her on Twitter at @setlinger and at her blog, Thought Experiments, at susanetlinger.com.
Shelley Evenson recently joined Fjord as Executive Director, Organisational Evolution. As a leader in service design and with more than 25 years of experience, Shelley Evenson is adept at helping organizations develop an in-depth understanding of customer needs, building experience strategies that respond to those needs, and implementing strategies across platforms and channels.
Shelley formerly oversaw research for user experience and design at Facebook and was a Principal User Experience Designer and Manager for Microsoft. Shelley was also an Associate Professor at the School of Design at Carnegie Mellon University, where her projects included collaborations with GM, Intel, The Mayo Clinic, Microsoft, Motorola, and UPMC.
Robert Faludi is the Collaborative Strategy Leader in R&D for Digi International, with a mandate to forge stronger connections with the community of innovators, discover outstanding new work, contribute to outside projects, and support the people making that work. Faludi is also a professor in the MFA program at the School of Visual Arts in Manhattan and in the Interactive Telecommunications program at NYU. He specializes in behavioral interactions through physical computing and networked objects. Rob is the author of Building Wireless Sensor Networks, with ZigBee, XBee, Arduino and Processing published by O’Reilly Media, 2011. His work has appeared in The New York Times, Wired Magazine, Good Morning America, BBC World, the Chicago Museum of Science & Industry and MoMA... Read More.
John Feland is the award winning founder and CEO of Argus Insights. With background in both industrial and component-level design, carrier influences, and OEM collaboration, Feland brings his expert knowledge and perspective on product design to to Argus Insights,
In 2009-2010, Feland was the Executive Director for Stanford University’s ME310 Design Innovation course. In 2008, Feland served as Chief Technologist for SK Telecom America’s R&D Group. Prior to SKTA, Feland led Synaptics’ marketing division to help handset manufacturers integrate next-generation capabilities. Feland was the principle architect for the Onyx Concept Phone, the world’s first capacitive multi-touch mobile experience. Feland holds an S.B. in Mechanical Engineering from MIT, an M.S. in Mechanical Engineering from Stanford University, and a PhD in Mechanical Engineering... Read More.
Julia Ferraioli is a Developer Advocate working on Google Compute Engine. She helps developers harness the power of Google’s infrastructure to tackle their computationally intensive processes and jobs. She comes from an industrial background in software engineering, and an academic background in machine learning and assistive technology.
Lutz is a Big Data Guy, Entrepreneur and Quantum Physicist. He has co-founded and lead Fisheye Analytics, a media data mining company. As expert for data analytics he has supported various industries, from Brands such as Telefonica or the Daily Mail Group to the governments and NGO ’s such as the World Economic Forum and the Olympic Committee .
His book COMPETING ON DATA will be published in Summer 2013 by O’Reilly Media.
Lutz has a Masters in Quantum Physics and an MBA from INSEAD.
Barry Fischer is the Head Writer/Editor of Opower’s big-data and energy blog, “Outlier,” which leverages energy usage data across 50 million US households. Prior to his current role, Barry was part of Opower’s engagement management team, which works with 80 utility companies to empower millions of customers to understand their energy usage. Barry earned a BA in economics and MS in environmental engineering from Stanford University, and subsequently completed a Fulbright Fellowship in New Zealand, where he investigated opportunities for renewable energy in the Pacific islands. His work on island energy development has appeared in the journal Energy. You can follow Barry on Twitter @OpowerOutlier.
Michael Fishwick heads the Global Business Intelligence unit of Telefonica Digital.
Johanna is a data scientist and project manager at Blue Yonder, specialized in demand forecasting in the retail industry. She has worked on projects with major retailers. She holds a PhD in high energy physics.
Previously, Johanna worked for the European Organization for Nuclear Research in Switzerland (CERN) where she gained her deep experience with big data, predictive analytics, and machine learning.
John Foreman is the Chief Data Scientist for MailChimp.com where he leads MailChimp’s data product development effort called the Email Genome Project. He also runs the Data Science for Managers course at Analytics Made Skeezy.
John holds a graduate degree in Operations Research from MIT and has worked as an analytics consultant for the Department of Defense, Coca-Cola, Royal Caribbean International, and Intercontinental Hotels Group. His expertise is in optimization modeling, revenue management, and predictive modeling.
Natasha Gajic is currently Enterprise Business Intelligence (EBI) Specialist – Technical Architecture Lead at Rackspace. Natasha has been working in the computer industry since 1988 in various roles around software design, database administration, system development and consulting. Currently, Natasha is leading an engineering effort on Rackspace OpenStack Big Data solution called Analytical Compute Grid. Before joining Rackspace, Natasha worked for 10 years as a consultant on various projects in Toronto, Canada. There she implemented the Ontario Major Case Management system, which is a province-wide intelligence system for criminal investigations. Natasha holds a patent on Data Management and Indexing across a Distributed Database in the Cloud. She holds a Masters of Electrical Engineering degree from Military Technical University in Zagreb, Croatia.
Yael Garten is a Senior Data Scientist at LinkedIn, where she leads the mobile data analytics team that focuses on understanding and increasing Mobile engagement of LinkedIn’s 200 million members. Yael’s team is focused on converting LinkedIn’s rich data into actionable product and business insights that optimize the LinkedIn mobile experience and create powerful data-driven products that help members be productive and successful. Her team also focuses on understanding the impact of mobile usage on the rich LinkedIn ecosystem.
Before joining LinkedIn, Yael was at Stanford University, where she completed her PhD in Biomedical Informatics, focusing on information extraction via natural language processing to understand how human genetic variations impacts drug response. Yael advises biomedical companies on informatics methodologies to transform high throughput data into... Read More.
Alan is a co-founder of Hortonworks. He is an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. He is also the author of Programming Pig, a book from O’Reilly Press.
A software/systems engineer with a lot of experience building big, real-world systems.
Rayid Ghani is a Research Director and Senior Fellow at the Computation Institute and the Harris School of Public Policy at the University of Chicago. He is also the co-founder of Edgeflip, an analytics and social media startup that is focused on helping non-profits, advocacy groups, and charities do better fundraising, volunteer recruiting, outreach and advocacy. Previously, Rayid was the Chief Scientist for the Obama 2012 Election Campaign focusing on analytics, data, and technology.
Rayid is currently focused on using data, analytics (and other related buzzwords ) for social causes, both with Edgeflip and the University of Chicago. Rayid created and runs the Eric & Wendy Schmidt “Data Science for Social Good” Summer Fellowship which brings together aspiring data scientists to work on data science... Read More.
Brian Granger is an Assistant Professor of Physics at Cal Poly State
University in San Luis Obispo, CA. He has a background in theoretical
atomic, molecular and optical physics, with a Ph.D from the University of Colorado. His current research interests
include quantum computing, parallel and distributed computing and
interactive computing environments for scientific and technical computing. He is a core developer of the
IPython project and is an active contributor to a number of other open source projects focused on scientific computing in Python.
Dr. Gray obtained degrees in Applied Mathematics and Computer Science from Berkeley and a PhD in Computer Science from Carnegie Mellon, and is a tenured professor at Georgia Tech. His lab works to scale up all of the major practical methods of machine learning (ML) to massive datasets. He began working on this problem at NASA in 1993 (long before the current fashionable talk of “big data”). His large-scale algorithms helped enable the Science journal’s Top Breakthrough of 2003, and have won a number of research awards. He is a member of the National Academy of Sciences (NAS) Committee on the Analysis of Massive Data, is a NAS Kavli Scholar, and frequently gives invited tutorial lectures on massive-scale ML at top research... Read More.
Lisa Green is the Director at the Common Crawl Foundation where she
oversees the foundation’s mission of building, maintaining and openly
disseminating a comprehensive crawl of the web. Prior to joining
Common Crawl, she was the Chief of Staff at Creative Commons. Lisa
holds a PhD in physical chemistry from the University of California
Berkeley, lives in San Francisco and is passionate about open systems.
I specialize in teaching people how to use R – and especially Hadley Wickham’s R packages – to do insightful, reliable data analysis. I’ve worked with Hadley for five years. He was my dissertation advisor at Rice University, where I gained a first-hand understanding of his R libraries. While at Rice, I taught (and helped developed) the courses “Statistics 405: Introduction to Data Analysis,” and “Visualization in R with ggplot2”. Before that, I taught introductory statistics as a Teaching Fellow at Harvard University.
I’m very passionate about helping people analyze data better. I have travelled as far as New Zealand, where R was born, to learn new ways to teach data analysis. I worked alongside some of the original developers of R to hone my... Read More.
Dr. Nagui Halim’s technical vision and leadership launched the era of stream computing at IBM. In response to a client request in 2003 to create a new architecture for high-speed adaptive stream processing and analytics, Nagui recruited and assumed leadership of a large interdisciplinary research team, working in close and novel collaboration with the client, to undertake this formidable project to develop a new type of computing system able to manage and analyze massive volumes of continuous streams of data, which became known as System S. As the technical lead on System S, Nagui developed the foundational concepts and designed the architecture for this new computing system.
Ben Hamner is responsible for data analysis, machine learning, and competitions at Kaggle. He has worked with machine learning problems in a variety of different domains, including natural language processing, computer vision, web classification, and neuroscience. Prior to joining Kaggle, he applied machine learning to improve brain-computer interfaces as a Whitaker Fellow at the École Polytechnique Fédérale de Lausanne in Lausanne, Switzerland. He graduated with a BSE in Biomedical Engineering, Electrical Engineering, and Math from Duke University.
Jeanne Harris is the Global Managing Director of Information Technology Research and Executive Research Fellow at the Accenture Institute for High Performance in Chicago. Jeanne leads the Institute’s global research agenda, which currently includes data monetization strategies, embedding Big Data analytics, and understanding and exploiting digital experiences. She is co-author of “Competing on Analytics: The New Science of Winning” and “Analytics at Work: Smarter Decisions, Better Results” from Harvard Business School Press. She is on the faculty of Columbia University and is the recipient of Consulting Magazine’s Women Leaders in Consulting Award for Lifetime Achievement in 2009.
Bert Haskell is Technology Director of Pecan Street Project Inc., a smart grid research and development organization headquartered at The University of Texas at Austin. After being assigned to the MCC Consortium by Eastman Kodak, Mr. Haskell spent nearly a decade working for the consortium, rising to the position of MCC’s Vice President of Portable Electronics Product Research. In addition to his work for Kodak and MCC, he has held product development, product marketing and advisory rolls at a number of start-up companies including Stellar Display Corporation, Wireless Age, Motion Computing, Portelligent. Prior to joining Pecan Street Project, he was Director of Product Development for Heliovolt, a developer of CIGS based thin-film photovoltaic modules. Mr. Haskell holds a Masters in Mechanical Engineering... Read More.
Amy is the Director of Mathematics at Quid, where she has been since its inception, prototyping and launching the company’s technology for analyzing document sets. A Cambridge Mathematician who previously worked in London modeling cities, Amy is fascinated by complex human systems, and the algorithms and data that help us understand them.
Joseph M. Hellerstein is a Chancellor’s Professor of Computer Science at the University of California, Berkeley, whose work focuses on data-centric systems and the way they drive computing. He is an ACM Fellow, an Alfred P. Sloan Research Fellow and the recipient of two ACM-SIGMOD “Test of Time” awards for his research. In 2010, Fortune Magazine included him in their list of 50 smartest people in technology , and MIT’s Technology Review magazine included his Bloom language for cloud computing on their TR10 list of the 10 technologies “most likely to change our world”.
In 2012, Joe joined with Jeff Heer and Sean Kandel to found Trifacta, a company developing intuitive, powerful, and remarkably useful technologies for data analysis.
Paul Henderson is Head of Business and Clinical Intelligence for Ascribe. In the UK, 82% of National Health Service organisations own at least one Ascribe solution from a portfolio of point-of-care and operational management offerings. Over 130 organisations use BI provided by Paul’s team, which has recently expanded to cover mainland Europe and Asia-Pacific. With a background in qualitative research Paul’s ethos is to provide solutions that identify variation in clinical and operational practice, drive service transformation and improve the quality, safety and cost-effectiveness of services bought and delivered in the geographies his team serves. Last year Ascribe were Microsoft’s global Healthcare Partner of the Year, based on their ability to take innovation and make it immediately relevant, which is exactly what the team are... Read More.
James Hendler is the Tetherless World Professor of Computer and Cognitive Science, and the Head of the Computer Science Department at Rensselaer. He is also a faculty affiliate of the Experimental Multimedia Performing Arts Center (EMPAC), serves as a Director of the UK’s charitable Web Science Trust and is a visiting Professor at DeMontfort University in Leicester, UK. Hendler is the first computer scientist to serve on the Board of Reviewing Editors for Science. In 2010, Hendler was named one of the 20 most innovative professors in America by Playboy magazine and was selected as an “Internet Web Expert” by the US government.
Dave is responsible for assisting Pentaho’s corporate customers and technology partners with the evaluation, development and deployment of big data strategies and solutions. He has nearly 20 years of experience with database technologies, data warehousing and business intelligence and has held a variety of senior positions including business development, product marketing, pre-sales management and professional services at Sagent, Temtec and PLATINUM technology. Dave has been with Pentaho since its commercial inception. He holds a Masters degree in Information Science from the University of North Texas and is a lifelong sailor.
Now – President and Chief Scientist: Kaggle. Earlier – Founder: FastMail; Founder: Optimal Decisions. Long ago – Consultant: McKinsey & Co, A.T Kearney
Jeremy Howard is the President and Chief Scientist at Kaggle. Previously, he founded FastMail (sold to Opera Software) and Optimal Decisions (sold to ChoicePoint – now called LexisNexis Risk Solutions). Prior to that he worked in management consulting, at McKinsey & Company and A.T. Kearney, but he is now nearly fully recovered.
Jeremy’s passion is applying algorithms to data. At FastMail he used algorithms to automate nearly every part of the business – as a result the company only needed a total of 3 full time staff, and got over a million signups. Optimal Decisions... Read More.
Software Engineer @ Cloudera. Apache HBase Commiter, Apache Flume Founder.
Alysa Z. Hutnik is a partner in the Advertising & Marketing and Privacy & Information Security practices at Kelley Drye & Warren LLP in Washington, D.C. Her practice includes representing clients in all forms of consumer protection matters, from counseling to defending regulatory investigations and litigation. Her specific focus is on privacy, data security, and advertising law, including unfair and deceptive practices, electronic and mobile commerce and data sharing. Ms. Hutnik is past chair of the ABA’s Privacy and Information Security Committee (Section of Antitrust), the co-chair of the Section’s 2011 Consumer Protection Conference, and was the editor-in-chief of the ABA’s Data Security Handbook, a practical guide for data security legal practitioners. For more information, visit www.KelleyDrye.com, subscribe to the “www.AdLawAccess.com”: www.AdLawAccess.com blog,... Read More.
Noah Iliinsky strongly believes in the power of intentionally crafted communication. He has spent the last several years researching, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. He is a frequent speaker in both industry and academic contexts. He has a master’s in Technical Communication from the University of Washington, and a bachelor’s in Physics from Reed College. Noah works as a Visualization Expert at IBM’s Center for Advanced Visualization.
Fausto Inestroza is an architect at the Accenture Technology Labs, the dedicated R&D organization of Accenture. His focus is on data platforms, analytical processes and distributed systems.
Sasha Issenberg is the author of “The Victory Lab: The Secret Science of Winning Campaigns.” He is columnist for Slate and the Washington correspondent for Monocle, where he covers politics, business, diplomacy, and culture. He covered the 2008 election as a national political reporter in the Washington bureau of The Boston Globe, and his work has also appeared in New York, The New York Times Magazine, The Washington Monthly, Inc., The Atlantic, Boston, Philadelphia, and George, where he served as a contributing editor. His first book, The Sushi Economy: Globalization and the Making of a Modern Delicacy, was published by Gotham in 2007.
After previous careers in physics and software development, Philipp K. Janert currently provides consulting services for data analysis, algorithm development, and mathematical modeling.
He is the author of two books on data analysis: “Data Analysis with Open Source Tools” (O’Reilly) and “Gnuplot in Action – Understanding Data with Graphs” (Manning Publications).
He holds a Ph.D. in theoretical physics from the University of Washington. Visit his company website at www.principal-value.com
Girish Juneja is CTO of Datacenter Software Division and General Manager of the Big data and Expressway software businesses. Girish has over 21 year experience in building software businesses, at Intel, as an entrepreneur and earlier at Verizon Telecommunications.
Prior to his current position, he was the Director of Application Security & Identity Products in the Software Services Division in SSG. In this role he was responsible for conceptualization, product development, sales & marketing of Intel Expressway software product line for Service Providers, ISVs, OEMs and select end-users. Girish also led the development of Identity software strategy for SSG that led to acquisition of Nordic Edge, development & launch of McAfee Identity Manager & Intel CloudSSO Identity-as-a-Service offering in collaboration with Salesforce.com.... Read More.
Since joining Blue Yonder in 2011, Jan has taken the role as Head of Development and CTO. Blue Yonder develops and applies state-of-the-art predictive analytics to real-world business problems.
Previously, Jan worked in different positions in the software industry.
Jim Kelly leads Quantcast’s R&D team, which works on both adding computing capacity through cluster software innovations and using it up through new analytic and modeling products. Having been at Quantcast for six years, he has seen its data volumes and processing challenges grow from zero to petabytes and led technical and organizational changes that have kept Quantcast a step ahead. Previously he held engineering leadership roles at Oracle, Kana, and Scopus Technology (acquired by Siebel). Jim holds a PhD in physics from Princeton University.
Eron Kelly is a 12 year veteran of Microsoft with a broad set of experiences in product management across the Microsoft Office and Server and Tools Divisions. He is currently the General Manager responsible for product marketing for Microsoft SQL Server and the Data Platform. In this capacity he leads core product marketing, technical readiness, and value proposition for the Data Platform. Before leading the SQL product marketing team, he was a General Manager in the Windows Azure group leading an incubation effort focused on Microsoft’s commerce platform.
Prior to that, he led the product management team for Office 365, the fastest growing business in Microsoft’s history. As the founding member of the product management team, he led the development of the initial... Read More.
Brendan is an engineer on the Geo Developer Relations team. Prior to joining Google, he split his career between teaching math and helping advance the state of the web. Now he gets to do both.
Greg is the Business Development Manager for Big Data at Intel Corporation. He has over 10 years of in-depth experience in business intelligence and data warehousing at Oracle and Business Objects (now SAP). A former U.S. Army officer, Greg holds a master’s degree in Information Systems Management from Carnegie Mellon University.
Josh has been working with data and analytics since 2000, including being the product manager for the first “Datamart in a Box” (Broadbase) and running product management for one of the largest Data and Analytics operations in the world (Yahoo!). Josh is now applying these learnings at Pivotal, where he is building the industry’s first unified Big Data and Analytics Platform.
Jace Kohlmeier is the Dean of Analytics at Khan Academy. He earned degrees in Mathematics and Computer Science as a Kansas State Wildcat, and a Master’s degree in Computer Science from Princeton. He spent six years at Citadel Investment Group, where he cofounded the High Frequency Trading group and oversaw its trading in fixed income, currency, commodities and futures. He is also the cofounder of Teza Technologies, where he served as President and Head of Quantitative Research. After volunteering in 2010, he joined Khan Academy full-time on a mission to apply state-of-the-art data science toward optimized learning.
Nick is the lead data scientist at Accenture working with executives to build global data science
capabilities across the company. Nick and his team operate like Navy Seals focused on quick
win execution. They are regularly inserted into Fortune 500 companies with ugly data and
limited infrastructure. In a short span of time they work with executives to develop strategies
around data science and then rapidly assemble and deploy the solutions. Engagement with a
wide variety of business problems and domains has led Nick to hone his tool set for efficiency.
He also founded the Data Science Linux project, a clean cut linux distribution for data scientists.
Nick holds a BS in statistics and a BA in computer... Read More.
Eugene Kolker is Chief Data Officer at Seattle Children’s and Head of the Bioinformatics & High-throughput Analysis Laboratory at Seattle Children’s Research Institute. In 2001, he co-founded OMICS: A Journal of Integrative Biology and served as its Editor-in-Chief through 2012. He is now Executive Editor of OMICS. In 2011, Eugene co-founded DELSA Global (Data-Enabled Life Sciences Alliance) a non-for-profit organization and serves as its President. In 2012, he co-founded the Big Data journal and serves as its Executive Editor. Dr. Kolker has more than 25 years of transdisciplinary experience in data analysis, integration and dissemination, predictive analytics, software and algorithm development, and informatics. Eugene is also an Affiliate Professor at the Departments of Biomedical Informatics & Medical Education and Pediatrics at the... Read More.
Andy Konwinski is a postdoc in the AMPLab at UC Berkeley focused on large scale distributed computing and cluster scheduling. He co-created and is a committer on the Apache Mesos project that has been adopted by Twitter as their private cloud platform. He also worked with systems engineers and researchers at Google on Omega, their next generation cluster scheduling system. More recently, he lead the AMP Camp Big Data Bootcamp and has been contributing to the Spark project.
Mr. Krishnan is a recognized expert worldwide in the strategy, architecture and implementation of high performance data warehousing and big data solutions. He is a visionary data warehouse thought leader, ranked as one of the top data warehouse consultants in the world, and an independent analyst, writing and speaking at industry leading conferences, user groups and trade publications. He has authored eBooks, over 100 plus articles, viewpoints and case studies in Business Intelligence, Data Warehousing and Data Warehouse Appliances and Architectures.
A recognized authority on Unstructured data integration, text mining and text analytics. Along with Bill Inmon, he is promoting the next generation of data warehousing and business intelligence around BIG Data,Hadoop, NoSQL, Unstructured Data Integration and Social Intelligence.
Krish presents and speaks at... Read More.
As the co-founders and CTO of Infochimps, my mission is to build tools to organize, connect and comprehend massive information streams.
At Infochimps, we’ve built a cloud architecture capable of spinning up clusters as large as 60 nodes with the push of a button, using devops to make this possible with minimal distraction. We’re also the authors of Wukong, the most-used frameworks for Ruby streaming in Hadoop.
I am CTO and co-founder of infochimps.com, the leading big data platform in the cloud. At infochimps we’ve built a scalable architecture that allows app programmers and statisticians to quickly and confidently manipulate data streams at arbitrary scale — terabytes in size, thousands of events per second, dozens of disparate data sources. We use a mixture of Hadoop, Elasticsearch, Storm/Kafka, Goliath and other industrial-strength solutions.
My projects include Wukong, (the most-used frameworks for Ruby streaming in hadoop); Ironfan (cloud orcestration capable of spinning up clusters large or small at the push of a button); Configliere (ruby configuration made easy); and I’m a core committer to Goliath (liquid fast concurrent web framework).
I am the author of “Big Data for Chimps”, a book on data... Read More.
Michael Lang is CEO and founder of Revelytix. He has worked in various aspects of the enterprise information management space for 25 years, including roles at Reuters, Vitria, Metamatrix (founder and president, sold to Red Hat) and Revelytix. For the past 10 years he has worked closely with the U.S. Department of Defense innovating solutions to complex enterprise data management problems.
Justin is CEO of Zoomdata, Inc. Prior to Zoomdata, Justin was the co-founder of Clarabridge and the inventor of Clarabridge’s award-winning, patented, text analytics software.
Prior to Clarabridge, Justin co-founded and was CTO of Claraview, a BI strategy and technology consultancy, which was sold to Teradata in 2008. Before founding Claraview, Justin served as founder and CTO of Strategy.com, a real-time data analysis and alerting subsidiary of MicroStrategy.
Prior to launching Strategy.com, he was a technology program manager and consultant at MicroStrategy, designing the second-generation web-based BI tool for MicroStrategy, and working with large customers on their BI deployments.
In the early 1990′s Justin was active in the BBS community, and he authored and marketed the EIS-PC BBS system.... Read More.
Ruben Lara heads the user modelling area of Telefonica Digital, part of its Product Development and Innovation unit. He holds a PhD in computer science.
Eric Legrand is a marketing scientist at Wells Fargo where he focuses on building tools to measure marketing performance. He has developed compelling analyses, visualizations, and dashboards that are used both by executives as well as operational managers. Previous to Wells Fargo Eric managed pricing and cost models at Niman Ranch, a leading natural meat company. Prior to Niman Ranch, he worked as a software developer at a number of start-ups that are no longer with us. Eric studied mathematics and philosophy at Wheaton College, IL, received a certificate in Management Science and Engineering at Stanford, and is currently working on an MS in Predictive Analytics at Northwestern University. Eric lives with his wife and daughter in San Francisco, CA.
Greg Lindahl is the Founder and CTO of Blekko. Previous to founding Blekko, he founded PathScale where he architected the software and hardware and saw the company through to a highly successful exit. He is the author of several patents.
Ben Lloyd has been with NetApp for 6 years, and spent more than 25 years in IT, managing solution implementation projects, Quality, and Support teams. He’s managed or architected projects in Data Warehousing, SFA, Workflow Management, ERP, CRM, Web, Content Management, Fault Diagnosis, and Analytics. In 2011, he took a role in NetApp’s Customer Support organization managing the program to implement the next generation solution for handling NetApp’s phone-home application (AutoSupport).
I design and build analysis and decision support systems, and building data management and access infrastructure. Research focus these days is on analysis techniques, emerging technology and practices in analytics, BI, information management, user experience for data access & delivery applications. I speak at a lot of conferences on anything data, with a bunch of history of science and technology mixed in.
I focus on two types of work: using data to make decisions and manage organizations, and building data technology infrastructure. A big part of making decisions and using data in a corporate setting is ensuring that the right data capture and data delivery infrastructure is in place to manage the business. As a result, I do as much information strategy and IT architecture... Read More.
Walt Maguire is the Analytics Director for ParAccel. Walt started working with data when Big Data meant less than a megabyte – writing his first analytic routines on an IBM XT he’d built from parts purchased at Fry’s Electronics in 1988. He built his first linux system before linux had a name – downloading the kernel from the University of Helsinki in the early 1990’s to systems built from parts. Since then, he has worked in analytics settings ranging from mainframe programming for academia to managing a multi-terabyte data infrastructure for an internet company to working with companies running petabyte–scale data warehouses. At one time or another he’s worked with most of the tools and technologies data management and analysis. His focus today is... Read More.
Justin Makeig is a Director of Product Management at MarkLogic where he oversees the suite of applications, tools, and APIs built around MarkLogic’s enterprise NoSQL database. He manages the company’s Hadoop strategy along with front-end application development and administration tools. Justin has over 10 years of experience designing, developing, and bringing to market data-driven applications for start-ups and large organizations using web and Big Data technologies. He holds an MBA from the University of California, Berkeley.
Luis Maldonado is the Director of Product Management for HP Vertica where he is responsible for driving strategy and development of the Vertica product portfolio. He has over 20 years of experience developing and productizing infrastructure software and services. Prior to HP Vertica, Luis held product leadership roles for companies such as Akamai, Progress Software and DataSynapse (now TIBCO Software). Luis holds a Bachelor’s degree in Computer Science and Engineering from MIT.
Dean Malmgren is co-founder and managing partner of Datascope Analytics. As an author of several peer-reviewed publications on big data analytics and visualization, Dean is excited about bringing cutting-edge techniques out of research and into practice. When not teasing himself or others, Dean can be found swimming, cycling, or running for silly long distances. Dean received a BS in math and chemical engineering from the University of Michigan and a PhD in chemical engineering from Northwestern University.
Michael is a Developer Programs Engineer supporting developers who work with Google Cloud and Apps products. With many years of experience working on Internet media projects for non-profit organizations, he especially enjoys helping educational institutions “Go Google.” Michael has written for tech blog ProgrammableWeb.com, has spent time in rural Uganda researching mobile phone use, and has a Masters degree in Information Management and Systems from UC Berkeley’s School of Information.
Steve Mardenfeld is a Data Engineer for the Big Data Team at Etsy, where he uses Hadoop and other tools to process and analyze large data sets. Previously to Etsy, he worked as a Senior Data Analyst for a criminal justice agency, where he focused on utilizing data to help inform criminal justice policy issues.
Sanjai Marimadaiah is the WW Business Leader for Big Data solutions in the Converged Application Systems division of Enterprise Group at HP. Sanjai manages the portfolio of solutions based on Hadoop, Vertica, Autonomy and emerging database software. He drives the strategy, product management and go-to-market activities. These solutions extend to the application the HP Converged Infrastructure strategy-the company’s approach for next-generation data centers that enables customers to increase agility, lower costs of operations and drive innovation into their businesses.
Mano joined Google’s Geo API team in 2006. He helps people all over
the world develop and deploy their content in KML and Google Maps,
working with large companies, small startups, and international aid
organizations. Before coming to Google, Mano had an eclectic career
that involved getting a Masters in History, a Masters in Information
Management and Systems, and working as a data manager in social service and public benefit organizations for over a decade.
Nathan Marz is the lead engineer on Twitter’s Publisher Analytics team. He was previously the lead engineer at BackType before being acquired by Twitter in July of 2011.
Nathan is the author of numerous open-source projects relied upon by companies all around the world. These include Cascalog, ElephantDB, and Storm.
He has spoken about his work at conferences such as the Hadoop Summit, Strange Loop, Gluecon, Clojure/conj, and POSSCON. He writes a blog at http://nathanmarz.com.
Kate Matsudaira is an experienced CTO and has spent the last 7 years immersed in startups in architect and CTO roles. Prior to joining the startup world she spent time as a software engineer, and technical lead/manager at Amazon and Microsoft. Kate has a passion for big data and all the challenges that come with it. She has lot of experience building large scale distributed web systems, with some mobile, cloud computing and engineering leadership thrown in for good measure. She maintains a blog at http://katemats.com.
Co-founder and CEO of DataPad (http://www.datapad.io). Innovating analytics and data visualization tools. Author of “Python for Data Analysis” from O’Reilly Media. Created pandas project.
Stephanie McReynolds is Vice President, Product and Marketing at ClearStory Data, which helps businesses analyze and explore Big Data from public and private sources. Previously, Stephanie was Senior Director of Product & Technical Marketing at Teradata Aster, where she brought to market the Teradata Aster MapReduce Platform. Stephanie joined Teradata through the acquisition of Aster Data.
Stephanie has more than a decade of experience in product management and marketing for business intelligence, data warehouse and complex event processing products at organizations including, Oracle, PeopleSoft, and Business Objects.
Fernando Mesa is a technology leader with 20+ years of experience in the IT industry. At MarkLogic Fernando is responsible for the creation, communication and implementation of the technical vision for the company in the Enterprise space. He also provides strategic guidance to Enterprise organizations of all sizes from small startups to fortune 500 organizations. He has expertise in the implementation of emerging technologies in large scale systems and is a frequent speaker at technology events and conferences.
Prior to joining MarkLogic, Fernando worked at United Health Group, where he served as Vice President of Enterprise Architecture. He has also held a variety of senior roles in Media and Healthcare corporations.
Timothy Mohn is a entrepreneur and scientist with 20 years of technology experience. Previously he developed HBOGO, founded Pratt Institute’s Digital Arts Research Lab, and worked with a renowned design firm where he focused on replicating and enhancing physical architectural experiences in the cloud. He was one of the founders of MedNet Systems, a startup focused on a collaborative diagnosis platform for liver biopsy diagnosis and a cloud based certification platform for the FDA and large pharmaceutical organizations. Follow Tim on Twitter – @m3thdman.
Timothy’s work has been featured in the New York Times, Wall Street Journal, Wired, and elsewhere and has won numerous awards from AIGA, ID Magazine, Critique Magazine and Communication Arts. He holds a MS from NYU where... Read More.
Big data for a better world! Rob is the CEO of Idibon, who are tackling the problem of extracting information from unstructured speech and text in the world’s connected languages — all 5000 of them. His background includes building infrastructure in Sierra Leone and Liberia, running crowdsourced translation platforms for Haiti, and work in language processing technologies that support a number of Silicon Valley search-engines and start-ups. He has a PhD from Stanford University.
Sean Patrick Murphy, with degrees in mathematics, electrical engineering, and biomedical engineering and an MBA from Oxford University, has served as a senior scientist at the Johns Hopkins Applied Physics Laboratory for the past ten years. Previously, he served as the Chief Data Scientist at WiserTogether, a series A funded health care analytics firm, and the Director of Research at Manhattan Prep, a boutique graduate educational company. He was also the co-founder and CEO of a big data-focused startup: CloudSpree.
Scott Murray is a code artist who writes software to create data visualizations and other interactive phenomena. His work incorporates elements of interaction design, systems design, and generative art. Scott is an Assistant Professor of Design at the University of San Francisco, where he teaches data visualization and interaction design. He is a contributor to Processing, and is author of the forthcoming O’Reilly title “Interactive Data Visualization for the Web”.
Data Scientist for Concurrent in SF, and a committer on the Cascading open source project. 10+ years leading innovative Data teams, 25+ yrs in tech industry overall. Background in math/stats and distributed computing. Expertise in Hadoop, R, AWS, predictive analytics, machine learning, NLP
Tim has authored and co-authored a number of books for O’Reilly including: Maven: The Definitive Guide, The Jakarta Commons Cookbook, Maven: A Developer’s Notebook, and Harnessing Hibernate. He has also written a number of popular open source books including: Maven by Example, Maven: The Complete Reference, Repository Management with Nexus, and The Maven Cookbook.
Tim focuses on helping developers understand various topics including development infrastructure, general architecture, and emerging approaches to persistence. He has contributed to open source projects at Apache.
Vadim Ogievetsky is a frontend developer at Metamarkets where he uses D3.js in a production, client-facing environment. Prior to Metamarkets, in 2008, he received a BA degree in Mathematics and Computer Science from Oxford University and, in 2011, graduated from Stanford University with a Master’s degree in Computer Science specializing in Human Computer Interaction. While at Stanford he was part of the Data Visualization group where he contributed to Protovis. His open-source development is now focused on DVL, a reactive data
flow library for dynamic data visualization built on top of D3.js.
Jennifer Pahlka is the founder and executive director of Code for
America, which is dedicated to the idea that government can work for
the people, by the people, in the 21st century. She is an Ashoka
fellow, and received the Internet and Society Award from the Oxford
Internet Institute in 2012. Government Technology named her one of
2011’s Doers, Dreamers and Drivers in Public Sector Innovation and the
Huffington Post named her the top Game Changer in Business and
Technology the same year. She is known for her TED talk, Coding a
Better Government, and is a frequent speaker. Previously, she ran the
Web 2.0 and Gov 2.0 events for TechWeb, in conjunction with O’Reilly
Tim celebrates software development using many languages and frameworks, heeding less to past experience in choosing technologies. Spring MVC, Hibernate, Rails, .NET MVC, Django and the variety of languages that come with are in his L1 cache. Among other endeavors to keep him sharp, he currently provides coded solutions for the Software Engineering Institute at CMU.
Tim received a B.S. in Computer Engineering in 2003 and resides in Pittsburgh, PA.
Zach Pardos is a researcher at Massachusetts Institute of Technology exploring data driven methodologies to aid learners and educators using virtual learning environments. He earned his PhD in Computer Science at Worcester Polytechnic Institute in the Tutor Research Group in 2012. During his PhD he spent extensive time on the front lines of K-12 education working with teachers and students to integrate educational technology into the curriculum as an assessment tool to be used formatively. He is an emerging leader in the field of Educational Data Mining and has received numerous academic awards and honors for components of his thesis work on “Predictive Models of Learning” including a top prize applying his educational analytics in the 2010 KDD Cup, an international big data competition... Read More.
Sonali Parthasarathy is a developer at the Accenture Technology Labs. Her primary areas of interest include ETL processes, Hadoop, Pentaho, Storm and complex event processing.
Priyank Patel is Director, Product Management at Teradata Aster. Priyank joined Aster as its 3rd employee working on building out the core product in the Engineering team. Since then he has held various roles in Engineering Management, Field Engineering and is currently responsible for Product Management of Aster’s SQL-MapReduce framework and Analytical Libraries built on it. Before joining Aster Priyank held engineering roles at Microsoft Corporation where he worked on the Windows OS. Priyank holds a Master Degree in Computer Science from Stanford University and a Bachelors in Computer Engineering from Gujarat University.
Ashish Patel is one of the founders of Not Only Development which is focused on Open and collaborative projects to improve healthcare.
Ashish comes from a career as an information security expert
DJ is the “Data Scientist in Residence” at Greylock Partners.
Previously he was the Chief Product Officer for Color and the Chief Scientist at the LinkedIn Corporation, leading the Analytics and Data Teams. Some of the products shipped include, People You May Know, Who’s Viewed My Profile, Talent Match, Skills, and Career Explorer.
He has held roles at Skype, PayPal, and eBay. As was a member of the faculty at the University of Maryland, he helped start a major research initiative on numerical weather prediction. As an AAAS Science & Technology Policy Fellow for the Department of Defense, Dr. Patil directed new efforts to leverage social network analysis and the melding of computational and social sciences to anticipate emerging threats to the US. He... Read More.
Angelica Peralta Ramos is Multimedia Development Manager and LA NACION DATA project leader at LA NACION.
She co-founded Lanacion.com in 1995 and was Project Leader, Product Manager, Product Development Manager and Digital Media General Manager from LA NACION until 2004. In those years she was in charge of leading the operations, the team and the product development which included Lanacion.com´s first steps in digital marketing and sales.
Since 2006 she is in charge of a Research, Development and Training area that helps LA NACION transform it´s newsroom into multiplatform ready, including the introduction of multimedia, social media and mobile journalism skills, technology and tools and the introduction of new formats for digital content.
Since 2011 she also leads a project... Read More.
Louis Perrochon is an Engineering Director at Google and he currently leads the team that builds Google’s geospatial products for businesses, governments and non-profits. These products enable you to put “your data on the Google map” and include Google Earth, Google Maps Engine and Google Earth Engine. His team also processes all aerial and satellite imagery for Google Earth and Google Maps. Previously, he has been leading Engineering for YouTube, which processes over 72 hours of video per minute and serves over 4 billion videos a day. His academic credentials include a PhD in Computer Science from the Swiss Federal Institute of Technology and 3 years post-doctoral research at Stanford University.
Mr. Peterson is a seasoned technology executive with over 30+ years of experience managing architecture and data services organizations. In his current role as Vice President of Platforms and Data Architecture at Neustar, Inc., Mr. Peterson is responsible for the Data Services group that has introduced Big Data technologies, an architecture team responsible for establishing and improving common technology platforms, and the Web Services infrastructure for Neustar. He has been instrumental in moving Neustar, inc toward the utilization of open source technologies.
Data Scientist at LexisNexis Risk Solutions. I have responsibilities to the core HPCC Systems platform technology and spearhead large scale graph analytics projects working with big data, for various industries to help customers target fraud, collusion and other red flag indicates. Prior to LexisNexis, Mr. Prichard worked for Topspeed Software R&D in London.
Prasad Ram (aka Pram) is the Founder and CEO of Gooru. While working at Google, Pram devised a prototype of a search engine for learning. What began as a “20% effort” evolved into a year-long pilot in India that included 1,000 students across 25 classrooms. Pram subsequently left Google to pursue Gooru as a non-profit education technology start-up in Silicon Valley with a mission to honor the human right to education.
Prior to founding Gooru, Pram worked as the head of Google R&D in India, CTO for Yahoo! India and research scientist at Xerox Research. He has led the development of many products such as Google Maps, News and Translate as well as many technologies in search, ad targeting, and digital rights management.... Read More.
Before joining Impermium, co-founder and CTO Vish led engineering for Yahoo! Mail, Yahoo! Search and Insights groups. Vish has a background in machine learning and optimization but has dabbled in graphics and drug discovery.
Kim Rees is a founding partner of Periscopic: http://www.periscopic.com, an award-winning information visualization firm. Their work has been featured in the MOMA as well as several online and print publications, including CommArts’ Interactive Annual, The Information Design Sourcebook, Adobe Success Stories, CommArts Insights, Infosthetics.com, FlowingData.com, and numerous websites, blogs, and regional media outlets. Periscopic’s body of work was recently nominated for the Cooper-Hewitt National Design Awards.
Kim is a prominent individual in the information visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is a guest blogger for Infosthetics.com. Kim has been featured on CommArts Insights and has presented at several industry events including Strata, the Tableau Software Conference,
Naomi B. Robbins is the author of Creating More Effective Graphs, first published by John Wiley (2005). She also blogs on effective graphs for forbes.com. She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice.
Monica is a data scientist with a passion for turning data into products, actionable insights, and meaningful stories. As the VP of Data for Jawbone, she focuses on developing data-driven products that promote a healthier lifestyle and on finding stories in the UP wristband data.
Prior to Jawbone, Monica was one of the early members of the LinkedIn data science team, where she developed and improved some of LinkedIn’s key data products for matching jobs to passive candidates, discovering people you may know, and recommending groups you may like.
Monica’s compelling data stories are often picked up by the mainstream press, including the Wall Street Journal, The Economist, NPR and CNN. Monica holds a Ph.D. in Computer Science from CMU, where... Read More.
Simon Rogers is editor of the Guardian’s Datablog and Datastore, an online data resource which publishes hundreds of raw datasets and encourages its users to visualise and analyse them. He is the author of Facts are sacred: the power of data available now on Kindle. Simon is also a news editor on the Guardian, working with the graphics team to visualise and interpret huge datasets. He was closely involved in the Guardian’s exercise to crowdsource 450,000 MP expenses records and the organisation’s coverage of the Afghanistan Wikileaks war logs. Previously he was the launch editor of the Guardian’s online news service and has edited the paper’s science section. He has edited two Guardian books: How Slow Can You Waterski and The Hutton Inquiry... Read More.
Duncan has been a data miner since the mid 1990s. He was Director of Advanced Analytics at Teradata until 2010, leaving to become Data Director of Experian UK. He rejoined Teradata to lead their European Data Science team.
At Teradata he has been responsible for developing analytical solutions across a number of industries, including warranty and root cause analysis in manufacturing, and social network analysis in telecommunications. These solutions have been developed directly with customers and have been deployed against some of the largest consumer bases in Europe.
In his spare time Duncan has been a city Councillor, chair of a national charity, founded an award winning farmers’ market, and is one of the founding Directors of the Society of Data Miners.
Jason works at Datastax as a senior Big Data engineer architecting,
developing and supporting the Datastax Enterprise product line which
includes Solr integrated with Cassandra. His career has involved an
array of technologies including search, Hadoop, Hive, mobile phones,
cryptography, and natural language processing. Jason has been
developing solutions with Lucene and Solr for more than 7 years and is
a co-author of Programming Hive from O’Reilly. Jason frequently
gives tutorials and speaks at conferences such as Strata, Cassandra Summit, ApacheCon and others.
John Santaferraro is the Vice President of Marketing for Actian Analytics Platform. Prior to joining Actian, Santaferraro was an independent industry analyst in the business intelligence and analytics market. Before that he developed and executed a vertical market strategy for Hewlett Packard’s business intelligence group, focusing on energy, communications, retail, healthcare and financial services. At Hewlett Packard, he was also instrumental in helping establish the new business intelligence business group with a combination of solutions, products, and consulting. In 2000, John founded a marketing and sales consulting company, Ferraro Consulting, providing business acceleration strategy for technology companies. Along with business intelligence executive positions in Compaq Computers and Tandem Computers, Santaferraro co-founded a venture-backed, data warehouse startup company, Virtual Integration Technology, that was later sold to... Read More.
Pete is the VP of Products at Platfora. He is responsible for early development, product design, and the roadmap for Platfora’s products. Pete spends most of his time working with enterprise customers to solve their big data BI needs. Prior to Platfora, he was VP of Products and Marketing at Solera Networks, building network security analytics products to store, search, and analyze every packet traversing 10 GigE networks. Early in his career he lead Products for IronPort Systems – the first company to solve spam at Internet scale – which was acquired by Cisco Systems in 2007.
Bill Schmarzo, author of the upcoming “Big Data: Understanding How Data Powers Big Business” to be published by Wiley, is responsible for setting the strategy and defining the service line offerings and capabilities for the EMC Consulting Enterprise Information Management and Analytics service line. He’s written several white papers and is a frequent speaker on the use of Big Data and advanced analytics to power organization’s key business initiatives.
Bill has more than two decades of experience in data warehousing, BI and analytic applications. Bill authored the Business Benefits Analysis methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse... Read More.
John Schroeder founded MapR to address core limitations that were limiting the adoption of Hadoop and expand Hadoop capabilities to serve new business critical and real-time applications.
John has led companies creating innovative and disruptive business intelligence, database management, storage and virtualization technologies at early stage ventures through success as large public companies. John was previously CEO of Calista Technologies (Microsoft), CEO of Rainfinity (EMC) and Senior Vice President of products and marketing at Brio Technologies (BRYO).
Dr. Rachel Schutt is a Senior Research Scientist at Johnson Research Labs. Prior to that, she was a Senior Statistician at Google Research in the New York office. She is also an Adjunct Assistant Professor in Columbia’s Statistics Department, and is a founding member of the Education Committee for the Institute for Data Sciences and Engineering at Columbia. Rachel is co-authoring a book (with Cathy O’Neil) called “Doing Data Science” to be published by O’Reilly in 2013.
Her interests include statistical modeling, exploratory data analysis, machine learning algorithms, and social networks, as well as the ethical dimensions of Data Science, and using Data Science to do good. She holds several pending patents. She is a frequent speaker at conferences and universities.
She earned her PhD... Read More.
Sam Shah is a principal engineer on the LinkedIn data team. He leads many of the site’s large-scale recommendation and analytics systems, which analyze hundreds of terabytes of data daily to produce products and insights that serve LinkedIn’s members. His work involves pure research, product-focused features, and infrastructure development, including social network analysis, recommendation engines, distributed systems, and grid computing. Some of the products under his purview include “People You May Know”, “Who’s Viewed My Profile?”, Skills & Endorsements, related searches, job recommendations, and more. Sam holds a Ph.D. in Computer Science from the University of Michigan.
Sharmila Shahani-Mulligan is CEO & Founder of ClearStory Data, which helps businesses broaden insights by interactively exploring and analyzing Big Data from a combination of private, public/web and premium sources. Prior to ClearStory, Sharmila was EVP at Aster Data and prior to that, CMO of the largest business unit within HP Software and was responsible for a $3+ billion software products portfolio. Prior to HP, she was CMO/EVP at Opsware, where she joined Marc Andreessen and Ben Horowitz to re-position Loudcloud into a software company and formed Opsware in 2002, which was eventually acquired by HP for $1.65 billion. She has also held executive positions at Netscape, Kiva Software which was acquired by Netscape, AOL and Totality.
Chang She is a cofounder of Lambda Foundry. From 2011 to 2012, he served as Assistant Vice President at Barclays Capital researching quantitative FX strategies and building research infrastructure. From 2006-2011, he worked at AQR Capital Management in global equities research and algorithm execution. He graduated from MIT with an M.Eng in Computer Science and S.B. degrees in Computer Science and Political Science.
Jayant is Solutions Architect at Cloudera working with various large and small companies in various Verticals on their Big Data Use Cases, Architecture, Algorithms and Deployments. Prior to Cloudera Jayant also worked at Yahoo where he was instrumental in building out the large scale Content/Listings Platform using Hadoop & Big Data technologies and working with various Yahoo Properties, Real Estate, Autos, Local, News, Movies etc. Prior to Yahoo, Jayant worked at eBay building out a new Shopping Platform (K2) using Nutch/Hadoop, Search Intelligence Platform, among others. Jayant also worked at KLA-Tencor building software for Reticle Inspection Stations and defect analysis systems. Jayant has Bachelor’s degree in Computer Science from IIT Kharagpur and Master’s degree in Computer Engineering from San Jose State University
Rebecca Shockley is the Global Research Leader for Business Analytics and Optimization with the IBM Institute for Business Value, where she conducts fact-based research to develop thought leadership for senior executives.
Rebecca has been a management strategy consultant with IBM since 1997, and spent more than 12+ years consulting with top tier clients on how to transform data within an organization into insights that drive better customer relationships.
Jeff Shute is the lead engineer on F1. Jeff has been focused on distributed storage systems for the ads team since joining Google in 2005. Jeff studied computer science and math at University of Waterloo and previously worked at a company building a high-performance column-store database engine.
Marcio Silva is a Lead Data Mining Engineer at the MITRE Corporation’s Center for Advanced Aviation System Development (CAASD). Marcio holds a B.S. in Computer Science from George Mason University, and his areas of focus include data intensive computing, visualization, and web application development.
Prior to joining MITRE, Marcio held positions at Blackboard, Celera Genomics and Applied Biosystems where he helped develop a wide-range of data-heavy software products, including bioinformatic client applications, learning management systems, and life-science research portals.
Pete Skomoroch is a Principal Data Scientist at LinkedIn where he leads a team focused on identity, reputation, information extraction, and building data driven products. He was also the creator of LinkedIn Skills http://www.linkedin.com/skills/
Prior to LinkedIn, he was based in Washington, DC where he mined insights from search query data as the Director of Advanced Analytics at Juice Analytics and as a Sr. Research Engineer at AOL Search. While in DC, he also founded DataWrangling.com, which provided custom data mining solutions to clients in bioinformatics, finance, and cloud computing.
He spent the previous 6 years in Boston implementing Biodefense pattern detection algorithms for streaming sensor data at MIT Lincoln Laboratory and constructing predictive models for large retail datasets at Profitlogic (now Oracle... Read More.
David Smith is the Vice President of Marketing and Community at Revolution Analytics, the leading provider of software and services for the open-source R statistical language. David writes daily about applications of R, analytics and open-source software at the Revolutions blog (blog.revolutionanalytics.com), and was named a top 10 influencer on the topic of “Big Data” by Forbes. He is the co-author (with Bill Venables) of the tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Prior to joining Revolution Analytics, David was the director of product management for S-PLUS at Insightful, Inc. Follow David on Twitter as @revodavid.
Dr Arfon Smith is Director of Citizen Science at the Adler Planetarium in Chicago and Technical Lead of the Zooniverse (www.zooniverse.org). He leads a team of developers, educators and scientists who build citizen science projects across a range of disciplines including astrophysics and papyrology. He gained a PhD in Astrochemistry from The University of Nottingham (2006) and subsequently worked as a senior software developer in the production software group at The Wellcome Trust Sanger Institute in Cambridge (UK). In 2008 he joined the Zooniverse team at University of Oxford and has coordinated the development of more than 20 citizen science projects and grown the Zooniverse community to more than 680,000 volunteers.
Anna Smith is a resident data scientist at bitly in New York while being ‘in absentia’ from the University of Oregon physics doctorate program. Recently, she has published in both Forbes and Publications of the Astronomical Society of Australia. Her interests include manipulating data and catching up on the latest celebrity gossip.
Sarah Sproehnle is the Director of Educational Services for Cloudera
where she helps customers learn to use Apache Hadoop for big data
processing. Cloudera provides commercial support, training and
services for the Apache Hadoop platform.
Julie Steele is the Content Editor for Strata at O’Reilly Media. She is co-author of Beautiful Visualization and Designing Data Visualizations. She finds beauty in exploring complex systems, and thinks in metaphors. She is particularly drawn to the visual medium as a way to understand and transmit information.
Julie holds a Master’s degree in Political Science (International Relations) from Rutgers University in Newark. She lives in New York City, where she cooks, reads, designs, and practices yoga. You can find her blogging occasionally for O’Reilly Radar, or on Twitter.
Carl Steinbach is a software engineer at Citus Data, as well as a committer and PMC member on the Apache Hive project. Previously Carl worked at Cloudera where he led the Hive team, at NetApp where he developed storage encryption products, and at Oracle where he was a member of the Server Technologies group. Carl holds B.S. and M.Eng. degrees in Computer Science from MIT.
Ion Stoica is a Professor of Computer Science at UC Berkeley, where he does research on cloud computing and networked computer systems. Past work includes the Dynamic Packet State (DPS), Chord DHT, Internet Indirection Infrastructure (i3), declarative networks, replay-debugging, and multi-layer tracing in distributed systems. His current research includes resource management and scheduling for data centers, cluster computing frameworks, and network architectures. He is the recipient of a SIGCOMM Test of Time Award, the CoNEXT Rising Star Award, the PECASE Award, and the ACM doctoral dissertation award. Ion also co-founded Conviva, a startup to commercialize technologies for large scale video distribution.
Mike Stringer is co-founder and managing partner of Datascope Analytics. A physicist gone wild, Mike is passionate about realizing the potential for big data analytics to make a positive impact on business and society. Despite wearing a suit occasionally, he still finds the time to argue with others about Bayesian statistics, explore the amazing food in Chicago, and has recently become fascinated with typography. Mike received a BS in Engineering Physics from the University of Colorado and a PhD in physics from Northwestern University.
Ryan is a data developer at Think Big Analytics. He leads technical consulting projects for big data implementations at Fortune 500 clients. He has in depth experience working with Solr/Lucene and the Hadoop stack.
Rajat Taneja is Chief Technology Officer of Electronic Arts, the world’s leading developer and publisher of interactive entertainment.
Mr. Taneja joined EA in October 2011 as CTO to drive critical technology decisions and investments for EA on a global basis. Before joining EA, Mr. Taneja spent 15 years at Microsoft where he most recently lead the division responsible for development and deployment of all commerce and transaction technologies across Microsoft’s connected services, including Xbox Live®, Windows Phone®, Windows Azure™, Microsoft® Office 365, Microsoft adCenter® and MSN®. In previous roles at Microsoft, Mr. Taneja was responsible for leading the company’s online digital advertising platforms and its first business online service offering.
Prior to Microsoft, Mr. Taneja held roles at Digital Equipment Corporation and Wipro Information... Read More.
Fred Trotter is the leading consultant and advocate for Free/Libre and Open Source (FOSS) Health Software. In recognition of his role within the Open Source Health Informatics community, Trotter was the only Open Source representative invited by the NCVHS to testify on the definition of ‘meaningful use’.
Trotter has contributed code to FreeMed, OpenEMR is the current project manager of MirrorMed and the original author of FreeB, the worlds first GPL medical billing engine. In 2004 Fred Trotter received the LinuxMedNews achievement award for work on FreeB. Fred Trotter manages the Open Source EHR review project with the American Medical Informatics Association (AMIA), Open Source Working Group (oswg). Fred is also a member of WorldVistA.
Fred Trotter is a recognized... Read More.
Eric Tschetter is the lead architect of Druid, Metamarkets’ distributed, in-memory database. He held senior engineering positions at Ning and LinkedIn before joining Metamarkets. At LinkedIn, Eric productized LinkedIn’s PYMK with Hadoop. He holds bachelors degrees in Computer Science and Japanese from the University of Texas at Austin, and a M.S. from the University of Tokyo in Computer Science.
Daniel Tunkelang leads LinkedIn’s efforts around query understanding. Before that, he led LinkedIn’s product data science team. He previously led a local search quality team at Google and was a founding employee of Endeca (acquired by Oracle in 2011). He has written a textbook on faceted search, and is a recognized advocate of human-computer interaction and information retrieval (HCIR). He has spoken at three previous Strata conferences, and is on the editorial board of the Journal of Big Data. He has a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
Joseph Turian, Ph.D., heads MetaOptimize LLC, which consults on predictive analytics, business intelligence, NLP, ML, and data strategy. He also run the MetaOptimize Q&A site, where Machine Learning and Natural Language Processing experts share their knowledge. He specializes in large data sets.
Joseph Turian holds a Ph.D. in computer science (with a focus on Machine Learning and Natural Language Processing) from New York University since 2007. During his graduate studies, he developed a fast, large-scale machine learning method for parsing natural language. He received his AB from Harvard University in 2001.
As a scientist, Joseph Turian has over 14 refereed publications in top NLP + ML conferences. His team submitted the best parser in EVALITA 2009 Main+Pilot tasks. He is... Read More.
Vineet Tyagi has over 17 years of experience on product development, managing innovation and creating enterprise grade solutions for Healthcare, Transportation, Insurance and Financial sectors. His expertise includes Big Data, Hadoop, Cloud Computing and Agile/Lean software development. Vineet has spearheaded the design & architecture of several large & cloud scale solutions for data lifecycle management at petabyte scale. Vineet also leads many Open source contributions that have received global recognitions and is a noted speaker on Big Data Technologies.
Mauricio Vacas is a solution architect for Accenture Technology Labs, Accenture’s dedicated R&D group, focusing on Data and Platforms. Currently, Mauricio is investigating the role of analytics and visualization in big data and developing a platform to support real-time and ad-hoc analysis of data using tools such as R, Mahout, Hadoop, and Cassandra among others.
Marck Vaisman is a data scientist, consultant, entrepreneur, master munger and hacker. Marck is Senior Data Scientist at DeepMile Networks. Previously he’s been the Principal Data Scientist at DataXtract, LLC helping clients with all kinds of data science projects. His professional experience spans the management consulting, telecommunications, Internet, and technology industries. Marck is the co-founder of Data Community DC [http://www.datacommunitydc.org], an organization focused on building the Washington DC area data community by running meetup and other events. He has an MBA from Vanderbilt University and a B.S. in Mechanical Engineering from Boston University. Marck is a contributing author to O’Reilly’s Bad Data Handbook.
Shivakumar Vaithyanathan is the IBM Chief Scientist for Text Analytics and the Department Manager of the Intelligent Information Systems Group at the IBM Almaden Research Center. Since joining IBM in 1998, he has been involved in multiple research areas including development of learning algorithms, especially for extremely high-dimensional sparse data. His department is currently involved in building systems for Scalable Unstructured Analytics, Enterprise Search and Large-scale machine learning and Statistical Modeling. Technologies developed in his department are embedded in multiple IBM big data products.
Jen is an innovation strategist at Luminary Labs, which helps companies make innovation real. A former Wall Street Analyst and Economist, and has held executive management roles at Organic, Frog Design. Jen was a Partner in Drillteam which was acquired by Dachis Group, where she served as EVP Services, Managing Director, overseeing service design, strategy consulting, and delivery. In addition to serving on the board of the Designers Accord, Jennifer writes and speaks on the topic of social business and sustainable design, and is an Adjunct Professor at NYU’s ITP school and SVA’s Products of Design.
Jen has a BA in comparative religion from Trinity College, and an MBA from HEC in Paris, and you can reach her @jenvandermeer.
Doug VanderMolen is Chief UX Architect of ClearStory Data. Before joining ClearStory Data, Doug led the user experience for Google Analytics, Google AdWords and other Google Ads products. Doug’s designs have helped millions of people intuitively understand and utilize data to make key decisions. Prior to Google, Doug was a key member of the team at MeasureMap, which was acquired by Google in 2006. He received his Masters of Design from the Institute of Design.
Himanshu Vashishtha is a Software engineer at Cloudera. He works in the HBase team.
He did his MSc and BTech from University of Alberta, and Indian Institute of Technology Varanasi, respectively.
Shivaram Venkataraman is a second year PhD student at the University of California, Berkeley and works with Mike Franklin and Ion Stoica at the AMP Lab. His research
interests are in design of storage systems and analytics platforms for big-data applications. Before coming to Berkeley, he completed his M.S at the University of Illinois, Urbana-Champaign.
Anand Venugopal or “AV” has been instrumental in building the Big Data Analytics consulting services practice at Impetus Technologies over the last three years. With a diverse 17 year long techno-business background in Telecom, Interactive entertainment and Hi-Tech industry verticals, AV with the rest of the Impetus team, has been helping IT and line-of-business executives in large enterprises understand and extract the enormous value embedded in their static and “in-motion” Big-Data assets.
Bradley Voytek, PhD is a UCSF neuroscientist making use of data, brain-computer interfacing, and machine learning to figure out cognition. He is also the Data Evangelist for the San Francisco-based on-demand car service, Uber, Inc. Brad is an avid science teacher, outreach advocate, and world zombie brain expert. He’s spoken at events ranging from elementary schools to Ignite, TEDxBerkeley, @GoogleTalks, and SciFoo. His research and science writing has been featured in The Washington Post, Wired, Forbes, The New York Times, The New Yorker, The Guardian, The Atlantic, and Scientific American. He runs the blog Oscillatory Thoughts (http://blog.ketyov.com), tweets at @bradleyvoytek, and co-created brainSCANr.com with his wife Jessica Bolger Voytek.
Dr. Ben Waber is the president and CEO of Sociometric Solutions, a management services firm that uses social-sensing technology to drive innovative transformation services. He is also a visiting scientist at the MIT Media Lab, and he was previously a senior researcher at Harvard Business School. He received his Ph.D. from MIT for his work with Alex “Sandy” Pentland in the Human Dynamics group at the Media Lab. Waber’s work has been featured in major media outlets such as Wired, The Economist, and NPR. He has consulted for industry leaders such as LG, McKinsey & Company, and Gartner on technology trends, social networks, and organizational design. His book People Analytics was released by the Financial Times Press in 2013.
Matt Walker is a engineer at Etsy, where he is building out their big data stack and researching techniques for search and advertising. Previously, he worked as a researcher at Adtuitive and as an engineer at Pervasive Software. He holds an MS in computer science from UT and received his BS in electrical and computer engineering from Rice University.
Dean Wampler, Ph.D., is Principal Consultant at Think Big Analytics, specialists in Big Data, particularly Data Science in the Hadoop ecosystem. He speaks frequently at conferences on various big data and other programming topics.
Patrick Wendell is a Ph.D student working in the U.C. Berkeley AMPLab. His research focus is on large scale data-intensive computing and his adviser is Ion Stoica. Before working on the BDAS stack at Berkeley, he contributed to several Hadoop projects, mostly while working at Cloudera. He holds a B.S. in Computer Science from Princeton University.
Sam William is an Analytics Engineer at Stumbleupon. Before that, he worked as a software Engineer at the Content Platform group at Yahoo.
Matt is a Principal Program Manager on the Big Data team at Microsoft focused on building tools to make developers’ lives easier. Matt has been at Microsoft for 6 years working on developer tooling for distributed systems and cloud applications as well as technical evangelism.
Andrew, an inquisitive humanist, is motivated by the promise of making ours a more rational society. He applies his skills to the problem of converting data into information, a process requiring scripting and research into the relevant fields of study. He holds a B.A. in Mathematics from Reed College. He greatly enjoys his daily bicycle commute, Portland’s artisanal culture, and searing vegetables in cast iron.
Reynold Xin is a third-year PhD student in the AMP Lab at UC Berkeley. He leads the development of the Shark project, which won the Best Demo Award at SIGMOD 2012. He is also the recipient of the inaugural Best Demo Award at VLDB 2011 for his work on the CrowdDB system. Before graduate school, he worked on ads infrastructure at Google and distributed databases at IBM. His interests include data management systems, distributed systems, and algorithms for large-scale data processing.
In his role as SVP, Products, Scott is responsible for the Greenplum’s overall product development and go-to-market efforts, including engineering, product management, and marketing. Scott is a co-founder of Greenplum and was President of the company until Greenplum’s acquisition by EMC. Prior to Greenplum, Scott served as vice president for Digital Island, a publicly traded Internet infrastructure services company that was acquired by Cable & Wireless in 2001. Before Digital Island, Scott served as vice president for Sandpiper Networks, an Internet content delivery services company that merged with Digital Island in 1999. At Sandpiper, Scott helped to create the industry’s first content delivery network (CDN), a globally distributed computing infrastructure comprised of several thousand servers, and used by many of the industry’s... Read More.
Danny Yuan is a cloud system architect in the Platform Engineering Team of Netflix. He leads the effort of building and operating Netflix’s data collection pipeline, as well as the real-time insight project of the Platform Engineering Team. He also built Netflix’s crypto service, which manages all the crypto keys used by Netflix applications in the cloud and serves billions of crypto operations every day.
Matei Zaharia is an assistant professor of computer science at MIT, and the initial creator of Apache Spark. He is currently on industry leave to start Databricks, a company commercializing Spark, where he is CTO.
Charles leads Cloudera’s product function, setting the strategic direction for Cloudera’s Distribution for Apache Hadoop and Cloudera Enterprise. Prior to Cloudera Charles held a number of leadership roles at SAP, BEA Systems as well as venture backed startups.
Charles holds a bachelor’s degree from Carleton College and an MBA from MIT.
At Cloudera, Philip Zeyliger started and continues to work on the Cloudera Manager product, and is a committer on the Apache Avro project. Philip came to Cloudera from Google, where he worked on Megastore, and, before that, he worked in finance, at D.E. Shaw. Philip holds a bachelor’s degree in mathematics from Harvard University.
Dana Zuber Is a strategy and analytics executive at Wells Fargo where she was an early adopter and evangelist of using visual analytics to tell stories with data. She has led analytics teams at Wells Fargo in Wealth Management, Branch Planning and Modeling and most recently in Enterprise Data and Analytics. Dana presents regularly to Wells Fargo executive leadership teams in addition to speaking at conferences. She is a guest speaker on the topic of data visualization and analytics at Stanford, Sonoma State University and The University of San Francisco. Prior to Wells Fargo, Dana has worked at companies such as Walmart.com, L’Oreal USA and Canadian Tire. Dana has a degree in Mechanical Engineering from Queen’s University in Canada and an MBA from... Read More.
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at email@example.com
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at firstname.lastname@example.org
View a complete list of Strata contacts