Many of the ideas covered in this talk stemmed from discussing professional experiences amongst a group of Washington DC-based Data Scientists (Marck Vaisman, Harlan Harris, and Sean Murphy) in doing data science work within various organizations. Some frustrations from dealing with management that was not data savvy, from working within limited technical environments which inhibited analytical efforts, and lastly from the perceived gap between what data scientists can do and reality.
Drawing from all of these experiences, we decided to take action and help bring clarity to both sides of the table: the organizations wanting the benefits and the practitioners executing the data science projects. Some of the questions we investigated include:
We highlight common pitfalls facing organizations planning or executing data science. We will cover optimal organizational mindsets, the technical considerations and end with showing the diversity in skills within the Data Science practitioner community, as shown by a survey of several hundred Data Scientists from around the world. This talk is based on material from the upcoming Bad Data Handbook as well as from the analysis of the survey results.
The intent of this session is to surface many issues that have arisen within Data Science as a young, burgeoning, and potentially highly profitable field, and we attempt to establish a common framework for better communication. New practitioners in this field and organizations that are beginning to incorporate data science into their processes will benefit from attending this session.
Marck Vaisman is a data scientist, consultant, entrepreneur, master munger and hacker. Marck is Senior Data Scientist at DeepMile Networks. Previously he’s been the Principal Data Scientist at DataXtract, LLC helping clients with all kinds of data science projects. His professional experience spans the management consulting, telecommunications, Internet, and technology industries. Marck is the co-founder of Data Community DC [http://www.datacommunitydc.org], an organization focused on building the Washington DC area data community by running meetup and other events. He has an MBA from Vanderbilt University and a B.S. in Mechanical Engineering from Boston University. Marck is a contributing author to O’Reilly’s Bad Data Handbook.
Sean Patrick Murphy, with degrees in mathematics, electrical engineering, and biomedical engineering and an MBA from Oxford University, has served as a senior scientist at the Johns Hopkins Applied Physics Laboratory for the past ten years. Previously, he served as the Chief Data Scientist at WiserTogether, a series A funded health care analytics firm, and the Director of Research at Manhattan Prep, a boutique graduate educational company. He was also the co-founder and CEO of a big data-focused startup: CloudSpree.
Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?
Join the conversation here (requires login)
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at email@example.com
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at firstname.lastname@example.org
View a complete list of Strata contacts