This tutorial will teach attendees about the key aspects of scalable web mining, via six modules:
1. IntroductionVeteran developer and entrepreneur, 25+ years experience. Founder and President of TransPac Software, a 20 year leader in internationalization, mobile devices, and search consulting. Founder and CTO of Krugle, a vertical search engine and enterprise appliance for code and technical information. Co-founder of Bixo web mining project. Committer for the Apache Tika project. Author and speaker on vertical search and web mining.
Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?
Join the conversation here
(requires login)
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com.
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
@oreilly.com
For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com
View a complete list of Strata contacts
Comments
Hi Jim – Mac is actually the preferred platform. Windows users need to install Cygwin to get a Linux-like environment for running Hadoop, but Mac already has that – you can just use the regular Terminal window.
The instructions for lab appear to assume a windows user. Are the tools required already installed for a mac? Is there a preference on a windows vs mac set up?