Using HBase effectively - What You Need to Know as an Application Developer

Jonathan Hsieh (Cloudera, Inc.), Himanshu Vashishtha (Cloudera, Inc.)
Hadoop in Practice Great America Ballroom K
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
Average rating: ***..
(3.12, 16 ratings)

HBase is one of the NoSQL data stores that have emerged in recent years, and it is gaining popularity quickly. It is an open source implementation of Google's Bigtable and is part of the Hadoop ecosystem. HBase is known to scale easily to hundreds of nodes, providing fast random access to terabytes and petabytes of data. This tutorial will get you started with HBase so you can build a scalable application of your own.

We’ll accomplish this by covering the following aspects:

  1. The background of HBase as a datastore
  2. Setting up HBase on a *nix machine (bring a laptop running Linux; Macs work just as well, and so does a remote EC2 instance)
  3. Building a real-world application from the ground up, thinking about scale from the start. The application will use HBase as its backend store.
  4. HBase data model and schema design basics
  5. Overview of HBase internals and design, and what they mean for your application

We’ll also touch upon the following topics:

  1. Production deployment strategies and things to think about
  2. Tuning HBase for different workloads and performance testing your system

At the end of the tutorial, you’ll have a good understanding of how to effectively use HBase as the backend store for your application.

Jonathan Hsieh

Cloudera, Inc.

Software Engineer @ Cloudera. Apache HBase Committer, Apache Flume Founder.

Himanshu Vashishtha

Cloudera, Inc.

Himanshu Vashishtha is a software engineer at Cloudera, where he works on the HBase team.
He received his MSc from the University of Alberta and his BTech from the Indian Institute of Technology Varanasi.

Comments

Jonathan Hsieh
02/27/2013 3:15pm PST

Mina,

Looks like the slides are up now!

Enjoy! Jon

Jonathan Hsieh
02/27/2013 2:21pm PST

Mina,

I’ve sent the slides to the organizers today, so hopefully they will show up in the next day or so.

Jon.

Minakshi Mukherjee
02/26/2013 7:49pm PST

Hi Jon,

I attended the session today. Are you going to post the slides? Mina

Jonathan Hsieh
02/26/2013 7:05am PST

Jean-Marc,

I believe the session will be recorded. However, it may only be available to people who have paid for a recent Strata conference or for the session itself. I’ll be posting a copy of the slides publicly after the talk is done.

Jon.

Jean-Marc Spaggiari
02/26/2013 5:10am PST

Hi Jonathan,

Do you know if this session will be recorded and available somewhere?

Thanks,

JM

Jonathan Hsieh
02/26/2013 12:29am PST

Hi John,

Though not required, you can download hbase using these commands:

Download and untar:

    wget http://apache.osuosl.org/hbase/hbase-0.94.5/hbase-0.94.5.tar.gz
    tar xvfz hbase-0.94.5.tar.gz
    cd hbase-0.94.5
    bin/start-hbase.sh

Verify:

    bin/hbase shell

Browse http://localhost:60010
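As an optional sanity check after the steps above, a short script can confirm that something is answering at the web UI address. This is only a minimal sketch and not part of the original instructions: the function name `master_ui_is_up` is hypothetical, and the URL is simply the one quoted above, so adjust it if your setup differs.

```python
import http.client
import urllib.error
import urllib.request


def master_ui_is_up(url="http://localhost:60010", timeout=3.0):
    """Return True if an HTTP GET to `url` succeeds, False otherwise.

    The default URL is an assumption taken from the instructions above.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            # urlopen follows redirects and raises on 4xx/5xx responses,
            # so reaching this point normally means a 2xx status.
            return 200 <= resp.status < 400
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # Connection refused, timeout, HTTP error, or malformed URL.
        return False


if __name__ == "__main__":
    print("Web UI reachable:", master_ui_is_up())
```

If this reports that the UI is not reachable right after bin/start-hbase.sh, wait a few seconds and try again, since the process may still be starting up.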

Alternatively, you can use a virtual machine image that can be downloaded from here:

https://ccp.cloudera.com/display/SUPPORT/Cloudera%27s+Hadoop+Demo+VM+for+CDH4

Hope it helps! Jon.

John Schwab
02/25/2013 7:39am PST

Are the required downloads available ahead of time for the “Using HBase Effectively” tutorial?
