ApacheCon Europe 2012

Rhein-Neckar-Arena, Sinsheim, Germany

5–8 November 2012

Operating HBase: Things You Need to Know

Christian Gügi

Audience level:
Intermediate
Track:
Big Data

Tuesday 3:15 p.m.–4 p.m. in Level 2 Left

Description

In this talk we will share our experience in running and operating an HBase production cluster. To avoid common pitfalls, we’ll discuss problems and challenges we’ve faced as well as practical solutions for repair. We'll also introduce the tools shipped with HBase and cover some background on HBase internals. We'll conclude by showing our open sourced tool to visualize region sizing/distribution.

Abstract

If you’re running HBase in production, you have to be aware of many things. In this talk we will share our experience in running and operating an HBase production cluster for a customer. To avoid common pitfalls, we’ll discuss problems and challenges we’ve faced as well as practical solutions (real-world techniques) for repair.

Even though HBase provides internal tools for diagnosing issues and for repair, running a healthy cluster can still be challenging for an administrator. We'll cover some background on these tools as well as on HBase internals such as compaction, region splits and their distribution.

We'll also introduce our tool to visualize region sizing and distribution in the cluster, that we recently open sourced.