The Apache HBase Book

Revision History
Revision 0.90.1  
Adding first cuts at Configuration, Getting Started, Data Model
Revision 0.89.20100924 5 October 2010 stack
Initial layout

Abstract

This is the official book of Apache HBase, a distributed, versioned, column-oriented database built on top of Apache Hadoop and Apache ZooKeeper.


Table of Contents

Preface
1. Getting Started
1.1. Introduction
1.2. Quick Start
1.2.1. Download and unpack the latest stable release.
1.2.2. Start HBase
1.2.3. Shell Exercises
1.2.4. Stopping HBase
1.2.5. Where to go next
1.3. Not-so-quick Start Guide
1.3.1. Requirements
1.3.2. HBase run modes: Standalone and Distributed
1.3.3. Example Configurations
2. Upgrading
2.1. Upgrading to HBase 0.90.x from 0.20.x or 0.89.x
3. Configuration
3.1. hbase-site.xml and hbase-default.xml
3.1.1. HBase Default Configuration
3.2. hbase-env.sh
3.3. log4j.properties
3.4. The Important Configurations
3.4.1. Required Configurations
3.4.2. Recommended Configuations
3.5. Client configuration and dependencies connecting to an HBase cluster
4. The HBase Shell
4.1. Scripting
4.2. Shell Tricks
4.2.1. irbrc
4.2.2. LOG data to timestamp
4.2.3. Debug
5. HBase and MapReduce
6. HBase and Schema Design
6.1. Monotonically Increasing Row Keys/Timeseries Data
7. Metrics
8. Cluster Replication
9. Data Model
9.1. Table
9.2. Row
9.3. Column Family
9.4. Cells
9.5. Versions
9.5.1. Versions and HBase Operations
9.5.2. Current Limitations
10. Architecture
10.1. Daemons
10.1.1. Master
10.1.2. RegionServer
10.2. Regions
10.2.1. Region Size
10.2.2. Region Splits
10.2.3. Region Load Balancer
10.2.4. Store
11. The WAL
11.1. What is the purpose of the HBase WAL
11.2. WAL splitting
11.2.1. hbase.hlog.split.skip.errors
11.2.2. How EOFExceptions are treated when splitting a crashed RegionServers' WALs
12. Bloom Filters
12.1. Configurations
12.1.1. HColumnDescriptor option
12.1.2. io.hfile.bloom.enabled global kill switch
12.1.3. io.hfile.bloom.error.rate
12.1.4. io.hfile.bloom.max.fold
12.2. Bloom StoreFile footprint
12.2.1. BloomFilter in the StoreFile FileInfo data structure
12.2.2. BloomFilter entries in StoreFile metadata
A. Tools
A.1. HBase hbck
A.2. HFile Tool
A.3. WAL Tools
A.3.1. HLog tool
A.4. Compression Tool
B. Compression In HBase
B.1. CompressionTest Tool
B.2. hbase.regionserver.codecs
B.3. LZO
B.4. GZIP
C. FAQ
D. YCSB: The Yahoo! Cloud Serving Benchmark and HBase
Index