“Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems.”
As a data warehouse, pulling out data from different database is a basic requirement as part of Extract, transform and load (ETL).
“HBase is the Hadoop database.”
So it is very nature to have this idea: Hive can operate HBase, as storage target or data source.
Hive storage is based on Hadoop‘s underlying append-only filesystem (HDFS). It is very good to store static data. At the same time, HBase is good for dynamic data with support of Create, Read, Update and Delete (CRUD).