sabrina

Reputation: 111

HBase table size is much bigger than the file in Hadoop HDFS

Recently I used Hadoop bulk load to put data into HBase. First, I called the HDFS API to write the data into a file in HDFS: 7,000,000 lines in total, 503 MB. Second, I used org.apache.hadoop.hbase.mapreduce.ImportTsv and org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles to put the data into HBase.

The key step was using the bulk load tool to put the data into HBase. After the bulk load finished, I found that the HBase table is 1.96 GB, almost four times the size of the input file. The HDFS replication factor is 1, so replication does not explain it. I do not know why.
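For reference, the commands I ran were roughly of this shape (the table name mytable, the column mapping, and the paths below are placeholders, not my exact values):

```
# Step 1: run ImportTsv in bulk-output mode, so it writes HFiles
# instead of going through the normal HBase write path.
hbase org.apache.hadoop.hbase.mapreduce.ImportTsv \
    -Dimporttsv.columns=HBASE_ROW_KEY,colfam1:c1 \
    -Dimporttsv.bulk.output=/tmp/hfile-output \
    mytable /user/sabrina/input.tsv

# Step 2: move the generated HFiles into the regions of the target table.
hbase org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles \
    /tmp/hfile-output mytable
```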

Upvotes: 3

Views: 3503

Answers (1)

Donald Miner

Reputation: 39943

There is a bit of overhead in storing the data, since every cell has to store the row key, column family, and column qualifier names and such, but not 4x overhead. I have a few ideas, but I definitely wouldn't mind hearing more details on the nature of the data and perhaps the stats on the table.

  • Do you have compression turned on in your table? If the data was compressed in HDFS but is stored uncompressed in HBase after the load, that could account for a lot of the difference (see the commands after this list for how to check).
  • Maybe HBase for whatever reason isn't honoring your replication factor. Run hadoop fs -dus /path/to/hbase/table/data and see what that returns (also shown below).
  • Are your column qualifiers pretty big? For example, colfam1:abc is pretty small and won't take up much space, but colfam1:abcdefghijklmnopqrstuvwxyz is going to take up quite a bit of space in the grand scheme of things!
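A minimal sketch of how to run those checks, assuming the table is called mytable with a column family colfam1 and the default HBase root directory (all of these are placeholders; adjust to your setup):

```
# Check the actual on-disk size of the table directory (summarized).
# /hbase is the default hbase.rootdir; yours may differ.
hadoop fs -dus /hbase/mytable

# Check whether compression is enabled for each column family.
echo "describe 'mytable'" | hbase shell

# If COMPRESSION => 'NONE', enable it and rewrite the store files.
# (Older HBase versions require the table to be disabled before alter.)
echo "disable 'mytable'" | hbase shell
echo "alter 'mytable', {NAME => 'colfam1', COMPRESSION => 'GZ'}" | hbase shell
echo "enable 'mytable'" | hbase shell
echo "major_compact 'mytable'" | hbase shell
```

If the compressed, major-compacted size is still far above 503 MB, the per-cell key overhead from the last bullet (long row keys or qualifiers repeated in every cell) is the likely remaining culprit.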

Upvotes: 3
