melanie

Reputation: 11

Hadoop or HBase writing partial results?

I have an issue that happens infrequently where partial results are being written into our HBase database. Here is a description of my mappers and reducers:

Twice in the last 4 months (so not very frequently), results have been written to HBase such that one of the columns has some very low number (like 1 or 3), and the remaining columns have normal numbers. When I re-run the job, the erroneous column value jumps up to its expected value. It was not the same column that was "broken" both times. No errors were written to the logs.

Has anyone else experienced similar behavior? Does anyone have ideas? Any help would be appreciated. Thanks!

Upvotes: 0

Views: 416

Answers (1)

Arnon Rotem-Gal-Oz

Reputation: 25919

If you are using Hadoop 0.20.*, it lacks append support, which can cause HBase to lose data occasionally. HBase needs append for things like writing to the WAL (write-ahead log), and without it there is no guarantee that all writes make it to disk. If that is your case, you can upgrade Hadoop to a version that supports append.
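For reference, here is a sketch of the setting involved. It assumes you end up on an append-capable build (for example the 0.20-append branch or a distribution that backports it), where the HBase documentation of that era asked for `dfs.support.append` to be enabled in `hdfs-site.xml` on the HDFS side and in `hbase-site.xml` on the HBase side. On a stock 0.20.x release, setting this property does not add the missing functionality, so check the documentation for your exact versions before relying on it.

```xml
<!-- Add to hdfs-site.xml and to hbase-site.xml.
     Assumption: the Hadoop build actually implements append/sync. -->
<property>
  <name>dfs.support.append</name>
  <value>true</value>
</property>
```

The affected daemons have to be restarted for the change to take effect.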

Upvotes: 0
