Reputation: 1
I have a requirement, I need to refresh the production HAWQ database to QA environment on daily basis.
How to move the every day delta into QA cluster from Production.
Appreciate your help
Thanks Veeru
Upvotes: 0
Views: 67
Reputation: 370
Shameless self-plug - have a look at the following open PR for using Apache Falcon to orchestrate a DR batch job and see if it fits your needs.
https://github.com/apache/incubator-hawq/pull/940
Here is the synopsis of the process:
hawqsync-extract
to capture known-good HDFS file sizes (protects against HDFS / catalog inconsistency if failure during sync)hawqsync-falcon
, which performs the following steps:
MASTER_DATA_DIRECTORY
(MDD) tarball to HDFSThere is also a JIRA with the design description:
https://issues.apache.org/jira/browse/HAWQ-1078
Upvotes: 1
Reputation: 2106
There isn't a built-in tool to do this so you'll have to write some code. It shouldn't be too difficult to write either because HAWQ doesn't support UPDATE or DELETE. You'll only have to append new data to QA.
Upvotes: 0