Big data analysis on Amazon Aurora RDS

Question

I have a Aurora table that has 500 millions of records . I need to perform Big data analysis like finding diff between two tables . Till now i have been doing this using HIVE on files system ,but now we have inserted all files rows into Aurora DB . But still monthly i need to do the same thing finding diff.

So to this what colud be the best option ?

Exporting Aurora data back to S3 as files and then running HIVE query on that(how much time it might take to export all Aurora rows into S3)?
Can i run HIVE query on Aurora table ?(I guess hive on Aurora does not support)
Running spark SQL on Aurora (how will be the performance ) ?

Or is there any better way to this .

Big data analysis on Amazon Aurora RDS

Answers (1)

Related Questions