How do I create a single view of two glue tables in AWS?

Question

I have a scenario, where I have two tables - Table1 and Table2 getting created using crawlers with source as parquet inside my s3 prefix - s3://bucket_name/my_tables/table1/data.parquet, s3://bucket_name/my_tables/table2/data.parquet.

These table have following columns - Table1 - id, type, name, address Table2 - id, type, officenumber, designation

Consider type of all columns as string here.

In athena - Now I want to create a single table called - MyTables with all the columns in the two tables above combined. Basically, I ran a query to create this table manually specifying the source as - s3://bucket_name/my_tables

The table gets created but it is empty. How do I achieve this without duplicating records in S3 and utilizing the same files that were used to create Table1 and Table2. After it is just a table with all columns present in these two tables.

I was able to load partitions individually. First I ran an SQL query to create a table in Athena. And then I added partitions using ALTER PARTITIONS query for all the date partitions.

Is there a way to load the data using single query? Also, if new date partitions gets added, then do I have to add those partitions manually again?

How do I create a single view of two glue tables in AWS?

Answers (1)

Related Questions