rongenre

Reputation: 1334

Exporting Spark Dataframe to Athena

I'm running a pyspark job which creates a dataframe and stores it to S3 as below:

df.write.saveAsTable(table_name, format="orc", mode="overwrite", path=s3_path)

I can read the ORC file without a problem just by using spark.read.orc(s3_path), so the schema information is in the ORC file, as expected.
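For reference, this is a quick way to confirm the schema survives the round trip, assuming spark is the active SparkSession and s3_path is the same path used in the write above:

# Read the ORC data back and print the schema Spark infers from it
# (assumes `spark` is an active SparkSession and `s3_path` matches the write path)
df_check = spark.read.orc(s3_path)
df_check.printSchema()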

However, I'd really like to view the dataframe contents using Athena. Clearly, if I wrote to my Hive metastore, I could call Hive and run show create table ${table_name}, but that's a lot of work when all I want is a simple schema.

Is there another way?

Upvotes: 2

Views: 3561

Answers (1)

Al Belsky

Reputation: 1592

One of the approaches would be to set up a Glue crawler for your S3 path, which would create a table in the AWS Glue Data Catalog. Alternatively, you could create the Glue table definition via the Glue API.
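A minimal boto3 sketch of the crawler route might look like the following; the crawler name, IAM role, database name, region, and S3 path are placeholders you would replace with your own:

import boto3

glue = boto3.client("glue", region_name="us-east-1")  # region is an assumption

# Create a crawler pointed at the S3 path the dataframe was written to.
# "my-orc-crawler", "MyGlueServiceRole" and "my_database" are hypothetical names.
glue.create_crawler(
    Name="my-orc-crawler",
    Role="MyGlueServiceRole",          # IAM role with Glue and S3 read permissions
    DatabaseName="my_database",        # Glue database the table will be created in
    Targets={"S3Targets": [{"Path": "s3://my-bucket/path/to/orc/"}]},
)

# Run the crawler once; when it finishes, the table (with the schema inferred
# from the ORC files) appears in the Glue Data Catalog and is visible to Athena.
glue.start_crawler(Name="my-orc-crawler")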

The AWS Glue Data Catalog is fully integrated with Athena, so you would see your Glue table in Athena, and be able to query it directly: http://docs.aws.amazon.com/athena/latest/ug/glue-athena.html
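Once the table shows up in the catalog, you can query it from Athena, for example via boto3; the database, table, and output bucket below are the hypothetical names from the crawler sketch:

import boto3

athena = boto3.client("athena", region_name="us-east-1")  # region is an assumption

# Start a query against the Glue-catalogued table; Athena writes results to S3.
response = athena.start_query_execution(
    QueryString="SELECT * FROM my_database.my_table LIMIT 10",
    QueryExecutionContext={"Database": "my_database"},
    ResultConfiguration={"OutputLocation": "s3://my-bucket/athena-results/"},
)
print(response["QueryExecutionId"])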

Upvotes: 0
