Reputation: 41665
I'm using pyspark
df = self.sqlContext.read.option(
"es.resource", indexes
).format("org.elasticsearch.spark.sql").load()
df = df.filter(
df.data.timestamp >= self.period_start
)
I'd like to see sql query version of df
if possible.
something like print(df.query)
to see something like select * from my-indexes where data.timestamp > self.period_start
Upvotes: 2
Views: 7603
Reputation: 7399
You can check out this piece of documentation for pyspark.sql.DataFrame.explain
. explain
prints the (logical and physical) plan to the console for debugging purpose.
Upvotes: 3