Patrick Fürst

Reputation: 11

PySpark Pushing down timestamp filter

I'm using PySpark version 2.4 to read some tables using jdbc with a Postgres driver.

    df = spark.read.jdbc(url=data_base_url, table="tablename", properties=properties)

One column is a timestamp column and I want to filter it like this:

    df_new_data = df.where(df.ts > last_datetime)

This way the filter is pushed down as a SQL query, but the datetime format is not right, so I tried this approach:

    df_new_data = df.where(df.ts > F.date_format(F.lit(last_datetime), "y-MM-dd'T'hh:mm:ss.SSS"))

but then the filter is not pushed down anymore.

Can someone clarify why this is the case?

Upvotes: 0

Views: 332

Answers (1)

Lakshman Battini

Reputation: 1912

When loading data from a database table, if you want the query pushed down to the database so that only the matching rows come back, you can provide a query instead of a table name and get just the result as a DataFrame. This way the database engine processes the query and returns only the results to Spark.

The table parameter identifies the JDBC table to read; you can use anything that is valid in the FROM clause of a SQL query. Note that an alias must be provided for the subquery.

    pushdown_query = "(select * from employees where emp_no < 10008) emp_alias"
    df = spark.read.jdbc(url=jdbcUrl, table=pushdown_query, properties=connectionProperties)
    df.show()
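
Applied to your case, here is a minimal sketch of the same idea for the timestamp filter (assuming the table is called tablename, the column is ts, and last_datetime is a Python datetime, as in the question; the ::timestamp cast is Postgres-specific):

    from datetime import datetime

    last_datetime = datetime(2019, 1, 1)  # example cutoff; replace with your actual value

    # Build a subquery that Postgres evaluates itself, so only the new rows
    # are transferred to Spark. The alias (t) is required.
    pushdown_query = "(select * from tablename where ts > '{}'::timestamp) t".format(
        last_datetime.strftime("%Y-%m-%d %H:%M:%S.%f")
    )

    df_new_data = spark.read.jdbc(url=data_base_url, table=pushdown_query,
                                  properties=properties)
    df_new_data.show()

Since the WHERE clause is part of the query string itself, the filter is guaranteed to run in the database rather than depending on Spark pushing down the DataFrame expression.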

Upvotes: 1
