Reputation: 889
Looks like spark sql is case sensitive for "like" queries, right ?
spark.sql("select distinct status, length(status) from table")
Returns
Active|6
spark.sql("select distinct status from table where status like '%active%'")
Returns no value
spark.sql("select distinct status from table where status like '%Active%'")
Returns
Active
Upvotes: 6
Views: 14997
Reputation: 8711
Yes, Spark is case sensitive. Most of the RDBMSs are case sensitive by default for string comparison. If you want case-insensitive, try rlike or convert the column to upper/lower case.
scala> val df = Seq(("Active"),("Stable"),("Inactive")).toDF("status")
df: org.apache.spark.sql.DataFrame = [status: string]
scala> df.createOrReplaceTempView("tbl")
scala> df.show
+--------+
| status|
+--------+
| Active|
| Stable|
|Inactive|
+--------+
scala> spark.sql(""" select status from tbl where status like '%Active%' """).show
+------+
|status|
+------+
|Active|
+------+
scala> spark.sql(""" select status from tbl where lower(status) like '%active%' """).show
+--------+
| status|
+--------+
| Active|
|Inactive|
+--------+
scala>
Upvotes: 12