ak17

Reputation: 192

Join using contains in spark sql

I have a dataset like:

+---+--------------+
| id|  name_and_age|
+---+--------------+
|  1|   Anu,23     |
|  2|Suresh,24     |
|  3|  Usha,24     |
|  4| Nisha,25     |
+---+--------------+

and

+---+----+
|id2| age|
+---+----+
|  a|  23|
|  b|  24|
|  c|  24|
|  d|  25|
+---+----+

I need to join both datasets, so I am using contains:

dataset1.join(dataset2, dataset1.col("name_and_age").contains(dataset2.col("age")), "inner")

but I am not getting correct results. Are there any other ways to join these? Please note that there are no other columns to use in the join condition.

Upvotes: 0

Views: 634

Answers (1)

werner

Reputation: 14845

Add an age column to dataset1 and use this column to join:

import org.apache.spark.sql.functions._
import spark.implicits._

val dataset3 = dataset1.withColumn("age", split('name_and_age, ",").getItem(1))
dataset3.join(dataset2, "age").show()

Output:

+---+---+------------+---+
|age| id|name_and_age|id2|
+---+---+------------+---+
| 23|  1|      Anu,23|  a|
| 25|  4|    Nisha,25|  d|
| 24|  2|   Suresh,24|  b|
| 24|  2|   Suresh,24|  c|
| 24|  3|     Usha,24|  b|
| 24|  3|     Usha,24|  c|
+---+---+------------+---+
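The same split-then-equi-join logic can be sanity-checked outside Spark with plain Scala collections. This is only an illustrative stand-in (the tuples mirror the example data above, and `split(",")(1)` plays the role of `split('name_and_age, ",").getItem(1)`), not the Spark job itself:

```scala
// Plain-Scala sketch of the approach: extract the age from "name,age",
// then equi-join on it, mirroring withColumn + join above.
object SplitJoinSketch {
  val dataset1 = Seq((1, "Anu,23"), (2, "Suresh,24"), (3, "Usha,24"), (4, "Nisha,25"))
  val dataset2 = Seq(("a", "23"), ("b", "24"), ("c", "24"), ("d", "25"))

  def joined: Seq[(String, Int, String, String)] =
    for {
      (id, nameAndAge) <- dataset1
      // take the part after the comma and drop stray whitespace
      age = nameAndAge.split(",")(1).trim
      (id2, age2) <- dataset2
      if age == age2
    } yield (age, id, nameAndAge, id2)
}
```

As in the Spark output, each age-24 row in dataset1 matches both `b` and `c`, so the join yields six rows.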

Upvotes: 1
