Reputation: 7411
I have these two Spark schemas in Scala, and I need to check whether they are equal, ignoring the nullable flag of each column.
val schemaA = StructType(Seq(StructField("date",DateType,true), StructField("account_name",StringType,true)))
val df_A = spark.createDataFrame(spark.sparkContext.emptyRDD[Row], schemaA)
val schemaB = StructType(Seq(StructField("date",DateType,false), StructField("account_name",StringType,true)))
val df_B = spark.createDataFrame(spark.sparkContext.emptyRDD[Row], schemaB)
In Python, I could have simply done this:
print(
all(
(a.name, a.dataType) == (b.name, b.dataType)
for a, b in zip(df_A.schema, df_B.schema)
)
)
But I'm stuck trying to do the same thing in Scala. Any tips?
Upvotes: 0
Views: 143
Reputation: 40500
Another way to go around the "extra columns" problem mentioned in the comments:
val result = schemaA.map { a => a.name -> a.dataType } == schemaB.map { b => b.name -> b.dataType }
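A Spark-free sketch of why this handles extra columns (using a hypothetical Field case class standing in for StructField): Seq equality also compares lengths, so a schema with an extra field never compares equal.

```scala
// Hypothetical stand-in for Spark's StructField, for illustration only
case class Field(name: String, dataType: String, nullable: Boolean)

val schemaA = Seq(
  Field("date", "DateType", nullable = true),
  Field("account_name", "StringType", nullable = true)
)
// Same names and types (nullable differs), plus one extra field
val schemaB = Seq(
  Field("date", "DateType", nullable = false),
  Field("account_name", "StringType", nullable = true),
  Field("extra", "IntegerType", nullable = true)
)

// Mapping to (name, dataType) pairs drops nullable; Seq equality
// compares element-by-element AND requires equal lengths
val result = schemaA.map(a => a.name -> a.dataType) ==
             schemaB.map(b => b.name -> b.dataType)
// false here, because schemaB has an extra field
```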
Upvotes: 2
Reputation: 37822
Quite similarly to your Python solution:
val result: Boolean = schemaA.zip(schemaB).forall {
case (a, b) => (a.name, a.dataType) == (b.name, b.dataType)
}
(no need to use the DFs).
Do note that both this solution and the Python one might return true
when one of the schemas has extra fields that the other one doesn't, because zip
would simply ignore them.
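To illustrate that pitfall with a Spark-free sketch (a hypothetical Field case class instead of StructField): zip truncates to the shorter sequence, so an extra trailing field goes unnoticed unless you also compare lengths.

```scala
// Hypothetical stand-in for Spark's StructField, for illustration only
case class Field(name: String, dataType: String)

val schemaA = Seq(Field("date", "DateType"), Field("account_name", "StringType"))
val schemaB = schemaA :+ Field("extra", "IntegerType") // one extra field

// zip truncates to the shorter Seq, so the extra field is silently dropped
val naive = schemaA.zip(schemaB).forall {
  case (a, b) => (a.name, a.dataType) == (b.name, b.dataType)
}
// naive is true despite the mismatch

// Adding a length check closes the gap
val strict = schemaA.length == schemaB.length && naive
// strict is false
```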
Upvotes: 2