Reputation: 21
I have a question. Suppose I have two pair RDDs:
RDD1 = RDD[(1, 1), (1, 2)]
RDD2 = RDD[(1, obj)] // obj is a mutable Scala object
RDD1.join(RDD2)
This join operation should produce: RDD[(1, (1, obj1)), (1, (2, obj2))]
The question is: are obj1 and obj2 references to the same object? If they are, what happens during this join? I used to think they were two separate objects, each deserialized from obj's serialized form, but today I found that operations on obj1 were reflected in obj2, and I suddenly got confused.
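Here is a minimal sketch of the kind of check I mean (assuming a local SparkContext, e.g. in spark-shell; Counter is a hypothetical stand-in for obj):

import org.apache.spark.{SparkConf, SparkContext}

// Counter is a hypothetical stand-in for the mutable `obj` above.
class Counter(var value: Int) extends Serializable

val sc = new SparkContext(new SparkConf().setAppName("join-identity").setMaster("local[*]"))
val obj = new Counter(0)

val rdd1 = sc.parallelize(Seq((1, 1), (1, 2)))
val rdd2 = sc.parallelize(Seq((1, obj)))

// Joined values: Array[(Int, Counter)], one Counter per output row.
val Array((_, obj1), (_, obj2)) = rdd1.join(rdd2).values.collect()

// Reference identity, not equality: may print true or false depending on
// Spark internals (serialization, partitioning, local vs. cluster mode).
println(obj1 eq obj2)

// Mutation through one reference may or may not show through the other.
obj1.value = 42
println(obj2.value)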
Thanks
Upvotes: 1
Views: 87
Reputation: 2944
You can never know for sure whether these will be the same object or not; it depends on Spark's internal implementation.
If you need to process any kind of mutable state, you should store the objects outside Spark (in Hive, HDFS, ...) and only reference them by id during processing, as sketched below. Even that is not entirely simple, because the same object might then be accessed from multiple threads/executors, and you need to handle that correctly.
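A minimal sketch of that "reference by id" pattern, assuming a hypothetical ObjectStore client and MyState type; any external system that can load/save by id would play the store's role:

import org.apache.spark.rdd.RDD

// Hypothetical state and store client standing in for Hive, HDFS, a
// key-value DB, etc.
case class MyState(count: Int)
trait ObjectStore {
  def load(id: Long): MyState
  def save(id: Long, state: MyState): Unit
}

// The RDD carries only ids; each partition opens its own store client on the
// executor and reads/writes the shared state through it.
def process(ids: RDD[Long], makeStore: () => ObjectStore): RDD[(Long, MyState)] =
  ids.mapPartitions { part =>
    val store = makeStore()
    part.map { id =>
      val updated = MyState(store.load(id).count + 1)
      store.save(id, updated) // concurrent access from other executors must still be coordinated
      (id, updated)
    }
  }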
Generally, as you've been told in the comments, using mutable objects in distributed computation is very fragile: it breaks easily, and the resulting bugs are hard to track down.
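For comparison, a sketch of the immutable alternative (assuming a SparkContext sc is already in scope, as in spark-shell): derive new values with transformations instead of mutating in place, so shared references never matter.

// Acc is a hypothetical immutable replacement for the mutable obj.
case class Acc(total: Int)

val left  = sc.parallelize(Seq((1, 1), (1, 2)))
val right = sc.parallelize(Seq((1, Acc(0))))

// copy() produces a new value; nothing is mutated, so it is irrelevant
// whether the two joined rows ever shared a reference.
val updated = left.join(right).mapValues { case (n, acc) => acc.copy(total = acc.total + n) }
updated.collect().foreach(println) // (1,Acc(1)) and (1,Acc(2)) as independent values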
Upvotes: 2