do database JOINs make many copies of data in left and right sides of join?

Question

I'm curious: do databases return JOINs as the results of two (or more) SELECT statements with pointers from (segments of) rows in the left side to (segments of) rows in the right, or do they return result sets which may contain many copies of entries from the left and right sides?

You can imagine this being particularly inefficient for dense N:N relations, where N copies of a row on the left might be made to pair up with N matches on the right, and vice versa.

You could hide this optimization in client drivers and make it totally transparent to the database user.

EDIT To clarify, by "dense N:N relations," I mean something like (for a "Facebook clone") you have a "users" table and a "friends" table, and each user is friends with every other user. You would need two JOINs to pull all the pairs of friends out, but could pass the same information to the client as the result of three SELECT statements and let the client match up rows using pointers instead of actually duplicating each user 2(N-1) times (N-1 times on the left, N-1 times on the right).

do database JOINs make many copies of data in left and right sides of join?

Answers (1)

Related Questions