Eugene W. Myers's diff algorithm fails on lcs(»xx«, »x«)?

Question

I implemented both Myers versions of the diff algorithm, first the greedy one (which works, but is highly ressource intensive when it comes to memory consumption - but Myers writes that in his paper already) and then the refined version.

However, that fails on simple inputs like lcs(»xx«, »x«).

The reason from what I understand is, that the common snake it tries to find, is ambiguous: Let's mark the snakes with »L[...]« for »left snake« and »R[...]« for »right snake« to distinguish them. When going forward in step d = 0, the algorithm detects this snakes: »L[x]x«->»L[x]«. Now in the reverse step, the algorithm detects this snakes: »xR[x]«->»R[x]«. In the forward d = 1 step, the algorithm finds the seconds (empty) snake yielding: »L[x]xR[]«->L[x]R[]« and in the reserse step it finds »L[]xR[x]«->L[]R[x]«.

As you can see, the x-matching in both directions find the first possible snake, but in the forward case, it's L[x] but in the reverse case its R[x].

So this algorithm doesn't find a valid common snake.

In Lemma 3, Myers proves, that IF there is a D-path from (0,0) to (n,m) then there MUST be a D/2-path from (0,0) to some (x,y) and a D/2-path from (x,y) to (n,m). I'll skip the prove, because it's rather obvious, that you can just cut the D-path at any »valid« point to get two shorter paths, that combine back to the one D-path and that you can just take the middle of the D-path to do the cutting part. Anyways: He does prove the existence of such an (x,y) pair (actually, he proves this a little bit more sophisticated by adding a second point (u,v) - but for understanding of my question, that doesn't really matter), but (from my understanding) he does NOT prove, that using his algorithm, you're guaranteed to actually find any of this possible cut-points (x,y) and (u,v) for the D-path.

As shown above, I believe, Myers' refined algorithm actually can't find the solution for »xx«->»x«.

Am I missing something important here?

Eugene W. Myers's diff algorithm fails on lcs(»xx«, »x«)?

Answers (1)

Related Questions

Eugene W. Myers&#39;s diff algorithm fails on lcs(&#187;xx&#171;, &#187;x&#171;)?

Answers (1)

Related Questions

Eugene W. Myers's diff algorithm fails on lcs(»xx«, »x«)?