Reputation: 596
I'm looking for an algorithm, similar to the longest common subsequence (LCS) algorithms, that uses a similarity metric between letters of the alphabet. What I mean is that the known algorithms treat all letters of the alphabet as completely different, whereas in my use case some letters are easier to edit into other letters, so the diffing algorithm should treat them as similar.
As a usage example, think of a diffing algorithm working on lines of text, where some lines are more similar to certain other lines.
The paper An O(ND) Difference Algorithm and Its Variations states on page 4: "Consider adding a weight or cost to every edge. Give diagonal edges weight 0 and non-diagonal edges weight 1." I'd like the option to assign any weight from the interval [0, 1].
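To make the idea concrete, here is a minimal sketch of such a per-line weight in [0, 1], using Python's difflib as an illustrative similarity metric (the function name and the choice of difflib are my own, not from the question): identical lines cost 0, like a diagonal edge, and completely different lines cost 1, like a non-diagonal edge.

```python
import difflib

def line_weight(a, b):
    """Edit weight in [0, 1] between two lines of text.

    0.0 means the lines are identical (zero edit cost);
    1.0 means the lines share nothing (full edit cost).
    """
    return 1.0 - difflib.SequenceMatcher(None, a, b).ratio()
```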
Upvotes: 3
Views: 282
Reputation: 114
The Longest Common Subsequence (LCS) problem is usually solved by dynamic programming, and you can tweak existing methods to apply them to your use case.
In this example from Wikipedia explaining how LCS works, https://en.wikipedia.org/wiki/Longest_common_subsequence_problem#Example, you should tweak the algorithm so that:
instead of scoring score_j = score_i + 1 for j = i + 1 (that is to say, adding 1 each time a new common item is added to the LCS), you score score_j = F(score_i, p(letter_i, letter_j)) for every pair of compared letters.
Here p(letter_i, letter_j) is the probability of changing letter_i into letter_j (that is the weight in [0, 1] you were talking about), and F is an aggregation function that takes you from score_i to score_j given that probability p. For instance, F can be defined as the + operator, which would yield:

score_j = score_i + p(letter_i, letter_j)

or, more precisely:

score_j = score_i + p(letter_i, letter_j) x 1

(read the "x 1" as "times one character").
That would give you the maximum similarity (in characters) of the two subsequences, which you can recover by backtracking at the end of the algorithm.
You can define your own function F to yield better results.
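The tweaked dynamic program above can be sketched as follows. This is a hypothetical implementation, with F taken to be the + operator and a toy p that scores identical letters 1.0, same letters in different case 0.5, and anything else 0.0; swap in your own p for your alphabet.

```python
def p(a, b):
    """Toy similarity weight in [0, 1] between two letters.

    Identical letters score 1.0; letters that are "easy to edit into
    each other" (here: the same letter up to case) score 0.5;
    everything else scores 0.0 (no match).
    """
    if a == b:
        return 1.0
    if a.lower() == b.lower():
        return 0.5
    return 0.0

def weighted_lcs(x, y):
    """Return the maximum total similarity and one matched pair list."""
    n, m = len(x), len(y)
    # score[i][j] = best total similarity between x[:i] and y[:j].
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # F = "+": matching x[i-1] with y[j-1] adds p(...) instead
            # of the flat 1 used by the classic LCS recurrence.
            match = score[i - 1][j - 1] + p(x[i - 1], y[j - 1])
            score[i][j] = max(match, score[i - 1][j], score[i][j - 1])
    # Backtrack to recover which letters were paired up.
    pairs = []
    i, j = n, m
    while i > 0 and j > 0:
        w = p(x[i - 1], y[j - 1])
        if w > 0 and score[i][j] == score[i - 1][j - 1] + w:
            pairs.append((x[i - 1], y[j - 1]))
            i -= 1
            j -= 1
        elif score[i - 1][j] >= score[i][j - 1]:
            i -= 1
        else:
            j -= 1
    return score[n][m], pairs[::-1]
```

For example, weighted_lcs("AbC", "abc") pairs all three positions for a total similarity of 2.0 (0.5 + 1.0 + 0.5), whereas the classic unweighted LCS of those strings would only be "b".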
Upvotes: 1