A faster algorithm computing string edit distances

作者:

Highlights:

摘要

The edit distance between two character strings can be defined as the minimum cost of a sequence of editing operations which transforms one string into the other. The operations we admit are deleting, inserting and replacing one symbol at a time, with possibly different costs for each of these operations. The problem of finding the longest common subsequence of two strings is a special case of the problem of computing edit distances. We describe an algorithm for computing the edit distance between two strings of length n and m, n ⪖ m, which requires O(n · max(1, mlog n)) steps whenever the costs of edit operations are integral multiples of a single positive real number and the alphabet for the strings is finite. These conditions are necessary for the algorithm to achieve the time bound.

论文关键词:

论文评审过程:Received 25 September 1978, Revised 6 June 1979, Available online 3 December 2003.

论文官网地址:https://doi.org/10.1016/0022-0000(80)90002-1