Volume 26, Issue 12 pp. 1439-1458
Research Article

A Comparison of Approximate String Matching Algorithms

PETTERI JOKINEN

Corresponding Author

PETTERI JOKINEN

Department of Computer Science, P.O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland

Department of Computer Science, P.O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, FinlandSearch for more papers by this author
JORMA TARHIOESKO UKKONEN

Abstract

Experimental comparisons of the running time of approximate string matching algorithms for the k differences problem are presented. Given a pattern string, a text string, and an integer k, the task is to find all approximate occurrences of the pattern in the text with at most k differences (insertions, deletions, changes). We consider seven algorithms based on different approaches including dynamic programming, Boyer–Moore string matching, suffix automata, and the distribution of characters. It turns out that none of the algorithms is the best for all values of the problem parameters, and the speed differences between the methods can be considerable.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.