Pattern matching in Huffman encoded texts

作者:

Highlights:

摘要

For a given text which has been encoded by a static Huffman code, the possibility of locating a given pattern directly in the compressed text is investigated. The main problem is one of synchronization, as an occurrence of the encoded pattern in the encoded text does not necessarily correspond to an occurrence of the pattern in the text. A simple algorithm is suggested which reduces the number of erroneously declared matches. The probability of such false matches is analyzed and empirically tested.

论文关键词:Data compression,Huffman codes,Pattern matching,Compressed matching

论文评审过程:Received 25 April 2003, Accepted 25 August 2003, Available online 25 March 2004.

论文官网地址:https://doi.org/10.1016/j.ipm.2003.08.008