Block trees
作者:
Highlights:
•
摘要
Let string S[1..n] be parsed into z phrases by the Lempel-Ziv algorithm. The corresponding compression algorithm encodes S in O(z) space, but it does not support random access to S. We introduce a data structure, the block tree, that represents S in O(zlog(n/z)) space and extracts any symbol of S in time O(log(n/z)), among other space-time tradeoffs. The structure also supports other queries that are useful for building compressed data structures on top of S. Further, block trees can be built in linear time and in a scalable manner. Our experiments show that block trees offer relevant space-time tradeoffs compared to other compressed string representations for highly repetitive strings.
论文关键词:Compressed data structures,Repetitive string collections,Lempel-Ziv compression
论文评审过程:Received 17 September 2019, Revised 11 May 2020, Accepted 5 November 2020, Available online 18 November 2020, Version of Record 20 November 2020.
论文官网地址:https://doi.org/10.1016/j.jcss.2020.11.002