Compressed double-array tries for string dictionaries supporting fast lookup

作者:Shunsuke Kanda, Kazuhiro Morita, Masao Fuketa

摘要

A string dictionary is a basic tool for storing a set of strings in many kinds of applications. Recently, many applications need space-efficient dictionaries to handle very large datasets. In this paper, we propose new compressed string dictionaries using improved double-array tries. The double-array trie is a data structure that can implement a string dictionary supporting extremely fast lookup of strings, but its space efficiency is low. We introduce approaches for improving the disadvantage. From experimental evaluations, our dictionaries can provide the fastest lookup compared to state-of-the-art compressed string dictionaries. Moreover, the space efficiency is competitive in many cases.

论文关键词:Trie, Double-array, Compressed string dictionaries, Data management, String processing and indexing

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-016-0999-8