ReCSA: a dedicated sort accelerator using ReRAM-based content addressable memory
Authors: Huize Li, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao
Abstract
As data volumes grow, there is an urgent need for efficient sorting algorithms that can process large data sets. Hardware sorting algorithms have attracted much attention because they can exploit the parallelism of different hardware. However, traditional hardware sort accelerators suffer from the “memory wall” problem because they require multiple rounds of data transfer between memory and the processor. In this paper, we utilize the in-situ processing ability of the ReRAM crossbar to design a new ReCAM array that can perform matrix-vector multiplication and vector-scalar comparison in the same array simultaneously. Using this ReCAM array, we present ReCSA, the first dedicated ReCAM-based sort accelerator. Besides the hardware design, we also develop algorithms that maximize memory utilization and minimize memory exchanges to improve sorting performance. The sorting algorithm in ReCSA can process various data types, such as integers, floats, doubles, and strings.
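To make the role of vector-scalar comparison in sorting concrete, here is a minimal software sketch of a rank-based (enumeration) sort. It is not the ReCSA algorithm, which the abstract does not specify; the function name `rank_sort` and the tie-breaking rule are assumptions made for illustration. Each key is compared against every stored value, and the resulting match bits are summed to give the key's final position. The summation is equivalent to a dot product with an all-ones vector, which is the kind of step a matrix-vector multiplication could carry out.

```python
def rank_sort(values):
    """Illustrative rank sort: place each key at the count of values that precede it.

    Hypothetical helper, not taken from the paper. A CAM could evaluate each
    comparison across all rows at once; this sketch uses plain Python loops.
    """
    out = [None] * len(values)
    for i, key in enumerate(values):
        # Vector-scalar comparison: every stored value against the current key.
        less = [1 if v < key else 0 for v in values]
        # Break ties by original position so equal keys get distinct ranks.
        ties = [1 if (v == key and j < i) else 0 for j, v in enumerate(values)]
        # Summing the match bits is a dot product with an all-ones vector.
        rank = sum(less) + sum(ties)
        out[rank] = key
    return out
```

Because the sketch relies only on the `<` and `==` operators, it works for any mutually comparable keys, for example integers, floating-point numbers, or strings. It performs O(n^2) comparisons in software; the appeal of an in-memory design is that each vector-scalar comparison can be evaluated in parallel.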
Keywords: ReCAM, parallel sorting, architecture design, processing-in-memory
Paper URL: https://doi.org/10.1007/s11704-022-1322-3