A parallel computing method using blocked format with optimal partitioning for SpMV on GPU
作者:
Highlights:
• We develop an optimal partitioning strategy to improve the performance of SpMV.
• We present a reordering algorithm in which the time complexity is only .
• We employ a hybrid format to store a blocked sparse matrix partitioned by our optimal partitioning strategy.
摘要
•We develop an optimal partitioning strategy to improve the performance of SpMV.•We present a reordering algorithm in which the time complexity is only .•We employ a hybrid format to store a blocked sparse matrix partitioned by our optimal partitioning strategy.
论文关键词:Blocked format,CPU/GPU,Dynamic programming,Heterogeneous parallel computing,Partitioning,Reordering,Sparse matrix–vector multiplication
论文评审过程:Received 3 January 2016, Revised 9 June 2017, Accepted 26 September 2017, Available online 16 October 2017, Version of Record 13 November 2017.
论文官网地址:https://doi.org/10.1016/j.jcss.2017.09.010