Solving a large scale radiosity problem on GPU-based parallel computers
Abstract
The radiosity equation is widely used in computer graphics and thermal engineering applications. The equation is simple to formulate but challenging to solve when the number of Lambertian surfaces in an application becomes large. In this paper, we present algorithms to compute the view factors and to solve the resulting set of radiosity equations using an out-of-core Cholesky decomposition method, and we detail the algorithmic procedures for both the view factor computation and the Cholesky solver. The data layout of the radiosity matrix follows the block cyclic decomposition scheme used in ScaLAPACK. The parallel computation of the view factors on the GPUs extends algorithms from a serial community code called view3d. To handle large matrices that exceed the GPU device memory, an out-of-core algorithm for parallel Cholesky factorization is implemented. Results of a performance study conducted on Keeneland, a hybrid CPU/GPU cluster at the National Institute for Computational Sciences composed of 264 nodes with multicore CPUs and GPUs, are presented and discussed.
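As a small in-core illustration of why a Cholesky solver applies here, the sketch below assumes the standard radiosity balance B_i = E_i + rho_i * sum_j F_ij B_j and the reciprocity relation A_i F_ij = A_j F_ji. Scaling row i by A_i / rho_i then gives a symmetric positive definite system. The function name, the scaling, and the three-surface test data are illustrative assumptions, not the paper's formulation or its out-of-core GPU implementation.

```python
import numpy as np

def solve_radiosity(F, area, rho, E):
    """Solve B = E + rho * (F @ B) with a dense Cholesky factorization.

    Assumes reciprocity (area[i] * F[i, j] == area[j] * F[j, i]),
    0 < rho < 1, and view-factor row sums <= 1, so that the scaled
    matrix M below is symmetric positive definite.
    """
    # Scale row i by area[i] / rho[i]: M = diag(area / rho) - diag(area) @ F
    M = np.diag(area / rho) - area[:, None] * F
    rhs = area * E / rho
    L = np.linalg.cholesky(M)      # M = L @ L.T, raises LinAlgError if not SPD
    y = np.linalg.solve(L, rhs)    # solve L y = rhs
    return np.linalg.solve(L.T, y) # solve L.T B = y

# Hypothetical three-surface enclosure (made-up numbers for illustration).
area = np.array([1.0, 2.0, 1.5])
# S[i, j] = area[i] * F[i, j] is symmetric, which enforces reciprocity.
S = np.array([[0.0, 0.3, 0.2],
              [0.3, 0.0, 0.4],
              [0.2, 0.4, 0.0]])
F = S / area[:, None]
rho = np.array([0.5, 0.8, 0.3])   # diffuse reflectivities
E = np.array([1.0, 0.0, 0.0])     # emitted radiosity

B = solve_radiosity(F, area, rho, E)
```

The returned `B` can be checked by substituting it back into the unscaled equation, for example with `np.allclose(B, E + rho * (F @ B))`. For matrices that do not fit in memory, the dense factorization here would have to be replaced by a blocked, distributed one such as the abstract describes.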
Keywords: Radiosity, View factor calculation, Cholesky decomposition, Out-of-core algorithm, Hybrid multicore/GPU system
Review history: Received 4 October 2013, Revised 24 January 2014, Available online 19 February 2014.
Official paper URL: https://doi.org/10.1016/j.cam.2014.02.011