Visual attention guided bit allocation in video compression

作者：

Highlights：

•

摘要

A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773 ± 0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2 dB and on average 0.79 dB.

论文关键词：Visual attention,Video compression,Eye-tracking,Video subjective quality

论文评审过程：Received 2 November 2009, Revised 13 May 2010, Accepted 12 July 2010, Available online 18 July 2010.

论文官网地址：https://doi.org/10.1016/j.imavis.2010.07.001