A perceptually optimised video coding system for sign language communication at low bit rates

Abstract

The ability to communicate remotely by video, as promised by wireless networks and already practised over fixed networks, is as important for deaf people as voice telephony is for hearing people. Sign languages are visual–spatial languages and as such demand good image quality for interaction and understanding. In this paper, we first analyse the sign language viewer's eye gaze, based on the results of an eye-tracking study that we conducted, as well as the video content involved in person-to-person sign language communication. Based on this analysis, we propose a sign language video coding system using foveated processing, which can lead to bit rate savings without compromising the comprehension of the coded sequence or, equivalently, produce a coded sequence with higher comprehension value at the same bit rate. We support this claim with the results of an initial comprehension assessment trial of such coded sequences by deaf users. The proposed system constitutes a new paradigm for coding sign language image sequences at limited bit rates.
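To make the idea of foveated processing concrete, the following is a minimal sketch and not the coding system proposed in the paper. It assumes the viewer fixates on a single point (for illustration, near the signer's face) and assigns each macroblock an H.264 quantisation parameter (QP) that grows with distance from that point, so that fewer bits are spent in the visual periphery. The function name `foveated_qp_map`, the linear fall-off, and all default values are hypothetical choices made for this example.

```python
import math

# Valid QP range for 8-bit H.264 video.
H264_QP_MIN, H264_QP_MAX = 0, 51


def foveated_qp_map(cols, rows, fixation, base_qp=28, max_offset=10,
                    fovea_radius=2.0, falloff=6.0):
    """Return a rows x cols grid of macroblock QPs, coarser away from fixation.

    All parameters are illustrative assumptions:
      fixation     -- (x, y) macroblock coordinates of the assumed gaze point
      base_qp      -- QP used inside the foveal region
      max_offset   -- largest QP increase applied in the far periphery
      fovea_radius -- radius (in macroblocks) coded at base_qp
      falloff      -- distance (in macroblocks) over which QP ramps up
    """
    fx, fy = fixation
    qp_map = []
    for y in range(rows):
        row = []
        for x in range(cols):
            dist = math.hypot(x - fx, y - fy)
            # 0 inside the fovea, rising linearly to 1 over `falloff` macroblocks.
            t = min(1.0, max(0.0, (dist - fovea_radius) / falloff))
            qp = base_qp + round(max_offset * t)
            row.append(min(H264_QP_MAX, max(H264_QP_MIN, qp)))
        qp_map.append(row)
    return qp_map


# A QCIF frame (176x144 pixels) is 11x9 macroblocks of 16x16 pixels.
# The fixation point (5, 2) is an assumed face position, not measured data.
qp_map = foveated_qp_map(11, 9, fixation=(5, 2))
for row in qp_map:
    print(" ".join(f"{qp:2d}" for qp in row))
```

In a real encoder such a map would feed the rate control stage, which would also have to balance it against the target bit rate; that interaction is outside the scope of this sketch.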

Keywords: Sign language video coding, Eye tracking, Foveated video coding, Rate control, H.264

Review history: Received 24 May 2005, Revised 17 February 2006, Accepted 20 February 2006, Available online 29 March 2006.

Official paper URL: https://doi.org/10.1016/j.image.2006.02.003