Multi-modal user interaction method based on gaze tracking and gesture recognition
作者:
Highlights:
•
摘要
This paper presents a gaze tracking technology which provides a convenient human–centric interface for multimedia consumption without any wearable device. It enables a user to interact with various multimedia on a large display in distance by tracking user movement and acquiring high resolution eye images. This paper also presents a gesture recognition technology which is helpful to interact with scene descriptions in terms of controlling and rendering scene objects. It is based on Hidden Markov Model and CRF using a commercial depth sensor. And then, this paper shows a collaboration method with those new sensors and MPEG standards in order to achieve interoperability among interactive applications, new user interaction devices and users.
论文关键词:Gaze tracking,Gesture recognition,User interface,MPEG-U,MPEG-V
论文评审过程:Available online 2 November 2012.
论文官网地址:https://doi.org/10.1016/j.image.2012.10.007