Multi-modal user interaction method based on gaze tracking and gesture recognition

作者:

Highlights:

摘要

This paper presents a gaze tracking technology which provides a convenient human–centric interface for multimedia consumption without any wearable device. It enables a user to interact with various multimedia on a large display in distance by tracking user movement and acquiring high resolution eye images. This paper also presents a gesture recognition technology which is helpful to interact with scene descriptions in terms of controlling and rendering scene objects. It is based on Hidden Markov Model and CRF using a commercial depth sensor. And then, this paper shows a collaboration method with those new sensors and MPEG standards in order to achieve interoperability among interactive applications, new user interaction devices and users.

论文关键词:Gaze tracking,Gesture recognition,User interface,MPEG-U,MPEG-V

论文评审过程:Available online 2 November 2012.

论文官网地址:https://doi.org/10.1016/j.image.2012.10.007