Multi-modal user interaction method based on gaze tracking and gesture recognition


Abstract

This paper presents a gaze tracking technology that provides a convenient human-centric interface for multimedia consumption without requiring any wearable device. It lets a user interact with various multimedia content on a large display from a distance by tracking the user's movement and acquiring high-resolution eye images. The paper also presents a gesture recognition technology for interacting with scene descriptions, specifically for controlling and rendering scene objects. It is based on hidden Markov models (HMMs) and conditional random fields (CRFs) and uses a commercial depth sensor. Finally, the paper shows how these new sensors can be combined with MPEG standards to achieve interoperability among interactive applications, new user interaction devices, and users.

Keywords: Gaze tracking, Gesture recognition, User interface, MPEG-U, MPEG-V

Publication history: Available online 2 November 2012.

Paper URL: https://doi.org/10.1016/j.image.2012.10.007