Modelling and recognition of the linguistic components in American Sign Language
Authors:
Abstract
The manual signs in sign languages are generated and interpreted using three basic building blocks: handshape, motion, and place of articulation. When combined, these three components (together with palm orientation) uniquely determine the meaning of the manual sign. This means that pattern recognition techniques that employ only a subset of these components are inappropriate for interpreting the sign or for building automatic recognizers of the language. In this paper, we define an algorithm to model these three basic components from a single video sequence of two-dimensional pictures of a sign. The recognition results for the three components are then combined to determine the class of the signs in the videos. Experiments are performed on a database of (isolated) American Sign Language (ASL) signs. The results demonstrate that, using semi-automatic detection, all three components can be reliably recovered from two-dimensional video sequences, allowing for an accurate representation and recognition of the signs.
Keywords: American Sign Language, Handshape, Motion reconstruction, Multiple cue recognition, Computer vision
Review history: Received 5 May 2008, Revised 3 February 2009, Accepted 4 February 2009, Available online 26 February 2009.
Official paper URL: https://doi.org/10.1016/j.imavis.2009.02.005