A lip-tracking system based on morphological processing and block matching techniques

作者:

Highlights:

摘要

This paper describes the application of image processing techniques in extracting the lip kinematics parameters (velocity and displacement) from image sequences. The centres of the lips are located by morphological image processing and cluster analysis. The motion of the lips is determined by a block matching algorithm. The paper presents a modified block matching algorithm which solves the problems caused by uniform shading and texture. The paper also describes a method which transforms the motion vectors into lip velocities and displacements. Moreover, the correlation between the lip information and the speech signals is demonstrated. Finally, the paper explains how the lip-tracking system can be applied to speech segmentation. The principal results show that lip information alone is not sufficient for speech segmentation. However, lip information may assist an audio speech segmentation system if the speech signals are corrupted by noise.

论文关键词:Lip-reading,Morphological image processing,Block matching algorithm,Motion vector,Motion estimation,Articulatory dynamics,Speech segmentation

论文评审过程:Received 6 July 1993, Available online 14 August 2003.

论文官网地址:https://doi.org/10.1016/0923-5965(94)90019-1