Most approaches for lip modelling are based on heuristic constraints imposed by the user. We describe the use of Active Shape Models for extracting visual speech features for use by automatic speechreading systems, where the deformation of the lip model as well as image search is based on a priori knowledge learned from a training set. We demonstrate the robustness and accuracy of the technique for locating and tracking lips on a database consisting of a broad variety of talkers and lighting conditions.
Nathan Quentin Faivre, Inaki Asier Iturrate Gil, Michael Eric Anthony Pereira, Xiao Hu, Caroline Peters
Anastasia Ailamaki, Iraklis Psaroudakis