Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

Seymour, Rowan; Stewart, Darryl; Ming, Ji

doi:10.1155/2008/810362

Research Article
Open access
Published: 17 December 2007

Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

Rowan Seymour¹,
Darryl Stewart¹ &
Ji Ming¹

EURASIP Journal on Image and Video Processing volume 2008, Article number: 810362 (2007) Cite this article

1560 Accesses
35 Citations
3 Altmetric
Metrics details

Abstract

We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static and dynamic visual features. The performance of the features will be tested on both clean video data and also video data corrupted in a variety of ways to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter which simulates camera and/or head movement during recording.

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

School of Electronics, Electrical Engineering and Computer Science, Queen's University of Belfast, Belfast, BT7 1NN, Northern Ireland, UK
Rowan Seymour, Darryl Stewart & Ji Ming

Authors

Rowan Seymour
View author publications
You can also search for this author in PubMed Google Scholar
Darryl Stewart
View author publications
You can also search for this author in PubMed Google Scholar
Ji Ming
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Darryl Stewart.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Seymour, R., Stewart, D. & Ming, J. Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos. J Image Video Proc 2008, 810362 (2007). https://doi.org/10.1155/2008/810362

Download citation

Received: 28 February 2007
Revised: 13 September 2007
Accepted: 17 December 2007
Published: 17 December 2007
DOI: https://doi.org/10.1155/2008/810362

Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

Abstract

Publisher note

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords