Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction

Burger, Thomas; Caplier, Alice; Perret, Pascal

doi:10.1155/2007/73703

Research Article
Open access
Published: 03 January 2008

Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction

Thomas Burger¹,
Alice Caplier² &
Pascal Perret¹

EURASIP Journal on Image and Video Processing volume 2007, Article number: 073703 (2008) Cite this article

1266 Accesses
1 Citations
Metrics details

Abstract

Cued Speech is a specific linguistic code for hearing-impaired people. It is based on both lip reading and manual gestures. In the context of THIMP (Telephony for the Hearing-IMpaired Project), we work on automatic cued speech translation. In this paper, we only address the problem of automatic cued speech manual gesture recognition. Such a gesture recognition issue is really common from a theoretical point of view, but we approach it with respect to its particularities in order to derive an original method. This method is essentially built around a bioinspired method called early reduction. Prior to a complete analysis of each image of a sequence, the early reduction process automatically extracts a restricted number of key images which summarize the whole sequence. Only the key images are studied from a temporal point of view with lighter computation than the complete sequence.

[1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23]

References

Cornett RO: Cued speech. American Annals of the Deaf 1967, 112: 3-13.
Google Scholar
Beautemps D: Telephone for hearing impaired. French RNTS Report 2005. Reseau National des Technologies pour la Santé
Google Scholar
http://www.lis.inpg.fr/pages_perso/caplier/english/geste.html.en/geste_1.html.en.html
Caplier A, Bonnaud L, Malassiotis S, Strintzis M: Comparison of 2D and 3D analysis for automated cued speech gesture recognition. Proceedings of the 9th International Workshop on Speech and Computer (SPECOM '04), September 2004, Saint-Petersburg, Russia
Google Scholar
Attina V, Beautemps D, Cathiard M-A, Odisio M: A pilot study of temporal organization in cued speech production of French syllables: rules for a cued speech synthesizer. Speech Communication 2004,44(1–4):197-214.
Article Google Scholar
Ong SCW, Ranganath S: Automatic sign language analysis: a survey and the future beyond lexical meaning. IEEE Transactions on Pattern Analysis and Machine Intelligence 2005,27(6):873-891. 10.1109/TPAMI.2005.112
Article Google Scholar
Kschischang FR, Frey BJ, Loeliger H-A: Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory 2001,47(2):498-519. 10.1109/18.910572
Article MathSciNet MATH Google Scholar
Rabiner LR: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 1989,77(2):257-286. 10.1109/5.18626
Article Google Scholar
Bilmes J: What HMMs can do. In Tech. Rep. UWEETR-2002-2003. University of Washington, Department Of EE, Seattle, Wash, USA; 2002.
Google Scholar
Burger T, Benoit A, Caplier A: Extracting static hand gestures in dynamic context. Proceedings of the IEEE International Conference on Image Processing (ICIP '06), October 2006, Atlanta, Ga, USA 2081-2084.
Google Scholar
Dorner B, Hagen E: Towards an American sign language interface. Artificial Intelligence Review 1994,8(2-3):235-253. 10.1007/BF00849076
Article Google Scholar
Burger T, Caplier A, Mancini S: Cued speech hand gestures recognition tool. Proceedings of the 13th European Signal Processing Conference (EUSIPCO '05), September 2005, Antalya, Turkey
Google Scholar
Garcia C, Delakis M: Convolutional face finder: a neural architecture for fast and robust face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 2004,26(11):1408-1423. 10.1109/TPAMI.2004.97
Article Google Scholar
Duffner S, Garcia C: A hierarchical approach for precise facial feature detection. Proceedings of Compression et Représentation des Signaux Audiovisuels (CORESA '05), November 2005, Rennes, France
Google Scholar
Barron JL, Fleet DJ, Beauchemin SS: Performance of optical flow techniques. International Journal of Computer Vision 1994,12(1):43-77. 10.1007/BF01420984
Article Google Scholar
Irani M, Rousso B, Peleg S: Computing occluding and transparent motions. International Journal of Computer Vision 1994,12(1):5-16.
Article Google Scholar
Benoit A, Caplier A: Motion estimator inspired from biological model for head motion interpretation. Proceedings of the 6th European Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS '05), April 2005, Montreux, Switzerland
Google Scholar
Wang S, Zhang J, Wang Y, Zhang J, Li B: Simplest operator based edge detection of binary image. Proceedings of the International Computer Congress on Wavelet Analysis and Its Applications, and Active Media Technology, May 2004, Chongqing, China 1: 51-56.
Article MATH Google Scholar
Morris T, Elshehry OS: Hand segmentation from live video. In Proceedings of the International Conference on Imaging Science Systems and Technology (CISST '02), August 2002, Manchester, UK. UMIST;
Google Scholar
Zhang D, Lu G: Evaluation of MPEG-7 shape descriptors against other shape descriptors. Multimedia Systems 2003,9(1):15-30. 10.1007/s00530-002-0075-y
Article Google Scholar
Cortes C, Vapnik V: Support-vector networks. Machine Learning 1995,20(3):273-297.
MATH Google Scholar
Chang C-C, Lin C-J: LIBSVM: a library for support vector machines. 2001.http://www.csie.ntu.edu.tw/~cjlin/libsvm
Google Scholar
http://www.bioid.com/

Download references

Author information

Authors and Affiliations

France Telecom R&D, 28 chemin du Vieux Chêne, Meylan, 38240, France
Thomas Burger & Pascal Perret
GIPSA-Lab/DIS, 46 avenue Félix Viallet, Grenoble Cedex, 38031, France
Alice Caplier

Authors

Thomas Burger
View author publications
You can also search for this author in PubMed Google Scholar
Alice Caplier
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Perret
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thomas Burger.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Burger, T., Caplier, A. & Perret, P. Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction. J Image Video Proc 2007, 073703 (2008). https://doi.org/10.1155/2007/73703

Download citation

Received: 10 January 2007
Revised: 02 May 2007
Accepted: 23 August 2007
Published: 03 January 2008
DOI: https://doi.org/10.1155/2007/73703

Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords