Statistical Gesture Models for 3D Motion Capture from a Library of Gestures with Variants

Li, Zhenbo; Horain, Patrick; Pez, André-Marie; Pelachaud, Catherine

doi:10.1007/978-3-642-12553-9_19

Zhenbo Li²¹,
Patrick Horain²¹,
André-Marie Pez²² &
…
Catherine Pelachaud^22,23

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5934))

Included in the following conference series:

International Gesture Workshop

1503 Accesses
2 Citations

Abstract

A challenge for 3D motion capture by monocular vision is 3D-2D projection ambiguities that may bring incorrect poses during tracking. In this paper, we propose improving 3D motion capture by learning human gesture models from a library of gestures with variants. This library has been created with virtual human animations. Gestures are described as Gaussian Process Dynamic Models (GPDM) and are used as constraints for motion tracking. Given the raw input poses from the tracker, the gesture model helps to correct ambiguous poses. The benefit of the proposed method is demonstrated with results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 42.79; Price includes VAT (Germany)

Softcover Book: EUR 53.49; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Monocular Pose Capture with a Depth Camera Using a Sums-of-Gaussians Body Model

Laban movement analysis and hidden Markov models for dynamic 3D gesture recognition

Article Open access 01 August 2017

Non-trajectory-based gesture recognition in human-computer interaction based on hand skeleton data

Article 11 March 2022

References

Vilhjálmsson, H.: Avatar Augmented Online Conversation, Ph.D. thesis, Media Arts and Sciences, Massachusetts Institute of Technology, Media Laboratory, Cambridge, MA (2003)
Google Scholar
Horain, P., Marques Soares, J., Rai, P.K., Bideau, A.: Virtually enhancing the perception of user actions. In: 15^th International Conference on Artificial Reality and Telexistence (ICAT 2005), Christchurch, New Zealand, pp. 245–246 (2005), doi:10.1145/1152399.1152446
Google Scholar
Moeslund, T., Hilton, A., Kruger, V.: A survey of advances in vision-based human motion capture and analysis. Computer vision and image understanding 104(2-3), 90–126 (2006)
Article Google Scholar
Poppe, R.W.: Vision-based human motion analysis: An overview. Computer Vision and Image Understanding 108(1-2), 4–18 (2007)
Article Google Scholar
Poggi, I.: Mind, Hands, Face and Body. In: A Goal and Belief View of Multimodal Communication, vol. 19. Weidler Verlag, Körper (2007)
Google Scholar
Bevacqua, E., Mancini, M., Niewiadomski, R., Pelachaud, C.: An expressive ECA showing complex emotions. In: AISB 2007 Annual convention, workshop Language, Speech and Gesture for Expressive Characters, Newcastle, UK, pp. 208–216 (2007)
Google Scholar
Wang, J.M., Fleet, D.J., Hertzmann, A.: Gaussian Process Dynamical Models for Human Motion. IEEE Transactions on PAMI 30(2), 283–298 (2008)
Google Scholar
Pullen, K., Bregler, C.: Motion capture assisted animation: Texturing and synthesis. In: SIGGRAPH 2002, pp. 501–508 (2002)
Google Scholar
Safonova, A., Hodgins, J.K., Pollard, N.S.: Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces. ACM Transactions on Graphics 23(3), 524–521 (2004)
Google Scholar
Elgammal, A.M., Lee, C.-S.: Inferring 3D body pose from silhouettes using activity manifold learning. In: Conference on Computer Vision and Pattern Recognition (CVPR 2004), vol. 2, pp. 681–688 (2004)
Google Scholar
Grochow, K., Martin, S.L., Hertzmann, A., Popovic, Z.: Style-based inverse kinematics. ACM Transactions on Graphics 23(3), 522–531 (2004)
Article Google Scholar
Teh, Y.W., Roweis, S.T.: Automatic alignment of local representations. In: Neural Information Processing Systems 15 (NIPS 2002), pp. 841–848 (2003)
Google Scholar
Lawrence, N.D.: Gaussian process latent variable models for visualisation of high dimensional data. In: Thrun, S., Saul, L., Schölkopf, B. (eds.) Advances in Neural Information Processing Systems, pp. 329–336. MIT Press, Cambridge (2004)
Google Scholar
Carreira-Perpiñán, M.Á., Lu, Z.: The Laplacian Eigenmaps Latent Variable Model. In: 11^th International Conference on Artificial Intelligence and Statistics (AISTATS), Puerto Rico (2007)
Google Scholar
Urtasun, R., Fleet, D.J., Hertzmann, A., Fua, P.: Priors for people tracking from small training sets. In: International Conference On Computer Vision (ICCV 2005), Bei**g, China, vol. 1, pp. 403–410 (2005)
Google Scholar
Urtasun, R., Fleet, D.J., Fua, P.: 3D people tracking with Gaussian process dynamical models. In: Conference on Computer Vision and Pattern Recognition (CVPR 2006), New York, NY, vol. 1, pp. 238–245 (2006)
Google Scholar
Raskin, L., Rivlin, E., Rudzsky, M.: Dimensionality Reduction for Articulated Body Tracking. In: 3DTV 2007, pp. 1–4 (2007)
Google Scholar
Gómez Jáuregui, D.A., Horain, P.: Region-based vs. edge-based registration for 3D motion capture by real time monoscopic vision. In: Gagalowicz, A., Philips, W. (eds.) MIRAGE 2009. LNCS, vol. 5496, pp. 344–355. Springer, Heidelberg (2009)
Chapter Google Scholar
Moon, K., Pavlovic, V.I.: Impact of dynamics on subspace embedding and tracking of sequences. In: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR 2006), June 2006, New York, NY, vol. 1, pp. 198–205 (2006)
Google Scholar
Lu, Z., Carreira-Perpiñán, M.Á., Sminchisescu, C.: People Tracking with the Laplacian Eigenmaps Latent Variable Model. In: Advances in Neural Information Processing Systems, NIPS, vol. 21 (2007)
Google Scholar
Isard, M., Blake, A.: Condensation-conditional density propagation for visual tracking. Int. J. Computer Vision 29(1), 5–28 (1998)
Article Google Scholar
Calbris, G.: The semiotics of French gestures. University Press, Bloomington (1990)
Google Scholar
Gallaher, P.E.: Individual differences in nonverbal behavior: Dimensions of style. Journal of Personality and Social Psychology 63(1), 133–145 (1992)
Article Google Scholar
Mancini, M., Pelachaud, C.: Distinctiveness in multimodal behaviors. In: 7^th International Joint Conference on Autonomous Agents and Multi-Agent Systems, AAMAS 2008, Estoril Portugal (May 2008)
Google Scholar
Kipp, M.: Anvil - A Generic Annotation Tool for Multimodal Dialogue. In: 7^th European Conference on Speech Communication and Technology (Eurospeech), Aalborg, pp. 1367–1370 (2001)
Google Scholar
Davis, J., Agrawala, M., Chuang, E., Popovic, Z., Salesin, D.: A Sketching Interface for Articulated Figure Animation. In: Eurographics/SIGGRAPH Symposium on Computer Animation, SCA (2003)
Google Scholar
Sam, R., Lawrence, S.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000)
Article Google Scholar
Tenenbaum, J.B., de Silva, V., Langford, J.C.: A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290(5500), 2319–2323 (2000)
Article Google Scholar
Neal, R.M.: Bayesian Learning for Neural Networks. Lecture Notes in Statistics, vol. 118. Springer, Heidelberg (1996)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institut Telecom, Telecom SudParis, 9 rue Charles Fourier, 91011, Evry Cedex, France
Zhenbo Li & Patrick Horain
Institut Telecom, Telecom ParisTech, 46 rue Barrault, 75634, Paris Cedex 13, France
André-Marie Pez & Catherine Pelachaud
CNRS, LTCI, 46 rue Barrault, 75634, Paris Cedex 13, France
Catherine Pelachaud

Authors

Zhenbo Li
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Horain
View author publications
You can also search for this author in PubMed Google Scholar
André-Marie Pez
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Pelachaud
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CITEC, Bielefeld University, P.O. Box 100131, 33501, Bielefeld, Germany
Stefan Kopp
Artificial Intelligence Group, Faculty of Technology, Bielefeld University, 33594, Bielefeld, Germany
Ipke Wachsmuth

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, Z., Horain, P., Pez, AM., Pelachaud, C. (2010). Statistical Gesture Models for 3D Motion Capture from a Library of Gestures with Variants. In: Kopp, S., Wachsmuth, I. (eds) Gesture in Embodied Communication and Human-Computer Interaction. GW 2009. Lecture Notes in Computer Science(), vol 5934. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12553-9_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-12553-9_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12552-2
Online ISBN: 978-3-642-12553-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Statistical Gesture Models for 3D Motion Capture from a Library of Gestures with Variants

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Monocular Pose Capture with a Depth Camera Using a Sums-of-Gaussians Body Model

Laban movement analysis and hidden Markov models for dynamic 3D gesture recognition

Non-trajectory-based gesture recognition in human-computer interaction based on hand skeleton data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Statistical Gesture Models for 3D Motion Capture from a Library of Gestures with Variants

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Monocular Pose Capture with a Depth Camera Using a Sums-of-Gaussians Body Model

Laban movement analysis and hidden Markov models for dynamic 3D gesture recognition

Non-trajectory-based gesture recognition in human-computer interaction based on hand skeleton data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation