Cross-Media Semantics Mining Based on Sparse Canonical Correlation Analysis and Relevance Feedback

Zhang, Hong; Liu, Xiaoming

doi:10.1007/978-3-642-34778-8_71


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7674)

Included in the following conference series:

Pacific-Rim Conference on Multimedia


Abstract

Cross-media learning is an emerging topic in multimedia content analysis and retrieval. Because multimedia data of different modalities are heterogeneous in feature space, and because of the well-known semantic gap, one of the most challenging problems in cross-media learning is mining the underlying semantics and estimating cross-media correlation. In this paper we propose a cross-media semantics mining approach based on Sparse Canonical Correlation Analysis and relevance feedback. First, in the training stage, we analyze the sparse canonical correlation between low-level feature matrices of different modalities and construct a Multimodal Sparse Subspace in which both the canonical correlation and the most meaningful features are preserved. Then, based on geometric distance in this subspace, we estimate cross-media correlation and enable cross-media retrieval. We also provide a long-term relevance feedback strategy for performance optimization. Our approach is tested on general multimedia data, including image, audio, and text. Experimental and comparison results are encouraging and indicate that the approach is effective.
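The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea and not the authors' implementation. It assumes paired feature matrices for two modalities, substitutes a simple soft-thresholded alternating iteration (in the style of penalized matrix decomposition) for whatever sparse CCA solver the paper uses, and ranks items of one modality by Euclidean distance in the learned subspace. The function names, the `lam_x`/`lam_y` sparsity parameters, and the synthetic data are all hypothetical. The relevance feedback stage is not sketched, since the abstract does not describe it.

```python
import numpy as np


def soft_threshold(a, lam):
    """Shrink entries toward zero; entries with |a| <= lam become exactly 0."""
    return np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)


def unit(a):
    """Scale to unit Euclidean norm (an all-zero vector is returned unchanged)."""
    n = np.linalg.norm(a)
    return a / n if n > 0 else a


def sparse_cca(X, Y, k=2, lam_x=0.3, lam_y=0.3, n_iter=50):
    """Hypothetical helper: k pairs of sparse projection vectors for paired X, Y.

    lam_x and lam_y (in [0, 1)) are illustrative sparsity knobs, expressed as a
    fraction of the largest absolute entry before thresholding.
    """
    Xc, Yc = X - X.mean(axis=0), Y - Y.mean(axis=0)
    C = Xc.T @ Yc  # cross-covariance up to a constant factor
    U, V = np.zeros((X.shape[1], k)), np.zeros((Y.shape[1], k))
    for j in range(k):
        # Start from the leading right singular vector of the current C.
        v = np.linalg.svd(C, full_matrices=False)[2][0]
        u = np.zeros(X.shape[1])
        for _ in range(n_iter):
            a = C @ v
            u = unit(soft_threshold(a, lam_x * np.abs(a).max()))
            b = C.T @ u
            v = unit(soft_threshold(b, lam_y * np.abs(b).max()))
        U[:, j], V[:, j] = u, v
        C = C - (u @ C @ v) * np.outer(u, v)  # deflate before the next pair
    # Rescale so each projected coordinate has unit variance in both modalities.
    sx, sy = (Xc @ U).std(axis=0), (Yc @ V).std(axis=0)
    U = U / np.where(sx > 0, sx, 1.0)
    V = V / np.where(sy > 0, sy, 1.0)
    return U, V


def rank_by_distance(query_z, gallery_Z):
    """Indices of gallery items, nearest first, by Euclidean distance."""
    return np.argsort(np.linalg.norm(gallery_Z - query_z, axis=1))


# Synthetic stand-ins for two modalities that share one latent factor z.
# Only the first 5 columns of X and the first 4 columns of Y carry signal.
rng = np.random.default_rng(0)
z = rng.normal(size=(300, 1))
X = np.hstack([z + 0.3 * rng.normal(size=(300, 5)), rng.normal(size=(300, 15))])
Y = np.hstack([z + 0.3 * rng.normal(size=(300, 4)), rng.normal(size=(300, 11))])

U, V = sparse_cca(X, Y, k=2)
Zx = (X - X.mean(axis=0)) @ U  # modality-X items in the shared subspace
Zy = (Y - Y.mean(axis=0)) @ V  # modality-Y items in the shared subspace
ranking = rank_by_distance(Zx[0], Zy)  # Y items ordered for the first X query
```

In this sketch the thresholding sets the weights of weakly correlated features to exactly zero, which is one way to read "most meaningful features are preserved"; the unit-variance rescaling is an added assumption that keeps distances comparable across the two modalities.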

This work is supported by the National Natural Science Foundation of China (No. 61003127).



Author information

Authors and Affiliations

College of Computer Science & Technology, Wuhan University of Science & Technology, 430065, China
Hong Zhang & Xiaoming Liu


Editor information

Editors and Affiliations

School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798, Singapore
Weisi Lin, Dong Xu, Jianxin Wu, Ying He & Jianfei Cai
Department of Computing, University of Surrey, GU2 7XH, Guildford, UK
Anthony Ho
Department of Computer Science, School of Computing, National University of Singapore, Building AS6, Room #05-06, 117417, Singapore
Mohan Kankanhalli
Department of Electrical Engineering, University of Washington, M418 EE/CSE, Box 352500, 98195, Seattle, WA, USA
Ming-Ting Sun


About this paper

Cite this paper

Zhang, H., Liu, X. (2012). Cross-Media Semantics Mining Based on Sparse Canonical Correlation Analysis and Relevance Feedback. In: Lin, W., et al. Advances in Multimedia Information Processing – PCM 2012. PCM 2012. Lecture Notes in Computer Science, vol 7674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34778-8_71


DOI: https://doi.org/10.1007/978-3-642-34778-8_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34777-1
Online ISBN: 978-3-642-34778-8
eBook Packages: Computer Science, Computer Science (R0)
