Survey of Transfer Learning and a Case Study of Emotion Recognition Using Inductive Approach

  • Conference paper
  • First Online:
Data Management, Analytics and Innovation

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1175))

Abstract

In the era of rapid processing, there is a need for application developments that work with different datasets. The novel learning algorithms designed to handle different training and testing sets. Transfer learning allows domains, tasks, and distributions used in training and testing to be different. The concept of Transfer Learning [TL] is motivated by the detail that we can smear knowledge learned formerly to resolve new complications with ease, reusing the knowledge domain. In this paper, we would like to present inductive TL mechanisms to predict emotions from two different languages English and German, where knowledge of emotion identification exists in English language and is extended to learn emotions in German speech. We have attempted to uncover latent features of one language in another language.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. W. Dai, W. Yang, G. Xue Y. Yu, Self-taught Clustering, in 25th ACM International Conference of Machine Learning (2008), pp. 200–207

    Google Scholar 

  2. I. Davidson, S.S. Ravi, Intractability and Clustering with Constraints, in 24th International Conference on Machine Learning (2007), pp. 201–208

    Google Scholar 

  3. G. Grifflin, A. Holub, P. Perona, Caltech-256 Object Category Dataset, Technical Report 7694 (California Institute of Technology, 2007), pp. 1–20

    Google Scholar 

  4. B. Nelson, I. Cohen, Revisiting probabilistic models for clustering with pair-wise constraints, in 24th International Conference on Machine Learning (2007), pp. 673–680

    Google Scholar 

  5. R. Raina, A. Battle, H. Lee, B. Packer, A.Y. Ng, Self-taught learning: transfer learning from unlabeled data, in 24th International Conference on Machine Learning (2007), pp. 759–766

    Google Scholar 

  6. Z. Wang, Y. Song, C. Zhang, Transferred dimensionality reduction, in European Conference on Machine Learning and Knowledge Discovery in Databases. Springer (2008), pp. 550–565

    Google Scholar 

  7. D. Cai, X. He, J. Han, Semi-supervised discriminant analysis, in IEEE International Conference on Computer Vision (2007), pp. 1–7

    Google Scholar 

  8. W. Dai, Q. Yang, G. Xue, Y. Yu, Boosting for transfer learning, in International Conference on Machine Learning (2007), pp. 193–200

    Google Scholar 

  9. G.B. Huang, M. Ramesh, T. Berg, E. Miller, Labeled faces in the wild: a database for studying face recognition in unconstrained environments, in Workshop on Faces in Real-Life Images: Detection, Alignment and Recognition (2008), pp. 1–11

    Google Scholar 

  10. M. Wu, B. Scholkopf, A local learning approach for clustering, in Conference on Neural Information Processing Systems (2007), pp. 1529–1536

    Google Scholar 

  11. J. Ye, Z. Zhao, M. Wu, Discriminative K-means for clustering, in Conference on Neural Information Processing Systems (2007), pp. 1–8

    Google Scholar 

  12. W. Dai, G. Xue, Q. Yang, Y. Yu, Transferring Naïve Bayes classifiers for text classification, in 22nd AAAI Conference on Artificial Intelligence (2007), pp. 540–545

    Google Scholar 

  13. R. Raina, A.Y. Ng, D. Koller, Constructing informative priors using transfer learning, in 23rd International Conference on Machine Learning (2006), pp. 713–720

    Google Scholar 

  14. J. Quionero-Candela, M. Sugiyama, A. Schwaighofer, N.D. Lawrence, Dataset Shift in Machine Learning (The MIT Press, 2009)

    Google Scholar 

  15. J. Jiang, C. Zhai, Instance weighting for domain adaptation in NLP, in 45th Annual Meeting of the Association of Computational Linguistics (2007), pp. 264–271

    Google Scholar 

  16. J. Blitzer, R. McDonald, F. Pereira, Domain adaptation with structural correspondence learning, in Conference on Empirical Methods in Natural Language Processing (2006), pp. 120–128

    Google Scholar 

  17. X. Liao, Y. Xue, L. Carin, Logistic regression with an auxiliary data source, in 21st International Conference on Machine Learning (2005), pp. 505–512

    Google Scholar 

  18. P. Wu, T.G. Dietterich, Improving SVM accuracy by training on auxiliary data sources, in 21st International Conference on Machine Learning (2004), pp. 1–8

    Google Scholar 

  19. J. Juang, A. Smola, A. Gretton, K.M. Borgwardt, B. Scholkopf, Correcting sample selection bias by unlabeled data, in 19th Annual Conference on Neural Informaion Processing Systems (2007), pp. 1–8

    Google Scholar 

  20. A. Gretton, K. Borgwardt, M. Rasch, B. Scholkof, A. Smola, A kernel method for the two-sample problem, in Conference on Neural Information Processing Systems (MIT Press, 2006), pp. 1–8

    Google Scholar 

  21. Bickel, S., Bruckner, M. and Scheffer, T.: Discriminative Learning for Differing Training and Test Distributions, 24th ACM International Conference on Machine Learning, pp. 81–88, (2007)

    Google Scholar 

  22. S. Bickel, T. Scheffer, Dirichlet-enhanced spam fltering based on biased samples, in Conference on Neural Information Processing Systems (2006), pp. 1–8

    Google Scholar 

  23. Y. Xue, X. Liao, L. Carin, B. Krishnapuram, Multi-task learning for classification with dirichlet process priors. J. Mach. Learn. Res. 8, 35–63 (2007)

    MathSciNet  MATH  Google Scholar 

  24. M. Sugiyama, S. Nikajima, H. Kashima, P.V. Buenau, M. Kawanabe, direct importance estimation with model selection and its application to covariate shift adaptation, in 20th Annual Conference on Neural Information Processing Systems (2007), pp. 1433–1440

    Google Scholar 

  25. M. Sugiyama, M. Krauledat, K.R. Muller, Covariate shift adaptation by importance weighted cross validation. J. Mach. Learn. Res. 8, 985–1005 (2007)

    MATH  Google Scholar 

  26. S. Rosset, J. Zhu, H. Zou, T.J. Hastie, A method for inferring label sampling mechanisms in semi-supervised learning, in Conference on Neural Information Processing Systems (2004), pp. 1–8

    Google Scholar 

  27. W. Dai, G. Xue, Q. Yang, Y. Yu, Co-clustering based classification for out-of-domain documents, in 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007), pp. 210–219

    Google Scholar 

  28. J. Gao, P.N. Tan, H. Cheng, Semi-supervised clustering with partial background information, in 6th SIAM International Conference on Data Mining (2006), pp. 489–493

    Google Scholar 

  29. S. Swarup, S.R. Ray, Cross-domain knowledge transfer using structured representations, in 21st National Conference on Artificial Intelligence (2006), pp. 1–5

    Google Scholar 

  30. R.K. Ando, T. Zhang, A high-performance semi-supervised learning method for text Chunking, in 43rd Annual Meeting on Association for Computational Linguistics (2005), pp. 1–9

    Google Scholar 

  31. R.K. Ando, T. Zhang, A framework for learning predictive structures from multiple tasks and unlabeled data. J. Mach. Learn. Res. 6, 1817–1853 (2005)

    MathSciNet  MATH  Google Scholar 

  32. S. Miller, J. Guinness, A. Zamanian, Name tagging with word clusters and discriminative training, in Conference on Human Language Technologies-North American Chapter of the Association for Computational Linguistics (2004), pp. 1–6

    Google Scholar 

  33. F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, B. Weiss, A database of german emotional speech, in Interspeech (2005), pp. 1–4

    Google Scholar 

  34. K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, Y. Singer, Online passive-aggressive algorithms. J. Mach. Learn. Res. 7, 551–585 (2006)

    MathSciNet  MATH  Google Scholar 

  35. H III Daume, Frustratingly easy domain adaptation, in 45th Annual Meeting of the Association of Computational Linguistics (2007), pp. 256–263

    Google Scholar 

  36. Search-based Structured Prediction, Daume, H III, J. Langford, and D. Marcu. Mach. Learn. J. 75, 297–325 (2009)

    Article  Google Scholar 

  37. A. Argyriou, T. Evgeniou, M. Pontil, Multi-task feature learning, in 19th Annual Conference on Neural Information Processing Systems (2007), pp. 41–48

    Google Scholar 

  38. J. Zhang, Z. Ghahramani, Y. Yang, Learning multiple related tasks using latent independent component analysis, in Conference on Neural Information Processing Systems (2006), pp. 1–8

    Google Scholar 

  39. A. Argyriou, C.A. Micchelli, M. Pontil, Y. Ying, A spectral regularization framework for multi-task structure learning, in 20th Annual Conference on Neural Information Processing Systems (MIT Press, 2008), pp. 25–32

    Google Scholar 

  40. S.G. Koolagudi, K.S. Rao, Real life emotion classification using VOP and pitch based spectral features, in IEEE India Conference, INDICON (2010), pp. 1–4

    Google Scholar 

  41. S.I. Lee, V. Chatalbashev, D. Vickrey, D. Kollar, Learning a meta-level prior for feature relevance from multiple related tasks, in 24th ACM International Conference on Machine Learning (2007), pp. 489–496

    Google Scholar 

  42. R. Raina, A.Y. Ng, D. Koller, Transfer learning by constructing informative priors, in 21st International Conference on Machine Learning (2006), pp. 1–4

    Google Scholar 

  43. T. Jebara, Multi-task feature and kernel selection for SVMs, in 21st ACM International Conference on Machine Learning (2004), pp. 1–8

    Google Scholar 

  44. S. Ben-David, R. Schuller, Exploiting task relatedness for multiple task learning, in Conference on Learning Theory and Kernel Machines. LNCS Springer, vol. 2777 (2003), pp. 567–580

    Google Scholar 

  45. C. Wang, S. Mahadevan, Manifold alignment using procrustes analysis, in 25th ACM International Conference on Machine Learning (2008), pp. 1120–1127

    Google Scholar 

  46. F. Diaz, D. Metzler, Pseudo-aligned multilingual corpora, in International Joint Conference on Artificial Intelligence (2007), pp. 2727–2732

    Google Scholar 

  47. N.D. Lawrence, J.C. Platt, Learning to learn with the informative vector machine, in 21st ACM International Conference on Machine Learning (2004), pp. 1–8

    Google Scholar 

  48. N.D. Lawrence, M. Seeger, R. Herbrich, Fast Sparse Gaussian process methods: the informative vector machine, in Conference on Neural Information Processing Systems (MIT Press, 2002), pp. 625–632

    Google Scholar 

  49. E. Bonilla, K.M. Chai, C. Williams, Multi-task Gaussian process prediction, in 20th Annual Conference on Neural Information Processing Systems (MIT Press, 2007), pp. 153–160

    Google Scholar 

  50. E.V. Bonilla, F.V. Agakov, C.K.I. Williams, Kernel Multi-task learning using task-specific features, in 11th International Conference on Artificial Intelligence and Statistics (2007), pp. 43–50

    Google Scholar 

  51. O. Chapelle, Z. Harchaoui, A machine learning approach to conjoint analysis, in International Conference on Neural Information Processing Systems (MIT Press, 2004), pp. 257–264

    Google Scholar 

  52. T. Evgeniou, M. Pontil, Regularized multi-task learning, in 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2004), pp. 109–117

    Google Scholar 

  53. B. Bakker, T. Heskes, Task clustering and gating for bayesian multitask learning. J. Mach. Learn. Res. 4, 83–99 (2003)

    MATH  Google Scholar 

  54. J. Gao, W. Fan, J. Jiang, J. Han, Knowledge transfer via multiple model local structure map**, in 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008), pp. 283–291

    Google Scholar 

  55. S. Ben-David, J. Blitzer, K. Crammer, F. Pereira, Analysis of representations for domain adaptation, in 19th International Conference on Neural Information Processing Systems (2006), pp. 137–144

    Google Scholar 

  56. L. Mihalkova, T. Huynh, R.J. Mooney, Map** and revising Markov logic networks for transfer learning, in 22nd AAAI Conference on Artificial Intelligence (2007), pp. 608–614

    Google Scholar 

  57. O. Kwon, K. Chan, J. Hao, T. Lee, Emotion recognition by speech signals, in 8th European Conference on Speech Communication and Technology (2003), pp. 125–128

    Google Scholar 

  58. L. Mihalkova, R.J. Mooney, Transfer learning by map** with minimal target data, in AAAI-08 Workshop on Transfer Learning for Complex Tasks (2008), pp. 1–6

    Google Scholar 

  59. M.E. Taylor, G. Kuhlmann, P. Stone, Autonomous transfer for reinforcement learning, in The Autonomous Agents and Multi-Agent Systems Conference (2008), pp. 283–290

    Google Scholar 

  60. J. Davis, P. Domingos, Deep tansfer via second-order Markov logic, in 26th ACM Annual Conference on Machine Learning (2009), pp. 217–224

    Google Scholar 

  61. W. Bridewell, L. Todorovski, Learning declarative bias, in International Conference on Inductive Logic Programming. LNCS Springer, vol. 4894 (2007), pp. 63–77

    Google Scholar 

  62. M. Lugger, B. Yang, The relevance of voice quality features in speaker independent emotion recognition, in IEEE International Conference on Acoustics, Speech and Signal Processing (2007), pp. IV17–IV20

    Google Scholar 

  63. S. Wu, T.H. Falk, W.Y. Chan, Automatic recognition of speech emotion using long-term spectro-temporal features, in 16th International Conference on Digital Signal Processing (2009), pp. 1–6

    Google Scholar 

  64. U. Ruckert, S. Kramer, Kernel-based Inductive Transfer, in European Conference on Machine Learning and Knowledge Discovery in Databases. LNCS Springer (2008), pp. 220–233

    Google Scholar 

  65. S. Kaski, J. Peltonen, Learning from relevant tasks only, in European Conference on Machine Learning. LNCS Springer, vol. 4701 (2007), pp. 608–615

    Google Scholar 

  66. H. Lee, A. Battle, R. Raina, A.Y. Ng, Efficient sparse coding algorithms, in 19th Annual Conference on Neural Information Processing Systems (MIT Press, 2007), pp. 801–808

    Google Scholar 

  67. S. Perkins, J. Theiler, Online feature selection using grafting, in 12th International Conference on Machine Learning (2003), pp. 592–599

    Google Scholar 

  68. S.J. Pan, Q. Yang, A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2009)

    Article  Google Scholar 

  69. S.G. Koolagudi, K.S. Rao, Emotion recognition from speech: a review. Int. J. Speech Technol. Springer 15(2), 99–117 (2012)

    Article  Google Scholar 

  70. N. Cummins, S. Amiriparian, G. Hagerer, A. Batliner, S. Steidl, B.W. Schuller, An Image-based deep spectrum feature representation for the recognition of emotional speech, in ACM Conference on Multimedia (2017), pp. 478–484

    Google Scholar 

  71. W. Lim, D. Jang, T. Lee, Speech emotion recognition using convolutional and recurrent neural networks, in IEEE Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

    Google Scholar 

  72. B.W. Schuller, S. Steidl, A. Batliner, The interspeech 2009 emotion challenge, in Interspeech (2009), pp. 1–4

    Google Scholar 

  73. F. Yu, E. Chang, Y.Q. Xu, H.Y. Shum, Emotion detection from speech to enrich multimedia content, in Advances in Multimedia Information Processing-Pacific-Rim Conference on Multimedia. LNCS Springer, vol. 2195 (2001), pp. 550–557

    Google Scholar 

  74. Y. Zhou, Y Sun, J. Zhang, Y. Yan, Speech emotion recognition using both spectral and prosodic features, in IEEE International Conference on Information Engineering and Computer Science (2009), pp. 1–4

    Google Scholar 

  75. A. Zhu, Q. Luo, study on speech emotion recognition system in E-learning, in International Conference on Human Computer Interaction, HCI Intelligent Multimodal Environments. Springer LNCS, vol. 4552 (2007), pp. 544–552

    Google Scholar 

  76. W.Q. Zheng, J.S. Yu, Y.X. Zou, An experimental study of speech emotion recognition based on deep convolutional neural networks, in IEEE International Conference on Affective Computing and Intelligent Interaction (2015), pp. 827–831

    Google Scholar 

  77. F. Pokorny, F. Graf, F. Pernkopf, B. Schuller, Detection of negative emotions in speech signals using bags-of-audio-words, in IEEE International Conference on Affective Computing and Intelligent Interaction (2015), pp. 879–884

    Google Scholar 

  78. L. Devillers, L. Vidrascu, Real-life emotion recognition in speech, in Speaker Classification II. LNCS Springer, vol. 4441 (2007), pp. 34–42

    Google Scholar 

  79. Z. Huang, M. Dong, Q. Mao, Y. Zhan, Speech emotion recognition using CNN, in 22nd ACM International Conference on Multimedia (2014) pp. 801–804

    Google Scholar 

  80. B. Schuller, A. Batliner, S. Steidl, D. Seppi, Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Commun. 53(9–10), 1062–1087 (2011)

    Article  Google Scholar 

  81. B. Schuller, D. Arsic, F. Wallhoff, G. Rigoll, Emotion recognition in the noise applying large acoustic feature sets, in Proceedings Speech Prosody (2006), pp. 276–289

    Google Scholar 

  82. A. Alvarez, I. Cearreta, J.M. Lopez, A. Arruti, E. Lazkano, B. Sierra, N. Garay, Feature subset selection based on evolutionary algorithms for automatic emotion recognition in spoken spanish and standard basque language, in International Conference on Text, Speech and Dialogue. LNCS Springer, vol. 4188 (2006), pp. 565–572

    Google Scholar 

  83. K. Han, D. Yu, I. Tashev, Speech emotion recognition using deep neural network and extreme learning machine, in Interspeech (2014), pp. 223–227

    Google Scholar 

  84. X. Zhou, J. Guo, R. Bie, Deep learning based affective model for speech emotion recognition, in IEEE Conference on Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People and Smart World Congress (2016), pp. 841–846

    Google Scholar 

  85. C. Busso, M. Bulut, S. Narayanan, Toward effective automatic recognition systems of emotion in speech, in Social Emotions in Nature and Artifact: Emotions in Human and Human-Computer Interaction (2013), pp. 110–127

    Google Scholar 

  86. M. Tahon, L. Devillers, Towards a small set of robust acoustic features for emotion recognition: challenges. IEEE/ACM Trans. Audio, Speech, Lang. Process. 24(1), 16–28 (2016)

    Article  Google Scholar 

  87. A. Mencattini, E. Martinelli, F. Ringeval, B. Schuller, C.D. Natale, Continuous estimation of emotions in speech by dynamic cooperative speaker models. IEEE Trans. Affect. Comput. 8(3), 314–327 (2017)

    Article  Google Scholar 

  88. S. Zhang, T. Huang, W. Gao, Speech emotion recognition using deep convolutional neural network and discriminant temporal pyramid matching. IEEE Trans. Multimed. 20(6), 1576–1590 (2018)

    Article  Google Scholar 

  89. M. Chen, X. He, J. Yang, H. Zhang, IEEE Signal Process. Lett. 25(10), 1440–1444 (2018)

    Article  Google Scholar 

  90. M. Egger, M. Ley, S. Hanke, Emotion recognition from physiological signal analysis: a review. Electron. Notes Theor. Comput. Sci. 343, 35–55 (2019)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abhinand Poosarala .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Poosarala, A., Jayashree, R. (2021). Survey of Transfer Learning and a Case Study of Emotion Recognition Using Inductive Approach. In: Sharma, N., Chakrabarti, A., Balas, V.E., Martinovic, J. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 1175. Springer, Singapore. https://doi.org/10.1007/978-981-15-5619-7_9

Download citation

Publish with us

Policies and ethics

Navigation