Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge

Sun, Haiyang; Lian, Zheng; Liu, Bin; Tao, Jianhua; Sun, Licai; Cai, Cong; He, Yu

doi:10.1007/978-3-031-25075-0_13

Haiyang Sun^10,11,
Zheng Lian¹⁰,
Bin Liu¹⁰,
Jianhua Tao^10,11,12,
Licai Sun^10,11,
Cong Cai^10,11 &
…
Yu He^10,11

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13806))

Included in the following conference series:

European Conference on Computer Vision

1417 Accesses
2 Citations

Abstract

The task of ABAW is to predict frame-level emotion descriptors from videos: discrete emotional state; valence and arousal; and action units. In this paper, we propose the solution to the Multi-Task Learning (MTL) Challenge of the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. Although researchers have proposed several approaches and achieved promising results in ABAW, current works in this task rarely consider interactions between different emotion descriptors. To this end, we propose a novel end to end architecture to achieve full integration of different types of information. Experimental results demonstrate the effectiveness of our proposed solution. Code are available at https://github.com/Swiftsss/ECCV2022MTL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Multimodal Dimensional and Continuous Emotion Recognition in Dyadic Video Interactions

Hybrid multi-modal emotion recognition framework based on InceptionV3DenseNet

Article 27 March 2023

HEU Emotion: a large-scale database for multimodal emotion recognition in the wild

Article 04 January 2021

References

Deng, D.: Multiple emotion descriptors estimation at the ABAW3 challenge. CoRR abs/2203.12845 (2022). https://doi.org/10.48550/ar**v.2203.12845
Ekman, P., Friesen, W.V.: The repertoire of nonverbal behavior: categories, origins, usage, and coding. Semiotica 1(1), 49–98 (1969)
Article Google Scholar
Jacob, G.M., Stenger, B.: Facial action unit detection with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7680–7689 (2021)
Google Scholar
Kollias, D.: ABAW: learning from synthetic data & multi-task learning challenges. ar**v preprint ar**v:2207.01138 (2022)
Kollias, D.: ABAW: valence-arousal estimation, expression recognition, action unit detection & multi-task learning challenges. CoRR abs/2202.10659 (2022). https://arxiv.org/abs/2202.10659
Kollias, D., Cheng, S., Pantic, M., Zafeiriou, S.: Photorealistic facial synthesis in the dimensional affect space. In: Leal-Taixé, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11130, pp. 475–491. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11012-3_36
Chapter Google Scholar
Kollias, D., Cheng, S., Ververas, E., Kotsia, I., Zafeiriou, S.: Deep neural network augmentation: generating faces for affect analysis. Int. J. Comput. Vis. 128(5), 1455–1484 (2020). https://doi.org/10.1007/s11263-020-01304-3
Article Google Scholar
Kollias, D., Nicolaou, M.A., Kotsia, I., Zhao, G., Zafeiriou, S.: Recognition of affect in the wild using deep neural networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 1972–1979 (2017). https://doi.org/10.1109/CVPRW.2017.247
Kollias, D., Sharmanska, V., Zafeiriou, S.: Distribution matching for heterogeneous multi-task learning: a large-scale face study. CoRR abs/2105.03790 (2021). https://arxiv.org/abs/2105.03790
Kollias, D., et al.: Deep affect prediction in-the-wild: aff-wild database and challenge, deep architectures, and beyond. Int. J. Comput. Vis. 127(6–7), 907–929 (2019). https://doi.org/10.1007/s11263-019-01158-4
Article Google Scholar
Kollias, D., Zafeiriou, S.: Expression, affect, action unit recognition: aff-wild2, multi-task learning and arcface. In: 30th British Machine Vision Conference 2019, BMVC 2019, Cardiff, UK, 9–12 September 2019, p. 297 (2019). https://bmvc2019.org/wp-content/uploads/papers/0399-paper.pdf
Kollias, D., Zafeiriou, S.: VA-StarGAN: continuous affect generation. In: Blanc-Talon, J., Delmas, P., Philips, W., Popescu, D., Scheunders, P. (eds.) ACIVS 2020. LNCS, vol. 12002, pp. 227–238. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-40605-9_20
Chapter Google Scholar
Kollias, D., Zafeiriou, S.: Affect analysis in-the-wild: valence-arousal, expressions, action units and a unified framework. CoRR abs/2103.15792 (2021). https://arxiv.org/abs/2103.15792
Zafeiriou, S., Kollias, D., Nicolaou, M.A., Papaioannou, A., Zhao, G., Kotsia, I.: Aff-wild: valence and arousal ‘in-the-wild’ challenge. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 1980–1987 (2017). https://doi.org/10.1109/CVPRW.2017.248

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (NSFC) (No. 61831022, No. U21B2010, No. 61901473, No. 62101553), Open Research Projects of Zhejiang Lab (NO. 2021KH0AB06).

Author information

Authors and Affiliations

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Bei**g, China
Haiyang Sun, Zheng Lian, Bin Liu, Jianhua Tao, Licai Sun, Cong Cai & Yu He
School of Artificial Intelligence, University of Chinese Academy of Sciences, Bei**g, China
Haiyang Sun, Jianhua Tao, Licai Sun, Cong Cai & Yu He
CAS Center for Excellence in Brain Science and Intelligence Technology, Bei**g, China
Jianhua Tao

Authors

Haiyang Sun
View author publications
You can also search for this author in PubMed Google Scholar
Zheng Lian
View author publications
You can also search for this author in PubMed Google Scholar
Bin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Tao
View author publications
You can also search for this author in PubMed Google Scholar
Licai Sun
View author publications
You can also search for this author in PubMed Google Scholar
Cong Cai
View author publications
You can also search for this author in PubMed Google Scholar
Yu He
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haiyang Sun .

Editor information

Editors and Affiliations

IBM Research - MIT-IBM Watson AI Lab, Massachusetts, USA
Leonid Karlinsky
Technion – Israel Institute of Technology, Haifa, Israel
Tomer Michaeli
Kyoto University, Kyoto, Japan
Ko Nishino

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, H. et al. (2023). Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds) Computer Vision – ECCV 2022 Workshops. ECCV 2022. Lecture Notes in Computer Science, vol 13806. Springer, Cham. https://doi.org/10.1007/978-3-031-25075-0_13

Download citation

DOI: https://doi.org/10.1007/978-3-031-25075-0_13
Published: 19 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25074-3
Online ISBN: 978-3-031-25075-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multimodal Dimensional and Continuous Emotion Recognition in Dyadic Video Interactions

Hybrid multi-modal emotion recognition framework based on InceptionV3DenseNet

HEU Emotion: a large-scale database for multimodal emotion recognition in the wild

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Two-Aspect Information Interaction Model for ABAW4 Multi-task Challenge

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multimodal Dimensional and Continuous Emotion Recognition in Dyadic Video Interactions

Hybrid multi-modal emotion recognition framework based on InceptionV3DenseNet

HEU Emotion: a large-scale database for multimodal emotion recognition in the wild

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation