Search Page | SpringerLink

Hierarchical multiples self-attention mechanism for multi-modal analysis

Because of the massive multimedia in daily life, people perceive the world by concurrently processing and fusing multi-modalities with...

Wu Jun, Zhu Tianliang, ... Wang Chunzhi in Multimedia Systems

Article 22 July 2023

Multi-modal fusion learning through biosignal, audio, and visual content for detection of mental stress

Mental stress is a significant risk factor for several maladies and can negatively impact a person’s quality of life, including their work and...

Gulin Dogan, Fatma Patlar Akbulut in Neural Computing and Applications

Article 03 October 2023

TMIF: transformer-based multi-modal interactive fusion for automatic rumor detection

The rapid development of social media platforms has made them one of the most important news sources. While it provides people with convenient...

Jiandong Lv, Xingang Wang, Cuiling Shao in Multimedia Systems

Article 31 March 2022

Video Rumor Classification Based on Multi-modal Theme and Keyframe Fusion

In recent years, short video platforms have become the main source of online rumors. According to the statistics of Shanghai online rumor refutation...

Jinpeng You, Yanghao Lin, ... Donglin Cao in Computer Supported Cooperative Work and Social Computing

Conference paper 2023

Multi-Modal Generative DeepFake Detection via Visual-Language Pretraining with Gate Fusion for Cognitive Computation

With the widespread adoption of deep learning, there has been a notable increase in the prevalence of multimodal deepfake content. These deepfakes...

Guisheng Zhang, Mingliang Gao, ... Gwanggil Jeon in Cognitive Computation

Article 25 June 2024

Dictionary-Induced Manifold Learning for Incomplete Multi-modal Fusion

Data missing is a common problem in multi-modal fusion, and existing incomplete multi-modal methods usually only consider the case of two modalities...

Bingliang Xu, Haizhou Ye, ... Qi Zhu in Web and Big Data

Conference paper 2023

Label graph learning for multi-label image recognition with cross-modal fusion

It has become popular to learn the correlation between labels in most existing multi-label image recognition tasks. Existing approaches begin to...

Yanzhao Xie, Yangtao Wang, ... Ke Zhou in Multimedia Tools and Applications

Article 23 March 2022

Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval

Video-text cross-modal retrieval is significant to computer vision. Most of existing works focus on exploring the global similarity between...

Xiaotao Cui, Jing Xiao, ... Jia Zhu in Multimedia Tools and Applications

Article 30 May 2022

MutualFormer: Multi-modal Representation Learning via Cross-Diffusion Attention

Aggregating multi-modal data to obtain reliable data representation attracts more and more attention. Recent studies demonstrate that Transformer...

Xixi Wang, Xiao Wang, ... Bin Luo in International Journal of Computer Vision

Article 24 April 2024

EnCoSum: enhanced semantic features for multi-scale multi-modal source code summarization

Code summarization aims to generate concise natural language descriptions for a piece of code, which can help developers comprehend the source code....

Yuexiu Gao, Hongyu Zhang, Chen Lyu in Empirical Software Engineering

Article 19 September 2023

MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection

In this paper, we propose a novel and effective Multi-Level Fusion network, named as MLF-DET, for high-performance cross-modal 3D object DETection,...

Zewei Lin, Yanqing Shen, ... Nanning Zheng in Artificial Neural Networks and Machine Learning – ICANN 2023

Conference paper 2023

3D reconstruction-oriented fully automatic multi-modal tumor segmentation by dual attention-guided VNet

Existing automatic contouring methods for primary nasopharyngeal carcinoma (NPC) and metastatic lymph nodes (MLNs) may suffer from low segmentation...

Dongdong Meng, Sheng Li, ... Xueqing Yan in The Visual Computer

Article 13 July 2023

Deep adversarial multi-label cross-modal hashing algorithm

In recent years, more and more researchers employ the hashing algorithm to improve the large-scale cross-modal retrieval efficiency by mapping the...

Xiaohan Yang, Zhen Wang, ... Nannan Wu in International Journal of Multimedia Information Retrieval

Article 29 July 2023

CMC-MMR: multi-modal recommendation model with cross-modal correction

Multi-modal recommendation using multi-modal features (e.g., image and text features) has received significant attention and has been shown to have...

YuBin Wang, HongBin Xia, Yuan Liu in Journal of Intelligent Information Systems

Article 20 February 2024

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection

Multi-modal 3D object detection has been an active research topic in autonomous driving. Nevertheless, it is non-trivial to explore the cross-modal...

Xin Li, Botian Shi, ... Liang He in Computer Vision – ECCV 2022

Conference paper 2022

Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering

Video Question Answering (VideoQA) is a challenging task that requires the model to understand the complex nature of video data and the variety of...

Chenyang Lyu, Wenxi Li, ... Cathal Gurrin in Artificial Neural Networks and Machine Learning – ICANN 2023

Conference paper 2023

The effectiveness of children’s English enlightenment network teaching based on multi-modal teaching model

To enhance the efficacy of traditional English enlightenment education for children, this research delves into a multi-modal teaching approach and...

Lan Zhang in Service Oriented Computing and Applications

Article 16 May 2024

Multi-Modal 3D Object Detection in Autonomous Driving: A Survey

The past decade has witnessed the rapid development of autonomous driving systems. However, it remains a daunting task to achieve full autonomy,...

Yingjie Wang, Qiuyu Mao, ... Yanyong Zhang in International Journal of Computer Vision

Article 17 May 2023

Multi-modal Depression Estimation Based on Sub-attentional Fusion

Failure to timely diagnose and effectively treat depression leads to over 280 million people suffering from this psychological disorder worldwide....

Ping-Cheng Wei, Kunyu Peng, ... Rainer Stiefelhagen in Computer Vision – ECCV 2022 Workshops

Conference paper 2023

Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer

Multi-modal medical image fusion (MMIF) has found wide application in the field of disease diagnosis and surgical guidance. Despite the popularity of...

Quan Zhou, Shaozhuang Ye, ... Xuming Zhang in Neural Computing and Applications

Article 29 July 2022
