Search Results - Springer

Chapter and Conference Paper

Multi-level Patch Transformer for Style Transfer with Single Reference Image

Despite the recent success of image style transfer with Generative Adversarial Networks (GANs), this task remains challenging due to the requirements of large volumes of style image data. In this work, we pres...

Yue He, Lan Chen, Yu-Jie Yuan, Shu-Yu Chen, Lin Gao in Computational Visual Media (2024)

Chapter and Conference Paper

Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation

License plate detection is a critical component of license plate recognition systems. A challenge in this domain is detecting small license plates captured at a considerable distance. Previous researchers have...

Songkang Dai, Song-Lu Chen, Qi Liu, Chao Zhu, Yan Liu, Feng Chen… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning

Deepfake technology has raised concerns about the authenticity of digital content, necessitating the development of effective detection methods. However, the widespread availability of deepfakes has given rise...

Sarwar Khan, Jun-Cheng Chen, Wen-Hung Liao, Chu-Song Chen in MultiMedia Modeling (2024)

Chapter and Conference Paper

Gait Recognition Based on Temporal Gait Information Enhancing

Gait recognition is a long range biometric technology that identifies individuals by their walking patterns. Currently, gait recognition primarily extracts gait features using convolutional neural networks, wh...

Qizhen Chen, **n Chen, **aoling Deng, Yubin Lan in MultiMedia Modeling (2024)

Chapter and Conference Paper

MARS: An Instance-Aware, Modular and Realistic Simulator for Autonomous Driving

Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving remaining corner cases by simulating them. To t...

Zirui Wu, Tianyu Liu, Liyi Luo, Zhide Zhong, Jianteng Chen… in Artificial Intelligence (2024)

Chapter and Conference Paper

A Purified Stacking Ensemble Framework for Cytology Classification

Cancer is one of the fatal threats to human beings. However, early detection and diagnosis can significantly reduce death risk, in which cytology classification is indispensable. Researchers have proposed many...

Linyi Qian, Qian Huang, Yulin Chen, Junzhou Chen in MultiMedia Modeling (2024)

Chapter and Conference Paper

Irregular License Plate Recognition via Global Information Integration

Irregular license plate recognition remains challenging due to the irregular layouts of characters, such as multi-line and perspective-distorted layouts. Many previous methods are based on different attention ...

Yuan-Yuan Liu, Qi Liu, Song-Lu Chen, Feng Chen, Xu-Cheng Yin in MultiMedia Modeling (2024)

Chapter and Conference Paper

TNT-Net: Point Cloud Completion by Transformer in Transformer

Estimating the overall structure of a point cloud from a partial 3D point cloud input is a crucial task in computer vision. However, existing point cloud completion methods often overlook object detail informa...

**aohai Zhang, **ming Zhang, Jianliang Li, Ming Chen in MultiMedia Modeling (2024)

Chapter and Conference Paper

Weakly Supervised Optical Remote Sensing Salient Object Detection Based on Adaptive Discriminative Region Suppression

Salient object detection in optical remote sensing images aims to detect attractive objects from optical remote sensing images, providing important prior information for many remote sensing tasks, which have r...

**ngyu Li, Jieyu Wu, Yuan Zhou, **gwei Yuan, Yanwen Chen in Artificial Intelligence (2024)

Chapter and Conference Paper

Fast Hierarchical Depth Super-Resolution via Guided Attention

Depth maps captured by mainstream depth sensors are still of low resolution compared with color images. The main difficulties in depth super-resolution lie in the recovery of tiny textures from severely unders...

Yusen Hou, Changyi Chen, Gaosheng Liu, Huan**g Yue, Kun Li… in Artificial Intelligence (2024)

Chapter and Conference Paper

C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds

This study presents a novel approach to diagnosing the highly contagious COVID-19 respiratory disease. Traditional diagnosis methods, such as polymerase chain reaction (PCR) and rapid antigen test (RAT), have ...

**angyu Chen, Md Ayshik Rahman Khan, Md Rakibul Hasan, Tom Gedeon… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Weakly-Supervised Grounding for VQA with Dual Visual-Linguistic Interaction

Visual question answer (VQA) grounding, aimed at locating the visual evidence associated with the answers while answering questions, has attracted increasing research interest. To locate the evidence, most exi...

Yi Liu, Junwen Pan, Qilong Wang, Guanlin Chen, Weiguo Nie… in Artificial Intelligence (2024)

Chapter and Conference Paper

Semantic Transition Detection for Self-supervised Video Scene Segmentation

Video scene segmentation is a crucial task in temporally parsing long-form videos into basic story units. Most advanced self-supervised methods of video scene segmentation focus heavily on learning video shot ...

Lu Chen, Jiawei Tan, **an Yang, Hongxing Wang in MultiMedia Modeling (2024)

Chapter and Conference Paper

Equivariant Indoor Illumination Map Estimation from a Single Image

Thanks to the recent development of inverse rendering, photorealistic re-synthesis of indoor scenes have brought augmented reality closer to reality. All-angle environment illumination map estimation of arbitr...

Yusen Ai, **aoxue Chen, **n Wu, Hao Zhao in Artificial Intelligence (2024)

Chapter and Conference Paper

Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification

RGB-Infrared person Re-Identification (RGB-IR ReID) is a challenging matching task that retrieves a RGB/infrared pedestrian image from the existing infrared/RGB set captured by non-overlap** visible or infra...

Qiang Chen, Fuxiao He, Guoqiang **ao in MultiMedia Modeling (2024)

Chapter and Conference Paper

Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval

Although many current cross-domain image retrieval researches have made good progress, most of the works is targeted at specific domains. At the same time, we also noticed that many works are based on manually...

Mingyuan Ge, Jianan Shui, Junyu Chen, Mingyong Li in MultiMedia Modeling (2024)

Chapter and Conference Paper

STU3: Multi-organ CT Medical Image Segmentation Model Based on Transformer and UNet

With the popularity of artificial intelligence applications in the medical field, U-shaped convolutional neural network (CNN) has garnered significant attention for their efficacy in medical image analysis tas...

Wen** Zheng, Bo Li, Wanyi Chen in Artificial Intelligence (2024)

Chapter and Conference Paper

Lightweight Image Captioning Model Based on Knowledge Distillation

The performance of image captioning models based on deep learning has been significantly improved compared with traditional algorithms. However, due to the complex network structure and huge parameters, these ...

Zhenlei Cui, Zhenhua Tang, Jianze Li, Kai Chen in MultiMedia Modeling (2024)

Chapter and Conference Paper

AST: An Attention-Guided Segment Transformer for Drone-Based Cross-View Geo-Localization

To tackle the problem of drone-based cross-view geo-localization, we address how to match drone-view images and satellite-view images, which is extremely challenging due to the variability of view angles and v...

Zichuan Zhao, Tianhang Tang, Jie Chen, Xuelei Shi… in Computational Visual Media (2024)

Chapter and Conference Paper

CESegNet:Context-Enhancement Semantic Segmentation Network Based on Transformer

CNN-based methods have achieved success in semantic segmentation. However, research on improving network robustness in this domain has been limited. Similarly, transformer and its variants have recently shown ...

Xu Chen, Zhibin Zhang in MultiMedia Modeling (2024)

2,915 Result(s)

Multi-level Patch Transformer for Style Transfer with Single Reference Image

Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation

Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning

Gait Recognition Based on Temporal Gait Information Enhancing

MARS: An Instance-Aware, Modular and Realistic Simulator for Autonomous Driving

A Purified Stacking Ensemble Framework for Cytology Classification

Irregular License Plate Recognition via Global Information Integration

TNT-Net: Point Cloud Completion by Transformer in Transformer

Weakly Supervised Optical Remote Sensing Salient Object Detection Based on Adaptive Discriminative Region Suppression

Fast Hierarchical Depth Super-Resolution via Guided Attention

C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds

Weakly-Supervised Grounding for VQA with Dual Visual-Linguistic Interaction

Semantic Transition Detection for Self-supervised Video Scene Segmentation

Equivariant Indoor Illumination Map Estimation from a Single Image

Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification

Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval

STU3: Multi-organ CT Medical Image Segmentation Model Based on Transformer and UNet

Lightweight Image Captioning Model Based on Knowledge Distillation

AST: An Attention-Guided Segment Transformer for Drone-Based Cross-View Geo-Localization

CESegNet:Context-Enhancement Semantic Segmentation Network Based on Transformer

Our Content

Other Sites

Help & Contacts