Search Results - Springer

Chapter and Conference Paper

Abstracts Embeddings Evaluation: A Case Study of Artificial Intelligence and Medical Imaging for the COVID-19 Infection

During the COVID-19 pandemic, a huge amount of literature was produced covering different aspects of infection. The use of artificial intelligence (AI) in medical imaging has been shown to improve screening, d...

Giovanni Zurlo… in Image Analysis and Processing - ICIAP 2023 Workshops (2024)

Chapter and Conference Paper

Video-Based Emotion Estimation Using Deep Neural Networks: A Comparative Study

In this study we investigate the effectiveness of deep neural networks in predicting valence and arousal solely from visual information of video sequences. Several recent Convolutional Neural Network (CNN) and...

Leonardo Alchieri, Luigi Celona… in Image Analysis and Processing - ICIAP 2023… (2024)

Chapter and Conference Paper

On-Device Learning with Binary Neural Networks

Existing Continual Learning (CL) solutions only partially address the constraints on power, memory and computation of the deep learning models when deployed on low-power embedded CPUs. In this paper, we propos...

Lorenzo Vorabbi, Davide Maltoni… in Image Analysis and Processing - ICIAP 2023… (2024)

Chapter and Conference Paper

Multi-level Patch Transformer for Style Transfer with Single Reference Image

Despite the recent success of image style transfer with Generative Adversarial Networks (GANs), this task remains challenging due to the requirements of large volumes of style image data. In this work, we pres...

Yue He, Lan Chen, Yu-Jie Yuan, Shu-Yu Chen, Lin Gao in Computational Visual Media (2024)

Chapter and Conference Paper

Correlation Analysis Between Insomnia Severity and Depressive Symptoms of College Students Based on Pseudo-Siamese Network

To explore the correlation between emotional mood and sleep quality in a college student population, we propose a new method based on pseudo-siamese network, which can quickly diagnose the causes of depression...

Ya-fei Wang, Yan-ling Zhu, Peng Wu, Meng Liu… in Advanced Computational Intelligence and In… (2024)

Chapter and Conference Paper

Ookpik- A Collection of Out-of-Context Image-Caption Pairs

The development of AI-based Cheapfakes detection models has been hindered by a significant challenge - the scarcity of real-world datasets. Our work directly tackles this issue by focusing on out-of-context (O...

Kha-Luan Pham, Minh-Khoi Nguyen-Nhat, Anh-Huy Dinh, Quang-Tri Le… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Streaming Graph-Based Supervoxel Computation Based on Dynamic Iterative Spanning Forest

Streaming video segmentation decreases processing time by creating supervoxels taking into account small parts of the video instead of using all video content. Thanks to the good performance of the Iterative S...

Danielle Vieira, Isabela Borlido Barcelos… in Progress in Pattern Recognition, Image Ana… (2024)

Chapter and Conference Paper

Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation

License plate detection is a critical component of license plate recognition systems. A challenge in this domain is detecting small license plates captured at a considerable distance. Previous researchers have...

Songkang Dai, Song-Lu Chen, Qi Liu, Chao Zhu, Yan Liu, Feng Chen… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Lightweight Image Captioning Model Based on Knowledge Distillation

The performance of image captioning models based on deep learning has been significantly improved compared with traditional algorithms. However, due to the complex network structure and huge parameters, these ...

Zhenlei Cui, Zhenhua Tang, Jianze Li, Kai Chen in MultiMedia Modeling (2024)

Chapter and Conference Paper

Deformable CNN with Position Encoding for Arbitrary-Scale Super-Resolution

Implicit neural representation (INR) has been widely used to learn continuous representation of images, as it enables arbitrary-scale super-resolution (SR). However, most existing INR-based arbitrary-scale SR ...

Yuanbin Ding, Kehan Zhu, ** Wei, Yu Lin, Ruxin Wang in Computational Visual Media (2024)

Chapter and Conference Paper

Analysis and Impact of Training Set Size in Cross-Subject Human Activity Recognition

The ubiquity of consumer devices with sensing and computational capabilities, such as smartphones and smartwatches, has increased interest in their use in human activity recognition for healthcare monitoring a...

Miguel Matey-Sanz, Joaquín Torres-Sospedra… in Progress in Pattern Recognition, Image Ana… (2024)

Chapter and Conference Paper

Self-supervised Monocular Depth Estimation on Unseen Synthetic Cameras

Monocular depth estimation is a critical task in computer vision, and self-supervised deep learning methods have achieved remarkable results in recent years. However, these models often struggle on camera gene...

Cecilia Diana-Albelda… in Progress in Pattern Recognition, Image Ana… (2024)

Chapter and Conference Paper

A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection

Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit weather-dependence but offer intricate information, whereas ...

**aoting Li, Shouhong Wan, Hantao Zhang, Peiquan ** in MultiMedia Modeling (2024)

Chapter and Conference Paper

Multi-task Collaborative Network for Image-Text Retrieval

Image-text retrieval aims to capture semantic relevance between images and texts. Most existing approaches rely solely on the image-text pairs to learn visual-semantic representation through fine-grained align...

Xueyang Qin, Lishuang Li, **g Hao, Meiling Ge, Jiayi Huang… in MultiMedia Modeling (2024)

Chapter and Conference Paper

Semantic Importance-Based Deep Image Compression Using a Generative Approach

Semantic image compression can greatly reduce the amount of transmitted data by representing and reconstructing images using semantic information. Considering the fact that objects in an image are not equally ...

** Gu, Yuanyuan Xu, Kun Zhu in MultiMedia Modeling (2024)

Chapter and Conference Paper

MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval

Image-text retrieval is a fundamental and crucial task in the field of multimodal interaction, which assists internet users in retrieving the required visual and textual information conveniently. The dominant ...

Jiangfeng Li, Bowen Wang, Yongrui Qin, Chenxi Zhang, Gang Yu… in MultiMedia Modeling (2024)

Chapter and Conference Paper

A Systematic Review of Multimodal Deep Learning Approaches for COVID-19 Diagnosis

During and after the years of the COVID-19 pandemic, researchers and domain experts put all their effort into the discovery of accurate and reliable techniques for the detection and diagnosis of this disease i...

Salvatore Capuozzo, Carlo Sansone in Image Analysis and Processing - ICIAP 2023 Workshops (2024)

Chapter and Conference Paper

Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization

Recent studies have witnessed that many self-supervised methods obtain clear progress on the multi-view stereo (MVS). However, existing methods ignore the edge structure information of the reconstructed target...

Pan Li, Su** Wu, **tie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang in MultiMedia Modeling (2024)

Chapter and Conference Paper

Lightweight Rolling Shutter Image Restoration Network Based on Undistorted Flow

Rolling shutter(RS) cameras are widely used in fields such as drone photography and robot navigation. However, when shooting a fast-moving target, the captured image may be distorted and blurred due to the fea...

Binfeng Wang, Yunhao Zou, Zhijie Gao, Ying Fu in Artificial Intelligence (2024)

Chapter and Conference Paper

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

Large language models (LLMs) have shown remarkable performance in natural language processing (NLP) tasks. To comprehend and execute diverse human instructions over image data, instruction-tuned large vision-l...

Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim in MultiMedia Modeling (2024)

63,265 Result(s)

Abstracts Embeddings Evaluation: A Case Study of Artificial Intelligence and Medical Imaging for the COVID-19 Infection

Video-Based Emotion Estimation Using Deep Neural Networks: A Comparative Study

On-Device Learning with Binary Neural Networks

Multi-level Patch Transformer for Style Transfer with Single Reference Image

Correlation Analysis Between Insomnia Severity and Depressive Symptoms of College Students Based on Pseudo-Siamese Network

Ookpik- A Collection of Out-of-Context Image-Caption Pairs

Streaming Graph-Based Supervoxel Computation Based on Dynamic Iterative Spanning Forest

Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation

Lightweight Image Captioning Model Based on Knowledge Distillation

Deformable CNN with Position Encoding for Arbitrary-Scale Super-Resolution

Analysis and Impact of Training Set Size in Cross-Subject Human Activity Recognition

Self-supervised Monocular Depth Estimation on Unseen Synthetic Cameras

A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection

Multi-task Collaborative Network for Image-Text Retrieval

Semantic Importance-Based Deep Image Compression Using a Generative Approach

MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval

A Systematic Review of Multimodal Deep Learning Approaches for COVID-19 Diagnosis

Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization

Lightweight Rolling Shutter Image Restoration Network Based on Undistorted Flow

Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

Our Content

Other Sites

Help & Contacts