Search Page | SpringerLink

DVC-Net: a new dual-view context-aware network for emotion recognition in the wild

Emotion recognition in the wild (ERW) is a challenging task due to unknown and the unconstrained scenes in the wild environment. Different from...

Linbo Qing, Hongqian Wen, ... Yonghong Peng in Neural Computing and Applications

Article 04 October 2023

Deep feature voting: a semantic-driven and local context-aware approach for image classification

In the context of addressing new image classification tasks with insufficient training samples via pre-trained deep learning models, the methods...

Ye Xu, Lihua Duan, ... Chongpeng Huang in Multimedia Tools and Applications

Article 23 December 2023

Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration

Image-based virtual try-on technology provides a better shop** experience for online customers and holds immense commercial value. However,...

Shuo Tong, Han Liu, ... Ding Liu in The Visual Computer

Article 11 May 2024

Dual-Stream Context-Aware Neural Network for Survival Prediction from Whole Slide Images

Whole slide images (WSI) encompass a wealth of information about the tumor micro-environment, which holds prognostic value for patients’ survival....

Junxiu Gao, Shan **, ... Hongming Xu in Pattern Recognition and Computer Vision

Conference paper 2024

Evaluating Word Embedding Feature Extraction Techniques for Host-Based Intrusion Detection Systems

Research into Intrusion and Anomaly Detectors at the Host level typically pays much attention to extracting attributes from system call traces. These...

Paul K. Mvula, Paula Branco, ... Herna L. Viktor in Discover Data

Article Open access 30 March 2023

A Context Aware Lung Cancer Survival Prediction Network by Using Whole Slide Images

Lung cancer has caused enormous harm to human life and traditional whole slide image (WSI) based lung cancer survival prediction methods suffer from...

**nyu Liu, Yicheng Wang, Ye Luo in Neural Information Processing

Conference paper 2024

Stacked cross-modal feature consolidation attention networks for image captioning

The attention-enriched encoder-decoder framework has recently aroused great interest in image captioning due to its overwhelming progress. Many...

Mozhgan Pourkeshavarz, Shahabedin Nabavi, ... Mehrnoush Shamsfard in Multimedia Tools and Applications

Article 23 June 2023

A Dynamic Feature Interaction Framework for Multi-task Visual Perception

Multi-task visual perception has a wide range of applications in scene understanding such as autonomous driving. In this work, we devise an efficient...

Yuling **, Hao Chen, ... Yifan Liu in International Journal of Computer Vision

Article 09 July 2023

Visualizations for universal deep-feature representations: survey and taxonomy

In data science and content-based retrieval, we find many domain-specific techniques that employ a data processing pipeline with two fundamental...

Tomáš Skopal, Ladislav Peška, ... David Bernhauer in Knowledge and Information Systems

Article Open access 16 September 2023

ADOSMNet: a novel visual affordance detection network with object shape mask guided feature encoders

Visual affordance detection aims to understand the functional attributes of objects, which is crucial for robots to achieve interactive tasks. Most...

Dongpan Chen, Dehui Kong, ... Baocai Yin in Multimedia Tools and Applications

Article 18 September 2023

EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection

Good proposal initials are critical for 3D object detection applications. However, due to the significant geometry variation of indoor scenes,...

Yao Duan, Renjiao Yi, ... Chenyang Zhu in Computational Visual Media

Article Open access 03 August 2023

Multimodal emotion recognition model via hybrid model with improved feature level fusion on facial and EEG feature set

In recent years, academics have placed a high value on multi-modal emotion identification, as well as extensive research has been conducted in the...

Pratima Singh, Mukesh Kumar Tripathi, ... Madugundu Neelakantappa in Multimedia Tools and Applications

Article 26 April 2024

Enhanced feature pyramid for multi-view stereo with adaptive correlation cost volume

Abstract

Multi-level features are commonly employed in the cascade network, which is currently the dominant framework in multi-view stereo (MVS)....

Ming Han, Hui Yin, ... Qianqian Du in Applied Intelligence

Article 15 June 2024

Deep neural networks for explainable feature extraction in orchid identification

Automated image-based plant identification systems are black-boxes, failing to provide an explanation of a classification. Such explanations are seen...

Diah Harnoni Apriyanti, Luuk J. Spreeuwers, Peter J.F. Lucas in Applied Intelligence

Article Open access 21 August 2023

Customizing the feature modulation for visual tracking

In visual tracking, the target always undergoes appearance variations due to a variety of challenging situations, such as deformation and rotation....

Yu** Zhang, Zepeng Yang, ... Fusheng ** in The Visual Computer

Article 29 December 2023

Training a Multi-task Model for Classification and Grasp Detection of Surgical Tools Using Transfer Learning

This paper proposes a multi-task model for the classification and grasp detection of surgical tools so that the tasks such as handing, collection...

Vijay Bhaskar Semwal, Yogesh Kumar Prajapat, Rahul Jain in SN Computer Science

Article 31 July 2023

Region Feature Disentanglement for Domain Adaptive Object Detection

In recent years, deep learning based object detection has shown impressive results. However, applying an object detector learned from one data domain...

Rui Wang, Shouhong Wan, Peiquan ** in Artificial Neural Networks and Machine Learning – ICANN 2023

Conference paper 2023

Few-shot defect detection using feature enhancement and image generation for manufacturing quality inspection

Visual defect detection, which is pivotal in industrial quality control, often requires extensive datasets for training deep-learning models....

Yu Gong, Mingzhou Liu, ... **g Hu in Applied Intelligence

Article 12 December 2023

An efficient multi-scale contextual feature fusion network for counting crowds with varying densities and scales

The crowd counting problem aims to predict the number of pedestrians in a surveillance video or an image and produce a crowd density map. Achieving...

Liyan **ong, Hu Yi, ... Weichun Huang in Multimedia Tools and Applications

Article 26 September 2022

Less Is More: Similarity Models for Content-Based Video Retrieval

The concept of object-to-object similarity plays a crucial role in interactive content-based video retrieval tools. Similarity (or distance) models...

Patrik Veselý, Ladislav Peška in MultiMedia Modeling

Conference paper 2023

Search

Filters

Search Results

Search

Navigation