Search
Search Results
-
Are Business Expectations Aligned with the Development Plan Made by the Software Architecture Area? A Case Study on Agile Teams in a Large Company
In the current scenario of digital transformation, understanding the interaction between the areas of business and software architecture is... -
An Integrated System for Spatio-temporal Summarization of 360-Degrees Videos
In this work, we present an integrated system for spatio-temporal summarization of 360-degrees videos. The video summary production involves the... -
VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024
In this paper, we introduce the fifth release of VISIONE, an advanced video retrieval system offering diverse search functionalities. The user can... -
Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search
An important open problem in video retrieval and exploration concerns the generation and refinement of queries for complex tasks that standard... -
Using Saliency and Crop** to Improve Video Memorability
Video memorability is a measure of how likely a particular video is to be remembered by a viewer when that viewer has no emotional connection with... -
A Lightweight Local Attention Network for Image Super-Resolution
For many years, deep neural networks have been used for Single Image Super-resolution (SISR) tasks. However, more extensive networks require higher... -
Cross-Modal Semantic Alignment Learning for Text-Based Person Search
Text-based person search aims to retrieve pedestrian images corresponding to a specific identity based on a textual description. Existing methods... -
CLF-Net: A Few-Shot Cross-Language Font Generation Method
Designing a font library takes a lot of time and effort. Few-shot font generation aims to generate a new font library by referring to only a few... -
A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection
Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit... -
Find the Cliffhanger: Multi-modal Trailerness in Soap Operas
Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and... -
MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction
Speaker extraction to separate the target speech from the mixed audio is a problem worth studying in the speech separation field. Since human... -
SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation
Accurate segmentation of brain tumors is crucial for cancer diagnosis, treatment planning, and evaluation. However, semi-supervised brain tumor image... -
A Language-Based Solution to Enable Metaverse Retrieval
Recently, the Metaverse is becoming increasingly attractive, with millions of users accessing the many available virtual worlds. However, how do... -
Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis
Multimodal sentiment analysis (MSA) is dedicated to deciphering human emotions in videos. It is a challenging task due to the semantic disparities... -
Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos
Localization of neurovascular bundles or vessels is critical in endoscopic surgery. It still remains challenging to identify neurovascular bundles... -
A Review of Image and Point Cloud Fusion in Autonomous Driving
In the task of autonomous driving perception scenarios, multi-sensor fusion is gradually becoming the current mainstream trend. At this stage,... -
Cross-Modal Hashing
Cross-modal retrieval [1] aims to retrieve semantically relevant items in other modalities given queries from a specific modality. To support... -
Systeme für skalierbares Datenmanagement
Unabhängig von der serverseitigen Architektur ist das skalierbare Datenmanagement die primäre Herausforderung für hohe Leistung. Geschäfts- und... -
JARAD: An Approach for Java API Mention Recognition and Disambiguation in Stack Overflow
Invoking APIs is a common way to improve the efficiency of software development. Developers often discuss various problems encountered or share the... -
Resource Cooperative Scheduling Optimization Considering Security in Edge Mobile Networks
With the rapid development of technologies such as the Internet of Things and artificial intelligence, the contradiction between limited user...