Search Page | SpringerLink

CLF-Net: A Few-Shot Cross-Language Font Generation Method

Designing a font library takes a lot of time and effort. Few-shot font generation aims to generate a new font library by referring to only a few...

Qianqian **, Fazhi He, Wei Tang in MultiMedia Modeling

Conference paper 2024

A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection

Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit...

**aoting Li, Shouhong Wan, ... Peiquan ** in MultiMedia Modeling

Conference paper 2024

Find the Cliffhanger: Multi-modal Trailerness in Soap Operas

Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and...

Carlo Bretti, Pascal Mettes, ... Nanne van Noord in MultiMedia Modeling

Conference paper 2024

MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction

Speaker extraction to separate the target speech from the mixed audio is a problem worth studying in the speech separation field. Since human...

Shilong Yu, Chenhui Yang in MultiMedia Modeling

Conference paper 2024

SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation

Accurate segmentation of brain tumors is crucial for cancer diagnosis, treatment planning, and evaluation. However, semi-supervised brain tumor image...

**g Zhang, Wei Wu in MultiMedia Modeling

Conference paper 2024

A Language-Based Solution to Enable Metaverse Retrieval

Recently, the Metaverse is becoming increasingly attractive, with millions of users accessing the many available virtual worlds. However, how do...

Ali Abdari, Alex Falcon, Giuseppe Serra in MultiMedia Modeling

Conference paper 2024

Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis

Multimodal sentiment analysis (MSA) is dedicated to deciphering human emotions in videos. It is a challenging task due to the semantic disparities...

Kezhou Chen, Shuo Wang, Yanbin Hao in MultiMedia Modeling

Conference paper 2024

Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos

Localization of neurovascular bundles or vessels is critical in endoscopic surgery. It still remains challenging to identify neurovascular bundles...

Honglei Zheng, Wenkang Fan, ... **ongbiao Luo in MultiMedia Modeling

Conference paper 2024

Pattern Recognition Techniques in Image-Based Material Classification of Ancient Manuscripts

Classifying ancient manuscripts based on their writing surfaces often becomes essential for palaeographic research, including writer identification,...

Maruf A. Dhali, Thomas Reynolds, ... Lambert Schomaker in Pattern Recognition Applications and Methods

Conference paper 2024

Improving Person Re-identification Through Low-Light Image Enhancement

Person re-identification (ReID) is a popular area of research in the field of computer vision. Despite the significant advancements achieved in...

Oliverio J. Santana, Javier Lorenzo-Navarro, ... Modesto Castrillón-Santana in Pattern Recognition Applications and Methods

Conference paper 2024

Short Summary on Main Cenozoic Fossiliferous Localities and South American Land Mammal Ages (SALMAs)

This chapter includes a cursory, but critical analysis of the South American Land Mammal ages, their chronology, and brief comments about some...

Federico Agnolin in History of Cenozoic Mammals from South America

Chapter 2024

Dinosaur Footprints Throughout Mesozoic Basins in Brazil

The vertebrate ichnological data from the Brazilian Mesozoic basinsMesozoic basins represented by the fossil tracks are well-known, in...

Ismar de Souza Carvalho in Dinosaur Tracks of Mesozoic Basins in Brazil

Chapter 2024

RESET: Relational Similarity Extension for V3C1 Video Dataset

Effective content-based information retrieval (IR) is crucial across multimedia platforms, especially in the realm of videos. Whether navigating a...

Patrik Veselý, Ladislav Peška in MultiMedia Modeling

Conference paper 2024

PDTW150K: A Dataset for Patent Drawing Retrieval

We introduce a new large-scale patent dataset termed PDTW150K for patent drawing retrieval. The dataset contains more than 150,000 patents associated...

Chan-Ming Hsu, Tse-Hung Lin, ... Chih-Yi Chiu in MultiMedia Modeling

Conference paper 2024

MSAA-Net: Multi-Scale Attention Assembler Network Based on Multiple Instance Learning for Pathological Image Analysis

In this paper, we present a multi-scale attention assembler network (MSAA-Net) tailored for multi-scale pathological image analysis. The proposed...

Takeshi Yoshida, Kazuki Uehara, ... Masahiro Murakawa in Pattern Recognition Applications and Methods

Conference paper 2024

Splendid Isolation Revisited: The Entente Cordiale Model

Previous pages were the necessary introduction to understand the topics that are included in present chapter. Here the main pillars of a new model...

Federico Agnolin in History of Cenozoic Mammals from South America

Chapter 2024

Brief History of South American Biogeography

The early history of the study of vertebrate palaeobiogeography in South America is marked in the first decade of the twentieth century by two...

Federico Agnolin in History of Cenozoic Mammals from South America

Chapter 2024

Major Clades of South American Mammals

In this chapter, I shall provide a brief introduction to the main lineages that form part of the early history of South American mammals. Here, the...

Federico Agnolin in History of Cenozoic Mammals from South America

Chapter 2024

Tracking Dinosaurs During the Equatorial and South Atlantic Opening

The Mesozoic rift basinsRift basins of northeastern BrazilBrazil, particularly those at the edge of the Atlantic marginAtlantic margin, are small,...

Giuseppe Leonardi, Maria de Fátima C. F. dos Santos, Fernando Henrique de Souza Barbosa in Dinosaur Tracks of Mesozoic Basins in Brazil

Chapter 2024

WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio

Multi-modal deep learning techniques for matching free-form text with music have shown promising results in the field of Music Information Retrieval...

Benno Weck, Holger Kirchhoff, ... Xavier Serra in MultiMedia Modeling

Conference paper 2024

Search

Filters

Search Results

Search

Navigation