Search Results
-
CLF-Net: A Few-Shot Cross-Language Font Generation Method
Designing a font library takes a lot of time and effort. Few-shot font generation aims to generate a new font library by referring to only a few... -
A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection
Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit... -
Find the Cliffhanger: Multi-modal Trailerness in Soap Operas
Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and... -
MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction
Speaker extraction to separate the target speech from the mixed audio is a problem worth studying in the speech separation field. Since human... -
SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation
Accurate segmentation of brain tumors is crucial for cancer diagnosis, treatment planning, and evaluation. However, semi-supervised brain tumor image... -
A Language-Based Solution to Enable Metaverse Retrieval
Recently, the Metaverse is becoming increasingly attractive, with millions of users accessing the many available virtual worlds. However, how do... -
Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis
Multimodal sentiment analysis (MSA) is dedicated to deciphering human emotions in videos. It is a challenging task due to the semantic disparities... -
Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos
Localization of neurovascular bundles or vessels is critical in endoscopic surgery. It still remains challenging to identify neurovascular bundles... -
Pattern Recognition Techniques in Image-Based Material Classification of Ancient Manuscripts
Classifying ancient manuscripts based on their writing surfaces often becomes essential for palaeographic research, including writer identification,... -
Improving Person Re-identification Through Low-Light Image Enhancement
Person re-identification (ReID) is a popular area of research in the field of computer vision. Despite the significant advancements achieved in... -
Short Summary on Main Cenozoic Fossiliferous Localities and South American Land Mammal Ages (SALMAs)
This chapter includes a cursory, but critical analysis of the South American Land Mammal ages, their chronology, and brief comments about some... -
Dinosaur Footprints Throughout Mesozoic Basins in Brazil
The vertebrate ichnological data from the Brazilian Mesozoic basins represented by the fossil tracks are well-known, in... -
RESET: Relational Similarity Extension for V3C1 Video Dataset
Effective content-based information retrieval (IR) is crucial across multimedia platforms, especially in the realm of videos. Whether navigating a... -
PDTW150K: A Dataset for Patent Drawing Retrieval
We introduce a new large-scale patent dataset termed PDTW150K for patent drawing retrieval. The dataset contains more than 150,000 patents associated... -
MSAA-Net: Multi-Scale Attention Assembler Network Based on Multiple Instance Learning for Pathological Image Analysis
In this paper, we present a multi-scale attention assembler network (MSAA-Net) tailored for multi-scale pathological image analysis. The proposed... -
Splendid Isolation Revisited: The Entente Cordiale Model
Previous pages were the necessary introduction to understand the topics that are included in present chapter. Here the main pillars of a new model... -
Brief History of South American Biogeography
The early history of the study of vertebrate palaeobiogeography in South America is marked in the first decade of the twentieth century by two... -
Major Clades of South American Mammals
In this chapter, I shall provide a brief introduction to the main lineages that form part of the early history of South American mammals. Here, the... -
Tracking Dinosaurs During the Equatorial and South Atlantic Opening
The Mesozoic rift basins of northeastern Brazil, particularly those at the edge of the Atlantic margin, are small,... -
WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio
Multi-modal deep learning techniques for matching free-form text with music have shown promising results in the field of Music Information Retrieval...