We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.

Search Results

Showing 141-160 of 10,000 results
  1. CLF-Net: A Few-Shot Cross-Language Font Generation Method

    Designing a font library takes a lot of time and effort. Few-shot font generation aims to generate a new font library by referring to only a few...
    Qianqian **, Fazhi He, Wei Tang in MultiMedia Modeling
    Conference paper 2024
  2. A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection

    Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit...
    **aoting Li, Shouhong Wan, ... Peiquan ** in MultiMedia Modeling
    Conference paper 2024
  3. Find the Cliffhanger: Multi-modal Trailerness in Soap Operas

    Creating a trailer requires carefully picking out and piecing together brief enticing moments out of a longer video, making it a challenging and...
    Carlo Bretti, Pascal Mettes, ... Nanne van Noord in MultiMedia Modeling
    Conference paper 2024
  4. MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction

    Speaker extraction to separate the target speech from the mixed audio is a problem worth studying in the speech separation field. Since human...
    Shilong Yu, Chenhui Yang in MultiMedia Modeling
    Conference paper 2024
  5. SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation

    Accurate segmentation of brain tumors is crucial for cancer diagnosis, treatment planning, and evaluation. However, semi-supervised brain tumor image...
    **g Zhang, Wei Wu in MultiMedia Modeling
    Conference paper 2024
  6. A Language-Based Solution to Enable Metaverse Retrieval

    Recently, the Metaverse is becoming increasingly attractive, with millions of users accessing the many available virtual worlds. However, how do...
    Ali Abdari, Alex Falcon, Giuseppe Serra in MultiMedia Modeling
    Conference paper 2024
  7. Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis

    Multimodal sentiment analysis (MSA) is dedicated to deciphering human emotions in videos. It is a challenging task due to the semantic disparities...
    Kezhou Chen, Shuo Wang, Yanbin Hao in MultiMedia Modeling
    Conference paper 2024
  8. Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos

    Localization of neurovascular bundles or vessels is critical in endoscopic surgery. It still remains challenging to identify neurovascular bundles...
    Honglei Zheng, Wenkang Fan, ... **ongbiao Luo in MultiMedia Modeling
    Conference paper 2024
  9. Pattern Recognition Techniques in Image-Based Material Classification of Ancient Manuscripts

    Classifying ancient manuscripts based on their writing surfaces often becomes essential for palaeographic research, including writer identification,...
    Maruf A. Dhali, Thomas Reynolds, ... Lambert Schomaker in Pattern Recognition Applications and Methods
    Conference paper 2024
  10. Improving Person Re-identification Through Low-Light Image Enhancement

    Person re-identification (ReID) is a popular area of research in the field of computer vision. Despite the significant advancements achieved in...
    Oliverio J. Santana, Javier Lorenzo-Navarro, ... Modesto Castrillón-Santana in Pattern Recognition Applications and Methods
    Conference paper 2024
  11. Short Summary on Main Cenozoic Fossiliferous Localities and South American Land Mammal Ages (SALMAs)

    This chapter includes a cursory, but critical analysis of the South American Land Mammal ages, their chronology, and brief comments about some...
    Chapter 2024
  12. Dinosaur Footprints Throughout Mesozoic Basins in Brazil

    The vertebrate ichnological data from the Brazilian Mesozoic basinsMesozoic basins represented by the fossil tracks are well-known, in...
    Chapter 2024
  13. RESET: Relational Similarity Extension for V3C1 Video Dataset

    Effective content-based information retrieval (IR) is crucial across multimedia platforms, especially in the realm of videos. Whether navigating a...
    Patrik Veselý, Ladislav Peška in MultiMedia Modeling
    Conference paper 2024
  14. PDTW150K: A Dataset for Patent Drawing Retrieval

    We introduce a new large-scale patent dataset termed PDTW150K for patent drawing retrieval. The dataset contains more than 150,000 patents associated...
    Chan-Ming Hsu, Tse-Hung Lin, ... Chih-Yi Chiu in MultiMedia Modeling
    Conference paper 2024
  15. MSAA-Net: Multi-Scale Attention Assembler Network Based on Multiple Instance Learning for Pathological Image Analysis

    In this paper, we present a multi-scale attention assembler network (MSAA-Net) tailored for multi-scale pathological image analysis. The proposed...
    Takeshi Yoshida, Kazuki Uehara, ... Masahiro Murakawa in Pattern Recognition Applications and Methods
    Conference paper 2024
  16. Splendid Isolation Revisited: The Entente Cordiale Model

    Previous pages were the necessary introduction to understand the topics that are included in present chapter. Here the main pillars of a new model...
    Chapter 2024
  17. Brief History of South American Biogeography

    The early history of the study of vertebrate palaeobiogeography in South America is marked in the first decade of the twentieth century by two...
    Chapter 2024
  18. Major Clades of South American Mammals

    In this chapter, I shall provide a brief introduction to the main lineages that form part of the early history of South American mammals. Here, the...
    Chapter 2024
  19. Tracking Dinosaurs During the Equatorial and South Atlantic Opening

    The Mesozoic rift basinsRift basins of northeastern BrazilBrazil, particularly those at the edge of the Atlantic marginAtlantic margin, are small,...
    Giuseppe Leonardi, Maria de Fátima C. F. dos Santos, Fernando Henrique de Souza Barbosa in Dinosaur Tracks of Mesozoic Basins in Brazil
    Chapter 2024
  20. WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio

    Multi-modal deep learning techniques for matching free-form text with music have shown promising results in the field of Music Information Retrieval...
    Benno Weck, Holger Kirchhoff, ... Xavier Serra in MultiMedia Modeling
    Conference paper 2024
Did you find what you were looking for? Share feedback.