Search Page | SpringerLink

Multimodal Information Retrieval

In today’s rapidly evolving digital landscape, the wealth of available information has expanded beyond the boundaries of traditional text-based...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Retrieval Augmented Modeling

Till this point in our book, we have discussed the fundamental principles of information retrieval, exploring its key elements, and various...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Outlook

While multimodal information retrieval has several exciting applications and a high potential for impact on important problems, there are several...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Multimodal Content Generation

In this chapter, we will review the advances that are being made in this new field of multimodal content generation and also discuss several...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Transformer-Driven Models for Language, Vision, and Multimodality

In this chapter, we will learn about the modeling and learning techniques that drive multimodal applications. We will focus specifically on the...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Introduction

In this book, our emphasis is on multimodal information retrieval, specifically concentrating on text and image data. The traditional unimodal...

Man Luo, Tejas Gokhale, ... Chitta Baral in Advances in Multimodal Information Retrieval and Generation

Chapter 2025

Deep Learning-Based Solution for Intrusion Detection in the Internet of Things

Securing the Internet of Things-based environment is a top priority for consumers, businesses, and governments. There are billions of devices...

Akhil Chaurasia, Alok Mishra, ... Alok Kumar in Computational Intelligence and Network Systems

Conference paper 2024

Executive Summary

Recent advancements in machine learning, particularly in natural language processing, have been marked by the emergence of large models pretrained on...

Pepa Atanasova in Accountable and Explainable Methods for Complex Reasoning over Text

Chapter 2024

Sentence-Final Particle de in Mandarin as an Informativity Maximizer

In this study, we provide a new empirical generalization of the meaning contribution of the Mandarin sentence-final particle de from an information...

Jun Chen, Sean Papay in Selected Reflections in Language, Logic, and Information

Conference paper 2024

Formalizing Henkin-Style Completeness of an Axiomatic System for Propositional Logic

I formalize a Henkin-style completeness proof for an axiomatic system for propositional logic in the proof assistant Isabelle/HOL. The formalization...

Asta Halkjær From in Selected Reflections in Language, Logic, and Information

Conference paper 2024

Instructors’ Perceptions of an Information Literacy-Centered Professional Development Workshop

The teach-the-teacher model, in which librarians act in an educational developer role, has been one approach that librarians have used to integrate...

Amanda L. Folk, Jane Hammons, ... Hanna Primeau in Information Experience and Information Literacy

Conference paper 2024

E2Evideo: End to End Video and Image Pre-processing and Analysis Tool

In this demonstration paper, we present “e2evideo” a versatile Python package composed of domain-independent modules. These modules can be seamlessly...

Faiga Alawad, Pål Halvorsen, Michael A. Riegler in MultiMedia Modeling

Conference paper 2024

Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search

An important open problem in video retrieval and exploration concerns the generation and refinement of queries for complex tasks that standard...

Omar Shahbaz Khan, Hongyi Zhu, ... Björn Þór Jónsson in MultiMedia Modeling

Conference paper 2024

Removing Stray-Light for Wild-Field Fundus Image Fusion Based on Large Generative Models

In low-cost wide-field fundus cameras, the built-in lighting sources are prone to generate stray-light nearby, leading to low-quality image regions....

Jun Wu, Mingxin He, ... Dayong Ding in MultiMedia Modeling

Conference paper 2024

Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation

Incremental few-shot semantic segmentation aims to extend a semantic segmentation model to novel classes according to only a few labeled data, while...

Yuan Zhou, **n Chen, ... Qi Tian in MultiMedia Modeling

Conference paper 2024

From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction

The shape of the human face is largely determined by the underlying skull morphology. Craniofacial reconstruction (CfR), or the process of...

Yehong Pan, Jian Wang, ... Yuan Li in MultiMedia Modeling

Conference paper 2024

Super-Resolution-Assisted Feature Refined Extraction for Small Objects in Remote Sensing Images

Despite achieving impressive results in object detection in natural scenes, the task of object detection in remote sensing images is still full of...

Lihua Du, Wei Wu, Chen Li in MultiMedia Modeling

Conference paper 2024

A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection

Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit...

**aoting Li, Shouhong Wan, ... Peiquan ** in MultiMedia Modeling

Conference paper 2024

Dive into Coarse-to-Fine Strategy in Single Image Deblurring

The coarse-to-fine approach has gained significant popularity in the design of networks for single image deblurring. Traditional methods used to...

Zebin Li, Jian** Luo in MultiMedia Modeling

Conference paper 2024

Audio-Visual Segmentation by Leveraging Multi-scaled Features Learning

Audio-visual segmentation with semantics (AVSS) is an advanced approach that enriches Audio-visual segmentation (AVS) by incorporating object...

Sze An Peter Tan, Guangyu Gao, Jia Zhao in MultiMedia Modeling

Conference paper 2024

Search

Filters

Search Results

Search

Navigation