Search
Search Results
-
Multimodal Information Retrieval
In today’s rapidly evolving digital landscape, the wealth of available information has expanded beyond the boundaries of traditional text-based... -
Retrieval Augmented Modeling
Till this point in our book, we have discussed the fundamental principles of information retrieval, exploring its key elements, and various... -
Outlook
While multimodal information retrieval has several exciting applications and a high potential for impact on important problems, there are several... -
Multimodal Content Generation
In this chapter, we will review the advances that are being made in this new field of multimodal content generation and also discuss several... -
Transformer-Driven Models for Language, Vision, and Multimodality
In this chapter, we will learn about the modeling and learning techniques that drive multimodal applications. We will focus specifically on the... -
Introduction
In this book, our emphasis is on multimodal information retrieval, specifically concentrating on text and image data. The traditional unimodal... -
Deep Learning-Based Solution for Intrusion Detection in the Internet of Things
Securing the Internet of Things-based environment is a top priority for consumers, businesses, and governments. There are billions of devices... -
Executive Summary
Recent advancements in machine learning, particularly in natural language processing, have been marked by the emergence of large models pretrained on... -
Sentence-Final Particle de in Mandarin as an Informativity Maximizer
In this study, we provide a new empirical generalization of the meaning contribution of the Mandarin sentence-final particle de from an information... -
Formalizing Henkin-Style Completeness of an Axiomatic System for Propositional Logic
I formalize a Henkin-style completeness proof for an axiomatic system for propositional logic in the proof assistant Isabelle/HOL. The formalization... -
Instructors’ Perceptions of an Information Literacy-Centered Professional Development Workshop
The teach-the-teacher model, in which librarians act in an educational developer role, has been one approach that librarians have used to integrate... -
E2Evideo: End to End Video and Image Pre-processing and Analysis Tool
In this demonstration paper, we present “e2evideo” a versatile Python package composed of domain-independent modules. These modules can be seamlessly... -
Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search
An important open problem in video retrieval and exploration concerns the generation and refinement of queries for complex tasks that standard... -
Removing Stray-Light for Wild-Field Fundus Image Fusion Based on Large Generative Models
In low-cost wide-field fundus cameras, the built-in lighting sources are prone to generate stray-light nearby, leading to low-quality image regions.... -
Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation
Incremental few-shot semantic segmentation aims to extend a semantic segmentation model to novel classes according to only a few labeled data, while... -
From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction
The shape of the human face is largely determined by the underlying skull morphology. Craniofacial reconstruction (CfR), or the process of... -
Super-Resolution-Assisted Feature Refined Extraction for Small Objects in Remote Sensing Images
Despite achieving impressive results in object detection in natural scenes, the task of object detection in remote sensing images is still full of... -
A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection
Optical and synthetic aperture radar (SAR) remote sensing have established themselves as valuable tools for object detection. Optical images exhibit... -
Dive into Coarse-to-Fine Strategy in Single Image Deblurring
The coarse-to-fine approach has gained significant popularity in the design of networks for single image deblurring. Traditional methods used to... -
Audio-Visual Segmentation by Leveraging Multi-scaled Features Learning
Audio-visual segmentation with semantics (AVSS) is an advanced approach that enriches Audio-visual segmentation (AVS) by incorporating object...