1,125,147 Result(s)
-
Chapter
Transformer-Driven Models for Language, Vision, and Multimodality
In this chapter, we will learn about the modeling and learning techniques that drive multimodal applications. We will focus specifically on the recent advances in transformer-based modeling for natural languag...
-
Chapter
Multimodal Content Generation
In this chapter, we will review the advances that are being made in this new field of multimodal content generation and also discuss several challenges associated with this emerging technology. First, we will ...
-
Chapter
Outlook
While multimodal information retrieval has several exciting applications and a high potential for impact on important problems, there are several challenges associated with the information that lives on the in...
-
Chapter
Introduction
In this book, our emphasis is on multimodal information retrieval, specifically concentrating on text and image data. The traditional unimodal systems, limited to a single type of data, often fall short of cap...
-
Chapter
Multimodal Information Retrieval
In today’s rapidly evolving digital landscape, the wealth of available information has expanded beyond the boundaries of traditional text-based content. With the proliferation of multimedia platforms and data ...
-
Chapter
Retrieval Augmented Modeling
Till this point in our book, we have discussed the fundamental principles of information retrieval, exploring its key elements, and various approaches to achieving effective retrieval, including multimodal ret...
-
Article
A stabilized Crank-Nicolson virtual element method for the unsteady Navier-Stokes problems with high Reynolds number
This paper studies a stabilized virtual element method for the unsteady Navier-Stokes problems on polygonal meshes. Using “equal-order” virtual elements in space and the Crank-Nicolson scheme in time, we give ...
-
Article
Conv-ViT fusion for improved handwritten Arabic character classification
An essential aspect of pattern recognition pertains to handwriting recognition, particularly in languages with diverse character styles like Arabic. Arabic characters present a challenge due to their varied wr...
-
Article
Performance evaluation of Word2vec accelerators exploiting spatial and temporal parallelism on DDR/HBM-based FPGAs (Open Access)
Word embedding is a technique for representing words as vectors in a way that captures their semantic and syntactic relationships. The processing time of one of the most popular word embedding technique Word2v...
-
Article
A learning-based efficient query model for blockchain in internet of medical things
This paper proposes a learning-based model for the resource-constrained edge nodes in the blockchain-enabled Internet of Medical Things (IoMT) systems to realize efficient querying. Three layers are designed i...
-
Article
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition (Open Access)
We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across
-
Article
FPGA-based SoC design for real-time facial point detection using deep convolutional neural networks with dynamic partial reconfiguration
Deep convolutional neural networks (DCNNs) have been mainly powerful and important artificial intelligence techniques, which are exploited in various computer vision applications, such as facial point detectio...
-
Article
MS-HRNet: multi-scale high-resolution network for human pose estimation
Human pose estimation has important applications in medical diagnosis (such as early diagnosis of autism in children and assisting with the diagnosis of Parkinson’s disease), human-computer interaction, animat...
-
Article
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pix...
-
Article
Enhancing image steganalysis via integrated reinforcement learning and dilated convolution techniques
In the wake of unparalleled expansion in digital communication platforms, the imperative to bolster security and privacy measures has escalated. Within this landscape, image steganalysis emerges as a pivotal d...
-
Article
\(H^{1}\)-norm error analysis of a robust ADI method on graded mesh for three-dimensional subdiffusion problems
This work proposes a robust ADI scheme on graded mesh for solving three-dimensional subdiffusion problems. The Caputo fractional derivative is discretized by L1 scheme, where the graded mesh is used to elimina...
-
Article
Signifiers for conveying and exploiting affordances: from human-computer interaction to multi-agent systems (Open Access)
The ecological psychologist James J. Gibson defined the notion of affordances to refer to what action possibilities environments offer to animals. In this paper, we show how (artificial) agents can discover an...
-
Article
Publisher Correction: Improving query processing in blockchain systems by using a multi-level sharding mechanism