![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
671,993 Result(s)
-
Book Series
-
Chapter
Transformer-Driven Models for Language, Vision, and Multimodality
In this chapter, we will learn about the modeling and learning techniques that drive multimodal applications. We will focus specifically on the recent advances in transformer-based modeling for natural languag...
-
Chapter
Multimodal Content Generation
In this chapter, we will review the advances that are being made in this new field of multimodal content generation and also discuss several challenges associated with this emerging technology. First, we will ...
-
Chapter
Outlook
While multimodal information retrieval has several exciting applications and a high potential for impact on important problems, there are several challenges associated with the information that lives on the in...
-
Chapter
Introduction
In this book, our emphasis is on multimodal information retrieval, specifically concentrating on text and image data. The traditional unimodal systems, limited to a single type of data, often fall short of cap...
-
Book
-
Chapter
Multimodal Information Retrieval
In today’s rapidly evolving digital landscape, the wealth of available information has expanded beyond the boundaries of traditional text-based content. With the proliferation of multimedia platforms and data ...
-
Chapter
Retrieval Augmented Modeling
Till this point in our book, we have discussed the fundamental principles of information retrieval, exploring its key elements, and various approaches to achieving effective retrieval, including multimodal ret...
-
Article
Open AccessPanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition
We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across
-
Article
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pix...
-
Article
The multi-criteria evaluation of research efforts based on ETL software: from business intelligence approach to big data and semantic approaches
Many industries and academia have devoted a lot of effort and money to creating and/or using good extract-transform-load (ETL) software suitable for their data analysis purposes since it is considered a key to...
-
Article
Open AccessSignifiers for conveying and exploiting affordances: from human-computer interaction to multi-agent systems
The ecological psychologist James J. Gibson defined the notion of affordances to refer to what action possibilities environments offer to animals. In this paper, we show how (artificial) agents can discover an...
-
Article
A general framework for improving cuckoo search algorithms with resource allocation and re-initialization
Cuckoo search (CS) has currently become one of the most favorable meta-heuristic algorithms (MHAs). In this article, a simple yet effective framework is proposed for CS algorithms to reinforce their performanc...
-
Article
Open AccessUnsupervised Point Cloud Representation Learning by Clustering and Neural Rendering
Data augmentation has contributed to the rapid advancement of unsupervised learning on 3D point clouds. However, we argue that data augmentation is not ideal, as it requires a careful application-dependent sel...
-
Article
Surrogate-assisted evolutionary optimisation: a novel blueprint and a state of the art survey
Surrogate-Assisted Evolutionary Optimisation algorithms are a specialized brand of optimisers developed to undertake problems with computationally expensive fitness functions. These algorithms work by building...
-
Article
Tensor discriminant analysis on grassmann manifold with application to video based human action recognition
Representing videos as linear subspaces on Grassmann manifolds has made great strides in action recognition problems. Recent studies have explored the convenience of discriminant analysis by making use of Gras...
-
Article
Open AccessAn algorithmic debugging approach for belief-desire-intention agents
Debugging agent systems can be rather difficult. It is often noted as one of the most time-consuming tasks during the development of cognitive agents. Algorithmic (or declarative) debugging is a semi-automatic...
-
Article
ConDA: state-based data augmentation for context-dependent text-to-SQL
The context-dependent text-to-SQL task has profound real-world implications, as it facilitates users in extracting knowledge from vast databases, which allows users to acquire the information interactively for...
-
Article
Annotation-Free Human Sketch Quality Assessment
As lovely as bunnies are, your sketched version would probably not do them justice (Fig. 1). This paper recognises this very problem and studies sketch quality assessment for the first time—letting you find these...
-
Article
Open Set Recognition in Real World
Open set recognition (OSR) constitutes a critical endeavor within the domain of computer vision, frequently deployed in applications, such as autonomous driving and medical imaging recognition. Existing OSR me...