Search
Search Results
-
Multimodal Information Retrieval
In today’s rapidly evolving digital landscape, the wealth of available information has expanded beyond the boundaries of traditional text-based... -
Outlook
While multimodal information retrieval has several exciting applications and a high potential for impact on important problems, there are several... -
Multimodal Content Generation
In this chapter, we will review the advances that are being made in this new field of multimodal content generation and also discuss several... -
Retrieval Augmented Modeling
Till this point in our book, we have discussed the fundamental principles of information retrieval, exploring its key elements, and various... -
Transformer-Driven Models for Language, Vision, and Multimodality
In this chapter, we will learn about the modeling and learning techniques that drive multimodal applications. We will focus specifically on the... -
Introduction
In this book, our emphasis is on multimodal information retrieval, specifically concentrating on text and image data. The traditional unimodal... -
Toward an Industrial Robot Gym
This chapter briefly describes how to build a digital twin and how to make it work in order to benefit a modern factory. It describes how the digital... -
Creating SORDI: The Largest Synthetic Dataset for Industries
This chapter describes SORDI, the largest Synthetic Industrial Dataset for Object Detection for Industries, jointly developed by BMW Group and... -
Digital Images – The Bread and Butter of Computer Vision
This chapter focuses on the usage of Computer Vision (CV) in manufacturing for product inspection, quality assurance, workplace safety, and factory... -
Projects
This is a special chapter dealing with security projects. We have arranged the projects in three parts. Part 1 consists of projects that can be done... -
Industrial Evolution Toward the Age of Imagination
This chapter briefly describes transitions within the industrial age: from the first, second, and third industrial revolutions, to the current... -
How Visual Data Is Revolutionizing the Industry World
This chapter briefly describes how Industry 4.0 technologies have placed digital multimedia data at the center of organizations of all sizes and... -
Standardization and Security Criteria: Security Evaluation of Computer Products
Our growing dependence on technology and the corresponding skyrocketing security problems arising from it have all created a high demand for... -
Deep Learning-Based Solution for Intrusion Detection in the Internet of Things
Securing the Internet of Things-based environment is a top priority for consumers, businesses, and governments. There are billions of devices... -
Executive Summary
Recent advancements in machine learning, particularly in natural language processing, have been marked by the emergence of large models pretrained on... -
PDTW150K: A Dataset for Patent Drawing Retrieval
We introduce a new large-scale patent dataset termed PDTW150K for patent drawing retrieval. The dataset contains more than 150,000 patents associated... -
Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers
Analyzing laparoscopic surgery videos presents a complex and multifaceted challenge, with applications including surgical training, intra-operative... -
GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild
Greenwashing, a form of deceptive marketing where organizations attempt to convince consumers that their offerings and operations are environmentally... -
RESET: Relational Similarity Extension for V3C1 Video Dataset
Effective content-based information retrieval (IR) is crucial across multimedia platforms, especially in the realm of videos. Whether navigating a... -
WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio
Multi-modal deep learning techniques for matching free-form text with music have shown promising results in the field of Music Information Retrieval...