Search Results
-
MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval
In the text-video retrieval task, the objective is to calculate the similarity between a text and a video, and rank the relevant candidates higher....
-
SPSD: Similarity-preserving self-distillation for video–text retrieval
Most of existing methods solve cross-modal video and text retrieval via coarse-grained similarity computation based on global representations or...
-
Deep learning for video-text retrieval: a review
Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa. In general, this...
-
Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
Despite significant advancements in deep learning-based video–text retrieval methods, three challenges persist: the alignment of fine-grained...
-
VTM-GAN: video-text matcher based generative adversarial network for generating videos from textual description
Text-to-video synthesis has garnered significant attention as a challenging task in the domain of vision computing. With the advent of unsupervised...
-
Text presentation or video: Malaysian university students' preferences with synchronous and asynchronous learning
In overcoming the obstacles of online learning with the current Covid-19 pandemic crisis, synchronous and asynchronous learning has been a...
-
V2T: video to text framework using a novel automatic shot boundary detection algorithm
The generation of natural language descriptions for a video has been reported by many researchers till now. But, it is still the most interesting...
-
A comprehensive review of the video-to-text problem
Research in the Vision and Language area encompasses challenging topics that seek to connect visual and textual information. When the visual...
-
Learning from text and video blogs: comprehension effects on secondary school students
Informational video blogs are a popular method of communication among students that may be fruitful educational tools, but their potential benefits...
-
Video captioning with global and local text attention
The task of video captioning is to generate a video description corresponding to the video content, so there are stringent requirements for the...
-
An Algorithm for Detecting Precipitation in Computer Processing of Video Images
The importance of detecting and reducing the visibility of precipitation in video images obtained by fixed cameras is shown. A statistical...
-
Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval
Video-text cross-modal retrieval is significant to computer vision. Most of existing works focus on exploring the global similarity between...
-
Bilingual video captioning model for enhanced video retrieval
Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although...
-
Only overlay text: novel features for TV news broadcast video segmentation
Segmentation of television news videos into programs and stories (after removing advertisements) is a necessary first step for news broadcast...
-
RoICLIP: Text-Enhanced UAV-Based Video Object Detection
In recent years, Unmanned Aerial Vehicles (UAV)-based video object detection algorithms have attracted a lot of attention due to their widespread...
-
A video compression-cum-classification network for classification from compressed video streams
Video analytics can achieve increased speed and efficiency by operating directly on the compressed video format, thereby alleviating the decoding...
-
ICDAR 2023 Competition on Born Digital Video Text Question Answering
This paper presents the final results of the ICDAR 2023 Competition on Born Digital Video Text Question Answering (i.e., BDVT-QA) which contains two...
-
Security to text (S2T): multi-layered based security approaches for secret text content
In the digital world, text data is produced in an unstructured manner across various communication channels. Extracting valuable information from...
-
Improved Vehicle Detection Accuracy and Processing Time for Video Based ITS Applications
The increase in daily traffic volume needs a more effective, intelligent, and sophisticated traffic management and control strategy. Video-based...
-
Automatic football video production system with edge processing
Automatic video production of sports aims at producing an aesthetic broadcast of sporting events. This is an enabler of low-cost solutions for...