Search Page | SpringerLink

MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval

In the text-video retrieval task, the objective is to calculate the similarity between a text and a video, and rank the relevant candidates higher....

Xiaoyu Wu, Jiayao Qian, Lulu Yang in Neural Processing Letters

Article Open access 17 February 2024

SPSD: Similarity-preserving self-distillation for video–text retrieval

Most of existing methods solve cross-modal video and text retrieval via coarse-grained similarity computation based on global representations or...

Jiachen Wang, Yan Hua, ... Hongwei Kou in International Journal of Multimedia Information Retrieval

Article 01 September 2023

Deep learning for video-text retrieval: a review

Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa. In general, this...

Cunjuan Zhu, Qi Jia, ... Yu Liu in International Journal of Multimedia Information Retrieval

Article 23 February 2023

Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network

Despite significant advancements in deep learning-based video–text retrieval methods, three challenges persist: the alignment of fine-grained...

Gang Lv, Yining Sun, Fudong Nian in Multimedia Systems

Article 22 January 2024

VTM-GAN: video-text matcher based generative adversarial network for generating videos from textual description

Text-to-video synthesis has garnered significant attention as a challenging task in the domain of vision computing. With the advent of unsupervised...

Rayeesa Mehmood, Rumaan Bashir, Kaiser J. Giri in International Journal of Information Technology

Article 16 September 2023

Text presentation or video: Malaysian university students' preferences with synchronous and asynchronous learning

In overcoming the obstacles of online learning with the current Covid-19 pandemic crisis, synchronous and asynchronous learning has been a...

Ali Sorayyaei Azar, Nur Haslinda Iskandar Tan in Education and Information Technologies

Article 04 May 2023

V2T: video to text framework using a novel automatic shot boundary detection algorithm

The generation of natural language descriptions for a video has been reported by many researchers till now. But, it is still the most interesting...

Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay in Multimedia Tools and Applications

Article 08 March 2022

A comprehensive review of the video-to-text problem

Research in the Vision and Language area encompasses challenging topics that seek to connect visual and textual information. When the visual...

Jesus Perez-Martin, Benjamin Bustos, ... Grethel Coello Said in Artificial Intelligence Review

Article 16 January 2022

Learning from text and video blogs: comprehension effects on secondary school students

Informational video blogs are a popular method of communication among students that may be fruitful educational tools, but their potential benefits...

P. Delgado, Ø. Anmarkrud, ... L. Salmerón in Education and Information Technologies

Article Open access 30 November 2021

Video captioning with global and local text attention

The task of video captioning is to generate a video description corresponding to the video content, so there are stringent requirements for the...

Yuqing Peng, Chenxi Wang, ... Yingjun Li in The Visual Computer

Article 05 September 2021

An Algorithm for Detecting Precipitation in Computer Processing of Video Images

The importance of detecting and reducing the visibility of precipitation in video images obtained by fixed cameras is shown. A statistical...

V. T. Dmitriev, A. A. Baukov in Programming and Computer Software

Article 26 May 2023

Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval

Video-text cross-modal retrieval is significant to computer vision. Most of existing works focus on exploring the global similarity between...

Xiaotao Cui, Jing Xiao, ... Jia Zhu in Multimedia Tools and Applications

Article 30 May 2022

Bilingual video captioning model for enhanced video retrieval

Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although...

Norah Alrebdi, Amal A. Al-Shargabi in Journal of Big Data

Article Open access 16 January 2024

Only overlay text: novel features for TV news broadcast video segmentation

Segmentation of television news videos into programs and stories (after removing advertisements) is a necessary first step for news broadcast...

Raghvendra Kannao, Prithwijit Guha, Bidyut B. Chaudhuri in Multimedia Tools and Applications

Article 06 April 2022

RoICLIP: Text-Enhanced UAV-Based Video Object Detection

In recent years, Unmanned Aerial Vehicles (UAV)-based video object detection algorithms have attracted a lot of attention due to their widespread...

Peiyi Zhang, Yali Li, Shengjin Wang in Image and Graphics

Conference paper 2023

A video compression-cum-classification network for classification from compressed video streams

Video analytics can achieve increased speed and efficiency by operating directly on the compressed video format, thereby alleviating the decoding...

Sangeeta Yadav, Preeti Gulia, ... Prashant Kumar Shukla in The Visual Computer

Article 08 March 2024

ICDAR 2023 Competition on Born Digital Video Text Question Answering

This paper presents the final results of the ICDAR 2023 Competition on Born Digital Video Text Question Answering (i.e., BDVT-QA) which contains two...

Zhibo Yang, Xiaoge Song, ... Cong Yao in Document Analysis and Recognition - ICDAR 2023

Conference paper 2023

Security to text (S2T): multi-layered based security approaches for secret text content

In the digital world, text data is produced in an unstructured manner across various communication channels. Extracting valuable information from...

Shamal Kashid, Lalit K. Awasthi, Krishan Berwal in Multimedia Tools and Applications

Article 19 June 2024

Improved Vehicle Detection Accuracy and Processing Time for Video Based ITS Applications

The increase in daily traffic volume needs a more effective, intelligent, and sophisticated traffic management and control strategy. Video-based...

Manipriya Sankaranarayanan, C. Mala, Samson Mathew in SN Computer Science

Article 26 April 2022

Automatic football video production system with edge processing

Automatic video production of sports aims at producing an aesthetic broadcast of sporting events. This is an enabler of low-cost solutions for...

Henry Carrillo, Julian Quiroga, ... Edisson Maldonado in Machine Vision and Applications

Article 21 February 2022
