Search Results

Showing 1-20 of 10,000 results
  1. MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval

    In the text-video retrieval task, the objective is to calculate the similarity between a text and a video, and rank the relevant candidates higher....

    Xiaoyu Wu, Jiayao Qian, Lulu Yang in Neural Processing Letters
    Article Open access 17 February 2024
  2. SPSD: Similarity-preserving self-distillation for video–text retrieval

    Most of existing methods solve cross-modal video and text retrieval via coarse-grained similarity computation based on global representations or...

    Jiachen Wang, Yan Hua, ... Hongwei Kou in International Journal of Multimedia Information Retrieval
    Article 01 September 2023
  3. Deep learning for video-text retrieval: a review

    Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa. In general, this...

    Article 23 February 2023
  4. Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network

    Despite significant advancements in deep learning-based video–text retrieval methods, three challenges persist: the alignment of fine-grained...

    Gang Lv, Yining Sun, Fudong Nian in Multimedia Systems
    Article 22 January 2024
  5. VTM-GAN: video-text matcher based generative adversarial network for generating videos from textual description

    Text-to-video synthesis has garnered significant attention as a challenging task in the domain of vision computing. With the advent of unsupervised...

    Rayeesa Mehmood, Rumaan Bashir, Kaiser J. Giri in International Journal of Information Technology
    Article 16 September 2023
  6. Text presentation or video: Malaysian university students' preferences with synchronous and asynchronous learning

    In overcoming the obstacles of online learning with the current Covid-19 pandemic crisis, synchronous and asynchronous learning has been a...

    Ali Sorayyaei Azar, Nur Haslinda Iskandar Tan in Education and Information Technologies
    Article 04 May 2023
  7. V2T: video to text framework using a novel automatic shot boundary detection algorithm

    The generation of natural language descriptions for a video has been reported by many researchers till now. But, it is still the most interesting...

    Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay in Multimedia Tools and Applications
    Article 08 March 2022
  8. A comprehensive review of the video-to-text problem

    Research in the Vision and Language area encompasses challenging topics that seek to connect visual and textual information. When the visual...

    Jesus Perez-Martin, Benjamin Bustos, ... Grethel Coello Said in Artificial Intelligence Review
    Article 16 January 2022
  9. Learning from text and video blogs: comprehension effects on secondary school students

    Informational video blogs are a popular method of communication among students that may be fruitful educational tools, but their potential benefits...

    P. Delgado, Ø. Anmarkrud, ... L. Salmerón in Education and Information Technologies
    Article Open access 30 November 2021
  10. Video captioning with global and local text attention

    The task of video captioning is to generate a video description corresponding to the video content, so there are stringent requirements for the...

    Yuqing Peng, Chenxi Wang, ... Yingjun Li in The Visual Computer
    Article 05 September 2021
  11. An Algorithm for Detecting Precipitation in Computer Processing of Video Images

    The importance of detecting and reducing the visibility of precipitation in video images obtained by fixed cameras is shown. A statistical...

    V. T. Dmitriev, A. A. Baukov in Programming and Computer Software
    Article 26 May 2023
  12. Multi-grained encoding and joint embedding space fusion for video and text cross-modal retrieval

    Video-text cross-modal retrieval is significant to computer vision. Most of existing works focus on exploring the global similarity between...

    Xiaotao Cui, Jing Xiao, ... Jia Zhu in Multimedia Tools and Applications
    Article 30 May 2022
  13. Bilingual video captioning model for enhanced video retrieval

    Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although...

    Norah Alrebdi, Amal A. Al-Shargabi in Journal of Big Data
    Article Open access 16 January 2024
  14. Only overlay text: novel features for TV news broadcast video segmentation

    Segmentation of television news videos into programs and stories (after removing advertisements) is a necessary first step for news broadcast...

    Raghvendra Kannao, Prithwijit Guha, Bidyut B. Chaudhuri in Multimedia Tools and Applications
    Article 06 April 2022
  15. RoICLIP: Text-Enhanced UAV-Based Video Object Detection

    In recent years, Unmanned Aerial Vehicles (UAV)-based video object detection algorithms have attracted a lot of attention due to their widespread...

    Peiyi Zhang, Yali Li, Shengjin Wang in Image and Graphics
    Conference paper 2023
  16. A video compression-cum-classification network for classification from compressed video streams

    Video analytics can achieve increased speed and efficiency by operating directly on the compressed video format, thereby alleviating the decoding...

    Sangeeta Yadav, Preeti Gulia, ... Prashant Kumar Shukla in The Visual Computer
    Article 08 March 2024
  17. ICDAR 2023 Competition on Born Digital Video Text Question Answering

    This paper presents the final results of the ICDAR 2023 Competition on Born Digital Video Text Question Answering (i.e., BDVT-QA) which contains two...

    Zhibo Yang, Xiaoge Song, ... Cong Yao in Document Analysis and Recognition - ICDAR 2023
    Conference paper 2023
  18. Security to text (S2T): multi-layered based security approaches for secret text content

    In the digital world, text data is produced in an unstructured manner across various communication channels. Extracting valuable information from...

    Shamal Kashid, Lalit K. Awasthi, Krishan Berwal in Multimedia Tools and Applications
    Article 19 June 2024
  19. Improved Vehicle Detection Accuracy and Processing Time for Video Based ITS Applications

    The increase in daily traffic volume needs a more effective, intelligent, and sophisticated traffic management and control strategy. Video-based...

    Manipriya Sankaranarayanan, C. Mala, Samson Mathew in SN Computer Science
    Article 26 April 2022
  20. Automatic football video production system with edge processing

    Automatic video production of sports aims at producing an aesthetic broadcast of sporting events. This is an enabler of low-cost solutions for...

    Henry Carrillo, Julian Quiroga, ... Edisson Maldonado in Machine Vision and Applications
    Article 21 February 2022