Search
Search Results
-
Refined dense face alignment through image matching
Face alignment is the foundation of building 3D avatars for virtue communication in the metaverse, human-computer interaction, AI-generated content,...
-
All-day Image Alignment for PTZ Surveillance Based on Correlated Siamese Neural Network
Image alignment is a highly researched topic in computer vision, which aligns a pair of images due to image changes. Despite the numerous studies...
-
Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning
Traditional image-sentence cross-modal retrieval methods usually aim to learn consistent representations of heterogeneous modalities, thereby to...
-
PSNet: position-shift alignment network for image caption
Recently, Transformer-based models have gained increasing popularity in the field of image captioning. The global attention mechanism of the...
-
Local feature semantic alignment network for few-shot image classification
The goal of few-shot learning is to use a small number of labeled samples to train a machine learning model and then classify the unlabeled samples....
-
Unbinding tensor product representations for image captioning with semantic alignment and complementation
Image captioning, which describes an image with natural language, is an important but challenging multi-modal task. Many state-of-the-art methods...
-
A transferability-aware covariance alignment network for image steganalysis
Image steganalysis seeks to detect whether the secret information is hidden in images. Recently, to alleviate the distribution discrepancy between...
-
Prototype local–global alignment network for image–text retrieval
Image–text retrieval is a challenging task due to the requirement of thorough multimodal understanding and precise inter-modality relationship...
-
Textline alignment on the image domain
Editing and publishing a historical manuscript involves a research phase to recover the original manuscript and reconstruct the transmission of its...
-
Locally controllable network based on visual–linguistic relation alignment for text-to-image generation
Since locally controllable text-to-image generation cannot achieve satisfactory results in detail, a novel locally controllable text-to-image...
-
Text-Vision Relationship Alignment for Referring Image Segmentation
Referring image segmentation aims to segment object in an image based on a referring expression. Its difficulty lies in aligning expression semantics...
-
Tensor factorization via transformed tensor-tensor product for image alignment
In this paper, we study the problem of a batch of linearly correlated image alignment, where the observed images are deformed by some unknown domain...
-
Learning semantic alignment from image for text-guided image inpainting
In this paper, we propose a method called LSAI (learning semantic alignment from image) to recover the corrupted image patches for text-guided image...
-
Cross-modal alignment with graph reasoning for image-text retrieval
Image-text retrieval task has received a lot of attention in the modern research field of artificial intelligence. It still remains challenging since...
-
Beyond homography: nonparametric image alignment via graph convolutional networks
We propose an image alignment algorithm based on weak supervision, which aims to identify the correspondence between a pair of reference and target...
-
A deep learning approach to satellite image time series coregistration through alignment of road networks
The adverse effects of thawing permafrost on transportation infrastructure in northern regions are exacerbated by climate change. To address this...
-
TOAC: Try-On Aligning Conformer for Image-Based Virtual Try-On Alignment
Recently, Image-based Virtual Try-on has garnered increasing attention within the realm of online apparel e-commerce, which aims to virtually... -
Recognize after early fusion: the Chinese food recognition based on the alignment of image and ingredients
As concerns about health continue to grow, more and more works are being done in the field of food computing. One of the basic topics in food...
-
SSM: Semantic Selection and Multi-view Alignment for Image-Text Retrieval
Image-text retrieval has been a crucial and fundamental task in multi-modal field. Benefiting from the superiority of Transformer encoder in modeling... -
Pipe Alignment with the Image Based Visual Servo Control
Tube alignment is an important task characterized by the complexity of processing and aligning large-diameter tube segments. Traditional...