Search
Search Results
-
YOLO-MTG: a lightweight YOLO model for multi-target garbage detection
With wide adoption of deep learning technology in AI, intelligent garbage detection has become a hot research topic. However, existing datasets...
-
Dog Face Recognition Using Vision Transformer
The demand for effective, efficient and safe methods for animal identification has been increasing significantly, due to the need for traceability,... -
Vision Transformers for Computer Go
Motivated by transformers’ success in diverse fields like language understanding and image analysis, our investigation explores their potential in... -
Vision transformer models for mobile/edge devices: a survey
With the rapidly growing demand for high-performance deep learning vision models on mobile and edge devices, this paper emphasizes the importance of...
-
CLIP for Lightweight Semantic Segmentation
The large-scale pretrained model CLIP, trained on 400 million image-text pairs, offers a promising paradigm for tackling vision tasks, albeit at the... -
HELViT: highly efficient lightweight vision transformer for remote sensing image scene classification
Remote sensing image scene classification methods based on convolutional neural networks (CNN) have been extremely successful. However, the...
-
Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection
Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to the Convolutional... -
Dynamic Circular Convolution for Image Classification
In recent years, Vision Transformer (ViT) has achieved an outstanding landmark in disentangling diverse information of visual inputs, superseding... -
Depthwise Convolution with Channel Mixer: Rethinking MLP in MetaFormer for Faster and More Accurate Vehicle Detection
Vehicle detection is an important task in intelligent traffic monitoring and autonomous driving. However, vehicle detection not only requires fast...