We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.
Filters applied:

Search Results

Showing 1-20 of 10,000 results
  1. Multi-scale hash encoding based neural geometry representation

    Recently, neural implicit function-based representation has attracted more and more attention, and has been widely used to represent surfaces using...

    Zhi Deng, Haoyao **ao, ... Juyong Zhang in Computational Visual Media
    Article Open access 22 March 2024
  2. Learning multi-level and multi-scale deep representations for privacy image classification

    Privacy image classification can help people detect privacy images when people share images. In this paper, we propose a novel method using...

    Yahui Han, Yonggang Huang, ... Yunbo Zheng in Multimedia Tools and Applications
    Article 23 October 2021
  3. Relational multi-scale metric learning for few-shot knowledge graph completion

    Few-shot knowledge graph completion (FKGC) refers to the task of inferring missing facts in a knowledge graph by utilizing a limited number of...

    Yu Song, Mingyu Gui, ... Dezhi Kong in Knowledge and Information Systems
    Article 08 May 2024
  4. Retinal artery/vein classification by multi-channel multi-scale fusion network

    The automatic artery/vein (A/V) classification in retinal fundus images plays a significant role in detecting vascular abnormalities and could speed...

    Junyan Yi, Chouyu Chen, Gang Yang in Applied Intelligence
    Article 23 August 2023
  5. MLANet: multi-level attention network with multi-scale feature fusion for crowd counting

    Estimating the population in a given scene is a process known as crowd counting. The field has recently garnered significant attention, and many...

    Liyan **ong, Yijuan Zeng, ... Peng Huang in Cluster Computing
    Article 04 March 2024
  6. TAE: Topic-aware encoder for large-scale multi-label text classification

    Convolutional neural networks, recurrent neural networks, and transformers have excelled in representation learning for large-scale multi-label text...

    Shaowei Qin, Hao Wu, ... Lei Zhang in Applied Intelligence
    Article 01 April 2024
  7. Person re-identification based on multi-scale feature fusion and multi-attention mechanism

    Person re-identification is an image retrieval technique for person in real scenes. Due to factors such as camera angle, lighting, and occlusion,...

    Jiacheng Pu, Wei Zou in Signal, Image and Video Processing
    Article 04 September 2023
  8. Multi-view Self-supervised Learning and Multi-scale Feature Fusion for Automatic Speech Recognition

    To address the challenges of the poor representation capability and low data utilization rate of end-to-end speech recognition models in deep...

    **gyu Zhao, Ruwei Li, ... Weidong An in Neural Processing Letters
    Article Open access 08 May 2024
  9. Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey

    With the urgent demand for generalized deep models, many pre-trained big models are proposed, such as bidirectional encoder representations (BERT),...

    **ao Wang, Guangyao Chen, ... Wen Gao in Machine Intelligence Research
    Article Open access 06 June 2023
  10. Accurate Facial Landmark Detector via Multi-scale Transformer

    Facial landmark detection is an essential prerequisite for many face applications, which has attracted much attention and made remarkable progress in...
    Yuyang Sha, Weiyu Meng, ... Kefeng Li in Pattern Recognition and Computer Vision
    Conference paper 2024
  11. PointCMC: cross-modal multi-scale correspondences learning for point cloud understanding

    Existing cross-modal frameworks have achieved impressive performance in point cloud object representations learning, where a 2D image encoder is...

    Honggu Zhou, **aogang Peng, ... Zizhao Wu in Multimedia Systems
    Article 30 April 2024
  12. Lightweight multi-scale network with attention for accurate and efficient crowd counting

    Crowd counting is a significant task in computer vision, which aims to estimate the total number of people appeared in images or videos. However, it...

    Mengyuan **, Hua Yan in The Visual Computer
    Article 25 September 2023
  13. MPA-GNet: multi-scale parallel adaptive graph network for 3D human pose estimation

    Graph convolutional networks (GCNs) have achieved remarkable performance in the 2D-to-3D human pose estimation (HPE) task. The adjacency matrix in...

    Ru Jia, Honghong Yang, ... Yumei Zhang in The Visual Computer
    Article 10 November 2023
  14. 3D Human pose estimation from video via multi-scale multi-level spatial temporal features

    In this paper, we present an innovative framework for 2D-to-3D human pose estimation from video, harnessing the power of multi-scale multi-level...

    Liling Fan, Kunliang Jiang, ... Yanmin Luo in Multimedia Tools and Applications
    Article 22 January 2024
  15. MSGNN: Multi-scale Spatio-temporal Graph Neural Network for epidemic forecasting

    Infectious disease forecasting has been a key focus and proved to be crucial in controlling epidemic. A recent trend is to develop forecasting models...

    Mingjie Qiu, Zhiyi Tan, Bing-Kun Bao in Data Mining and Knowledge Discovery
    Article 21 May 2024
  16. MMFL-net: multi-scale and multi-granularity feature learning for cross-domain fashion retrieval

    Instance-level image retrieval in fashion industry is a challenging issue owing to its increasing importance in real-scenario visual fashion search....

    Chen Bao, Xudong Zhang, ... Yongwei Miao in Multimedia Tools and Applications
    Article 22 August 2022
  17. Crowd Counting based on Multi-level Multi-scale Feature

    Crowd counting has drawn more and more attention for its significance in reality application. However, it’s still a challenging task because of scale...

    Di Wu, Zheyi Fan, Shuhan Yi in Applied Intelligence
    Article 15 June 2023
  18. 2MGAS-Net: multi-level multi-scale gated attentional squeezed network for polyp segmentation

    Accurate segmentation of colon polyps in endoscopic images is crucial for early colorectal cancer diagnosis and treatment planning. However,...

    Ibtissam Bakkouri, Siham Bakkouri in Signal, Image and Video Processing
    Article 10 May 2024
  19. GaitASMS: gait recognition by adaptive structured spatial representation and multi-scale temporal aggregation

    Gait recognition is one of the most promising video-based biometric technologies. The edge of silhouettes and motion are the most informative feature...

    Yan Sun, Hu Long, ... Mark Nixon in Neural Computing and Applications
    Article 17 February 2024
  20. MS-RAFT+: High Resolution Multi-Scale RAFT

    Hierarchical concepts have proven useful in many classical and learning-based optical flow methods regarding both accuracy and robustness. In this...

    Azin Jahedi, Maximilian Luz, ... Andrés Bruhn in International Journal of Computer Vision
    Article Open access 18 December 2023
Did you find what you were looking for? Share feedback.