  1. No Access

    Chapter and Conference Paper

    Static Semantics Reconstruction for Enhancing JavaScript-WebAssembly Multilingual Malware Detection

    The emergence of WebAssembly allows attackers to hide the malicious functionalities of JavaScript malware in cross-language interoperations, termed JavaScript-WebAssembly multilingual malware (JWMM). However, ...

    Yifan Xia, Ping He, Xuhong Zhang, Peiyu Liu in Computer Security – ESORICS 2023 (2024)

  2. No Access

    Chapter and Conference Paper

    Applying Rely-Guarantee Reasoning on Concurrent Memory Management and Mailbox in μC/OS-II: A Case Study

    Real-time operating systems (RTOSs) such as μC/OS-II are critical components of ma...

    Huan Sun, Ziyu Mao, Jingyi Wang, Ziyan Zhao in Formal Methods for Industrial Critical Sys… (2023)

  3. No Access

    Chapter and Conference Paper

    VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

    Recently, computer vision foundation models such as CLIP and ALIGN have shown impressive generalization capabilities on various downstream tasks. But their abilities to deal with the long-tailed data still r...

    Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao in Computer Vision – ECCV 2022 (2022)

  4. No Access

    Chapter and Conference Paper

    BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

    3D visual perception tasks, including 3D detection and map segmentation based on multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, wh...

    Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima in Computer Vision – ECCV 2022 (2022)

  5. No Access

    Chapter and Conference Paper

    Guided Refine-Head for Object Detection

    In recent years, multi-stage detectors improve the accuracy of object detection to a new level. However, due to multiple stages, these methods typically fall short in the inference speed. To alleviate this pro...

    Lingyun Zeng, You Song, Wenhai Wang in MultiMedia Modeling (2020)

  6. No Access

    Chapter and Conference Paper

    TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation

    Benefit from the development of deep neural networks, scene text detectors have progressed rapidly over the past few years and achieved outstanding performance on several standard benchmarks. However, most exi...

    Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu in MultiMedia Modeling (2020)

  7. No Access

    Chapter and Conference Paper

    Differentiable Hierarchical Graph Grouping for Multi-person Pose Estimation

    Multi-person pose estimation is challenging because it localizes body keypoints for multiple persons simultaneously. Previous methods can be divided into two streams, i.e. top-down and bottom-up methods. The top-...

    Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian in Computer Vision – ECCV 2020 (2020)

  8. No Access

    Chapter and Conference Paper

    Scene Text Image Super-Resolution in the Wild

    Low-resolution text images are often seen in natural scenes such as documents captured by mobile phones. Recognizing low-resolution text images is challenging because they lose detailed content information, le...

    Wenjia Wang, Enze Xie, Xuebo Liu, Wenhai Wang, Ding Liang in Computer Vision – ECCV 2020 (2020)

  9. No Access

    Chapter and Conference Paper

    AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

    Scene text spotting aims to detect and recognize the entire word or sentence with multiple characters in natural images. It is still challenging because ambiguity often occurs when the spacing between characte...

    Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang in Computer Vision – ECCV 2020 (2020)

  10. No Access

    Chapter and Conference Paper

    Segmenting Transparent Objects in the Wild

    Transparent objects such as windows and bottles made by glass widely exist in the real world. Segmenting transparent objects is challenging because these objects have diverse appearance inherited from the imag...

    Enze Xie, Wenjia Wang, Wenhai Wang, Mingyu Ding in Computer Vision – ECCV 2020 (2020)

  11. No Access

    Chapter and Conference Paper

    A Novel 3D Human Action Recognition Framework for Video Content Analysis

    Understanding the meanings of human actions from 3D skeleton data embedded videos is a new challenge in content-oriented video analysis. In this paper, we propose to incorporate temporal patterns of joint posi...

    Lianglei Wei, Yirui Wu, Wenhai Wang, Tong Lu in MultiMedia Modeling (2018)

  12. No Access

    Chapter and Conference Paper

    Cloud of Line Distribution for Arbitrary Text Detection in Scene/Video/License Plate Images

    Detecting arbitrary oriented text in scene and license plate images is challenging due to multiple adverse factors caused by images of diversified applications. This paper proposes a novel idea of extracting C...

    Wenhai Wang, Yirui Wu in Advances in Multimedia Information Process… (2018)

  13. No Access

    Chapter and Conference Paper

    Hand Pose Estimation with Attention-and-Sequence Network

    Hand pose estimation from depth images is an essential topic in computer vision. Despite the recent advancements in this area promoted by Convolutional Neural Network, accurate hand pose estimation is still a ...

    Tianping Hu, Wenhai Wang, Tong Lu in Advances in Multimedia Information Process… (2018)

  14. No Access

    Chapter and Conference Paper

    Cloud of Line Distribution and Random Forest Based Text Detection from Natural/Video Scene Images

    Text detection in natural and video scene images is still considered to be challenging due to unpredictable nature of scene texts. This paper presents a new method based on Cloud of Line Distribution (COLD) an...

    Wenhai Wang, Yirui Wu, Palaiahnakote Shivakumara, Tong Lu in MultiMedia Modeling (2018)

  15. No Access

    Chapter and Conference Paper

    Visual Robotic Object Grasping Through Combining RGB-D Data and 3D Meshes

    In this paper, we present a novel framework to drive automatic robotic grasp by matching camera captured RGB-D data with 3D meshes, on which prior knowledge for grasp is pre-defined for each object type. The p...

    Yiyang Zhou, Wenhai Wang, Wenjie Guan, Yirui Wu, Heng Lai, Tong Lu in MultiMedia Modeling (2017)