Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

    In the pursuit of achieving ever-increasing accuracy, large and complex neural networks are usually developed. Such models demand high computational resources and therefore cannot be deployed on edge devices. ...

    Muhammad Maaz, Abdelrahman Shaker in Computer Vision – ECCV 2022 Workshops (2023)

  2. No Access

    Chapter and Conference Paper

    PS-ARM: An End-to-End Attention-Aware Relation Mixer Network for Person Search

    Person search is a challenging problem with various real-world applications, that aims at joint person detection and re-identification of a query person from uncropped gallery images. Although, previous study ...

    Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan in Computer Vision – ACCV 2022 (2023)

  3. No Access

    Chapter and Conference Paper

    DoodleFormer: Creative Sketch Drawing with Transformers

    Creative sketching or doodling is an expressive activity, where imaginative and previously unseen depictions of everyday visual objects are drawn. Creative sketch image generation is a challenging vision probl...

    Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal in Computer Vision – ECCV 2022 (2022)

  4. No Access

    Chapter and Conference Paper

    Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer

    State-of-the-art transformer-based video instance segmentation (VIS) approaches typically utilize either single-scale spatio-temporal features or per-frame multi-scale features during the attention computation...

    Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal in Computer Vision – ECCV 2022 (2022)

  5. No Access

    Chapter and Conference Paper

    Count- and Similarity-Aware R-CNN for Pedestrian Detection

    Recent pedestrian detection methods generally rely on additional supervision, such as visible bounding-box annotations, to handle heavy occlusions. We propose an approach that leverages pedestrian count and pr...

    ** **e, Hisham Cholakkal, Rao Muhammad Anwer in Computer Vision – ECCV 2020 (2020)

  6. No Access

    Article

    CNN-based gender classification in near-infrared periocular images

    Periocular region has emerged as a key biometric trait with potential applications in the forensics domain. In this paper, we explore two convolutional neural network (CNN)-based approaches for gender classifi...

    Anirudh Manyala, Hisham Cholakkal, Vijay Anand in Pattern Analysis and Applications (2019)