Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    An Investigation of CNN-CARU for Image Captioning

    The goal of an image description is to extract essential information and a description of the content of a media feature from an image. This description can be obtained directly from a human-understandable des...

    Sio-Kei Im in 4th International Conference on Electronics and Signal Processing (2024)

  2. No Access

    Article

    SMigraPH: a perceptually retained method for passive haptics-based migration of MR indoor scenes

    To enhance users’ immersion in the mixed reality (MR) cross-scene environment, it is imperative to make geometric modifications to arbitrary multi-scale virtual scenes, including adjustments to layout and size...

    Qixiang Ma, Lili Wang, Wei Ke, Sio-Kei Im in The Visual Computer (2023)

  3. No Access

    Article

    Real-scene-constrained virtual scene layout synthesis for mixed reality

    Given a real source scene and a virtual target scene, the real-scene-constrained virtual scene layout synthesis problem is defined as how to re-synthesize the layout of the virtual furniture in the virtual sce...

    Runze Fan, Lili Wang, **nda Liu, Sio Kei Im, Chan Tong Lam in The Visual Computer (2023)

  4. Article

    Open Access

    Speech emotion recognition based on Graph-LSTM neural network

    Currently, Graph Neural Networks have been extended to the field of speech signal processing. It is the more compact and flexible way to represent speech sequences by graphs. However, the structures of the rel...

    Yan Li, Yapeng Wang, Xu Yang, Sio-Kei Im in EURASIP Journal on Audio, Speech, and Musi… (2023)

  5. No Access

    Chapter

    Automatic Speech Recognition for Portuguese with Small Data Set

    Voice recognition has become more and more popular in various systems and applications. To further promote Macau tourism worldwide, a mobile Macau tourism APP is being develo** that supports voice control to...

    Yapeng Wang, Ruize Jia, Chan Tong Lam in Computer and Information Science 2021 - Fa… (2022)

  6. No Access

    Article

    Multiple classifier for concatenate-designed neural network

    This article introduces a multiple classifier method to improve the performance of concatenate-designed neural networks, such as ResNet and DenseNet, with the purpose of alleviating the pressure on the final c...

    Ka-Hou Chan, Sio-Kei Im, Wei Ke in Neural Computing and Applications (2022)

  7. No Access

    Chapter and Conference Paper

    Robust Pedestrian Detection: Faster Deployments with Fusion of Models

    Pedestrian detection has a wide range of real-world critical applications including security and management of emergency scenarios. In critical applications, detection recall and precision are both essential ...

    Chan Tong Lam, Jose Gaspar, Wei Ke, Xu Yang, Sio Kei Im in Pattern Recognition (2020)

  8. No Access

    Chapter and Conference Paper

    CARU: A Content-Adaptive Recurrent Unit for the Transition of Hidden State in NLP

    This article introduces a novel RNN unit inspired by GRU, namely the Content-Adaptive Recurrent Unit (CARU). The design of CARU contains all the features of GRU but requires fewer training parameters. We make ...

    Ka-Hou Chan, Wei Ke, Sio-Kei Im in Neural Information Processing (2020)

  9. No Access

    Chapter and Conference Paper

    Variable-Depth Convolutional Neural Network for Text Classification

    This article introduces a recurrent CNN based framework for the classification of arbitrary length text in natural sentence. In our model, we present a complete CNN design with recurrent structure to capture t...

    Ka-Hou Chan, Sio-Kei Im, Wei Ke in Neural Information Processing (2020)

  10. Chapter and Conference Paper

    Fast Grid-Based Fluid Dynamics Simulation with Conservation of Momentum and Kinetic Energy on GPU

    Since the computation of fluid animation is often too heavy to run in real-time simulation, we propose a fast grid-based method with parallel acceleration. In order to reduce the cost of computation kee** a ...

    Ka-Hou Chan, Sio-Kei Im in Image and Graphics (2017)

  11. No Access

    Article

    Improved rate-distortion optimized video coding using non-integer bit estimation and multiple Lambda search

    Many modern video encoders use the Lagrangian rate-distortion optimization (RDO) algorithm for mode decisions during the compression procedure. For each encoding stage, this approach involves minimizing a cost...

    Sio Kei Im, Mohammad Mahdi Ghandi in Frontiers of Computer Science (2016)