Skip to main content

previous disabled Page of 3
and
  1. No Access

    Chapter and Conference Paper

    End-to-End Streaming Customizable Keyword Spotting Based on Text-Adaptive Neural Search

    Streaming keyword spotting (KWS) is an important technique for voice assistant wake-up. While KWS with a preset fixed keyword has been well studied, test-time customizable keyword spotting in streaming mode re...

    Baochen Yang, Jiaqi Guo, Haoyu Li, Yu **, Qing Zhuo in Man-Machine Speech Communication (2024)

  2. No Access

    Chapter and Conference Paper

    3RE-Net: Joint Loss-REcovery and Super-REsolution Neural Network for REal-Time Video

    Real-time video over the Internet suffers from packet loss and low network bandwidth. The receiving side may receive down-sampled video with damaged frames. In this work, we are motivated to enhance the qualit...

    Liming Ge, David Zhaochen Jiang, Wei Bao in AI 2023: Advances in Artificial Intelligence (2024)

  3. No Access

    Chapter and Conference Paper

    Preliminary Experiment for Measuring the Anxiety Level Using Heart Rate Variability

    Anxiety is one of the most significant health issues. Generally, there are four levels of anxiety: mild anxiety, moderate anxiety, severe anxiety, and panic level anxiety

    Haochen He, Chen Feng, Peeraya Sripian in Virtual, Augmented and Mixed Reality (2023)

  4. No Access

    Chapter and Conference Paper

    Semantic Enhancement Framework for Robust Speech Recognition

    Auto speech recognition (ASR) has been widely used in dialogue systems of various domains, performing as a crucial part of technology. Since the output of the ASR system will provide input to the subsequent sy...

    Baochen Yang, Kai Yu in Man-Machine Speech Communication (2023)

  5. No Access

    Chapter and Conference Paper

    VERTEX: VEhicle Reconstruction and TEXture Estimation from a Single Image Using Deep Implicit Semantic Template Map**

    We introduce VERTEX, an effective solution to recovering the 3D shape and texture of vehicles from uncalibrated monocular inputs under real-world street environments. To fully utilize the semantic prior of veh...

    **aochen Zhao, Zerong Zheng, Chaonan Ji, Zhenyi Liu, Siyou Lin in Artificial Intelligence (2022)

  6. No Access

    Chapter and Conference Paper

    A Deep Attention Transformer Network for Pain Estimation with Facial Expression Video

    Since pain often causes deformations in the facial structure, analysis of facial expressions has received considerable attention for automatic pain estimation in recent years. This study proposes a deep attent...

    Haochen Xu, Manhua Liu in Biometric Recognition (2021)

  7. No Access

    Chapter and Conference Paper

    Integrating Task Information into Few-Shot Classifier by Channel Attention

    It has been increasingly recognized that meta-learning-based approaches provide a promising way to handle challenges to few-shot learning. In this paper, we incorporate the channel attention in the main framew...

    Zhaochen Li, Kedian Mu in Knowledge Science, Engineering and Management (2021)

  8. No Access

    Chapter and Conference Paper

    Stacked Sparse Autoencoder for Audio Object Coding

    Compared with channel-based audio coding, the object-based audio coding has a definite advantage in meeting the user’s demands of personalized control. However, in the conventional Spatial Audio Object Coding ...

    Yulin Wu, Ruimin Hu, **aochen Wang, Chenhao Hu, Gang Li in MultiMedia Modeling (2021)

  9. No Access

    Chapter and Conference Paper

    A Metagraph-Based Model for Predicting Drug-Target Interaction on Heterogeneous Network

    Determining drug-target interactions (DTIs) is an important task in drug discovery and drug relocalization. Currently, different models have been proposed to predict the potential interactions between drugs an...

    Peng Ke, Yuqi Wen, Zhongnan Zhang, Song He in Artificial Neural Networks and Machine Lea… (2021)

  10. No Access

    Chapter and Conference Paper

    EMRM: Enhanced Multi-source Review-Based Model for Rating Prediction

    Rating prediction, whose goal is to predict user preference for unconsumed items, has become one of the core tasks in recommendation systems. Recently, many deep learning-based methods have been applied to the...

    **aochen Wang, Tingsong **ao, Jie Shao in Knowledge Science, Engineering and Management (2021)

  11. No Access

    Chapter and Conference Paper

    Multi-step Coding Structure of Spatial Audio Object Coding

    The spatial audio object coding (SAOC) is an effective meth-od which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal ...

    Chenhao Hu, Ruimin Hu, **aochen Wang, Tingzhao Wu, Dengshi Li in MultiMedia Modeling (2020)

  12. No Access

    Chapter and Conference Paper

    Imputation of Incomplete Data Based on Attribute Cross Fitting Model and Iterative Missing Value Variables

    The problem of missing values is often encountered in tasks such as machine learning, and imputation of missing values has become an important research content in incomplete data analysis. In this paper, we p...

    **chong Zhu, Liyong Zhang, **aochen Lai in Advances in Neural Networks – ISNN 2020 (2020)

  13. No Access

    Chapter and Conference Paper

    Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet

    When using a loudspeaker triplet for virtual sound localization, the traditional conversion method will result in inaccurate localization. In this paper, we constructed a perceptual localization distortion mod...

    Duanzheng Guan, Dengshi Li, Xuebei Cai, **aochen Wang, Ruimin Hu in MultiMedia Modeling (2020)

  14. No Access

    Chapter and Conference Paper

    HMM-Based Person Re-identification in Large-Scale Open Scenario

    This paper aims to tackle person re-identification (person re-ID) in large-scale open scenario, which differs from the conventional person re-ID tasks but is significant for some real suspect investigation ca...

    Dongyang Li, Ruimin Hu, Wenxin Huang, **aochen Wang, Dengshi Li in MultiMedia Modeling (2020)

  15. No Access

    Chapter and Conference Paper

    HRTF Representation with Convolutional Auto-encoder

    The head-related transfer function (HRTF) can be considered as some kind of filter that describes how a sound from an arbitrary spatial direction transfers to the listener’s eardrums. HRTF can be used to synth...

    Wei Chen, Ruimin Hu, **aochen Wang, Dengshi Li in MultiMedia Modeling (2020)

  16. No Access

    Chapter and Conference Paper

    A Novel Ensemble Approach for Click-Through Rate Prediction Based on Factorization Machines and Gradient Boosting Decision Trees

    Click-Through Rate (CTR) prediction is a significant technique in the field of computational advertising, its accuracy directly affects companies profits and user experience. Achieving great ability of general...

    **aochen Wang, Gang Hu, Haoyang Lin, Jiayu Sun in Web and Big Data (2019)

  17. No Access

    Chapter and Conference Paper

    Imputation Using a Correlation-Enhanced Auto-Associative Neural Network with Dynamic Processing of Missing Values

    The missing value is a common phenomenon in real-world datasets, which makes the analysis of incomplete data become an active research area. In this paper, a correlation-enhanced auto-associative neural networ...

    **aochen Lai, **a Wu, Liyong Zhang in Advances in Neural Networks – ISNN 2019 (2019)

  18. No Access

    Chapter and Conference Paper

    Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model

    Speech intelligibility enhancement is extremely meaningful for successful speech communication in noisy environments. Several methods based on Lombard effect are used to increase intelligibility. In those meth...

    Rui Zhang, Ruimin Hu, Gang Li, **aochen Wang in MultiMedia Modeling (2019)

  19. No Access

    Chapter and Conference Paper

    Cervical Nuclei Segmentation in Whole Slide Histopathology Images Using Convolution Neural Network

    Pathologists generally diagnose whether or not cervical cancer cells have the potential to spread to other organs and assess the malignancy of cancer through whole slide histopathology images using virtual mic...

    Qiuju Yang, Kaijie Wu, Hao Cheng, Chaochen Gu, Yuan Liu in Soft Computing in Data Science (2019)

  20. No Access

    Chapter and Conference Paper

    Image Stitching Based on Discrete Wavelet Transform and Slope Fusion

    The fusion algorithm of traditional image stitching does not fully consider the differences of the clarity of the two images, and the conventional Discrete Wavelet Transform algorithm would blur the image whe...

    Daochen Weng, Qianying Zheng, Bingkun Yang in Multi-disciplinary Trends in Artificial In… (2019)

previous disabled Page of 3