Skip to main content

previous disabled Page of 2
and
  1. No Access

    Chapter and Conference Paper

    AIGCIQA2023: A Large-Scale Image Quality Assessment Database for AI Generated Images: From the Perspectives of Quality, Authenticity and Correspondence

    Recent years have witnessed a rapid growth of Artificial Intelligence Generated Content (AIGC), among which with the development of text-to-image techniques, AI-based image generation has been applied to vario...

    Jiarui Wang, Huiyu Duan, **g Liu, Shi Chen, **ongkuo Min in Artificial Intelligence (2024)

  2. No Access

    Chapter and Conference Paper

    Perceptual Quality Assessment of Omnidirectional Audio-Visual Signals

    Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc. Assessing the quality of ODVs is significant for service-providers to ...

    **lei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu in Artificial Intelligence (2024)

  3. No Access

    Article

    Boosting power line inspection in bad weather: Removing weather noise with channel-spatial attention-based UNet

    Power line inspection based on UAVs can effectively improve the inspection efficiency. With the development of object detection algorithms, automatic detection and recognition for power line components based o...

    Yaocheng Li, Qinglin Qian, Huiyu Duan, **ongkuo Min in Multimedia Tools and Applications (2023)

  4. No Access

    Article

    Audio-visual aligned saliency model for omnidirectional video with implicit neural representation learning

    Since the audio information is fully explored and leveraged in omnidirectional videos (ODVs), the performance of existing audio-visual saliency models has been improving dramatically and significantly. However...

    Dandan Zhu, Xuan Shao, Kaiwei Zhang, **ongkuo Min, Guangtao Zhai in Applied Intelligence (2023)

  5. No Access

    Chapter and Conference Paper

    MSPP-IQA: Adaptive Blind Image Quality Assessment Based on Multi-level Spatial Pyramid Pooling

    The main reason why image quality assessment (IQA) for real distortion has not been well solved is that, first, after the training of CNN is completed, the parameters of the convolution kernel are fixed, but t...

    Fangfang Lu, Yingjie Lian, Feng Qin, Guangtao Zhai in Digital Multimedia Communications (2023)

  6. No Access

    Chapter and Conference Paper

    Audio-Visual Saliency for Omnidirectional Videos

    Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc.. However, most studies ...

    Yuxin Zhu, **lei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu in Image and Graphics (2023)

  7. No Access

    Chapter and Conference Paper

    Perceptual Quality Assessment of TTS-Synthesized Speech

    The evaluation of a Text-to-Speech (TTS) system is typically labor-intensive and highly biased because there is no golden standard of the generated speech or objective evaluation metrics. To improve the perfor...

    Zidong Chen, **ongkuo Min in Digital Multimedia Communications (2023)

  8. No Access

    Chapter and Conference Paper

    A Lightweight Segmentation Network Based on Weak Supervision for COVID-19 Detection

    The Coronavirus Disease 2019 (COVID-19) outbreak in late 2019 threatens global health security. Computed tomography (CT) can provide richer information for the diagnosis and treatment of COVID-19. Unfortunatel...

    Fangfang Lu, Tianxiang Liu, Chi Tang, Zhihao Zhang in Digital Multimedia Communications (2023)

  9. No Access

    Chapter and Conference Paper

    A CNN-Based Quality Assessment Method for Pseudo 4K Contents

    Recently, there has been a growing interest in Ultra High-Definition (UHD) content, which brings a better visual experience for end-users. However, quite a few contents with 4K resolution are upscaled from Hig...

    Wei Lu, Wei Sun, Wenhan Zhu, **ongkuo Min in Digital TV and Wireless Multimedia Communi… (2022)

  10. No Access

    Chapter and Conference Paper

    Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

    The intelligent video surveillance system (IVSS) can automatically analyze the content of the surveillance image (SI) and reduce the burden of the manual labour. However, the SIs may suffer quality degradation...

    Wei Lu, Wei Sun, **ongkuo Min, Zicheng Zhang, Tao Wang in Artificial Intelligence (2022)

  11. No Access

    Chapter and Conference Paper

    Where are the Children with Autism Looking in Reality?

    Social difficulties are hallmarks of individuals with autism spectrum disorder (ASD), of which atypical visual attention is one of the most important characteristics. Learning and modeling the atypical visual ...

    **aoyu Ren, Huiyu Duan, **ongkuo Min, Yucheng Zhu, Wei Shen in Artificial Intelligence (2022)

  12. No Access

    Chapter and Conference Paper

    SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation

    Vision transformer is the new favorite paradigm in medical image segmentation since last year, which surpassed the traditional CNN counterparts in quantitative metrics. The significant advantage of ViTs is to ...

    Ziheng Wang, **ongkuo Min, Fangyu Shi in Medical Image Computing and Computer Assis… (2022)

  13. No Access

    Chapter and Conference Paper

    Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

    This paper presents a new vision Transformer, named Iwin Transformer, which is specifically designed for human-object interaction (HOI) detection, a detailed scene understanding task involving a sequential pro...

    Danyang Tu, **ongkuo Min, Huiyu Duan, Guodong Guo in Computer Vision – ECCV 2022 (2022)

  14. No Access

    Article

    Calculation of ophthalmic diagnostic parameters on a single eye image based on deep neural network

    It is necessary to manually measure many parameters of eyes when an ophthalmologist diagnoses, which is time consuming, unsanitary, subjective and unrepeatable. Those manually achieved parameters often risk cl...

    **angyang Zhu, Xuefei Song, **ongkuo Min, Huifang Zhou in Multimedia Tools and Applications (2022)

  15. No Access

    Article

    Fine localization and distortion resistant detection of multi-class barcode in complex environments

    Barcode, including one-dimensional (1D) barcode and two-dimensional (2D) barcode, can be seen almost anywhere in our lives. In many barcode-based mobile systems, different barcodes will appear simultaneously w...

    Jiahe Zhang, **ongkuo Min, Jun Jia, Zehao Zhu in Multimedia Tools and Applications (2021)

  16. No Access

    Chapter and Conference Paper

    Comparative Sharpness Evaluation for Mobile Phone Photos

    Mobile phones are the main source of a vast majority of digital photos nowadays. Photos taken by current mobile phones generally have fairly good visual quality without noticeable distortion. This progress ben...

    Qiang Lu, Guangtao Zhai, Yucheng Zhu, **ongkuo Min, Tao Wang in Artificial Intelligence (2021)

  17. No Access

    Article

    Perceptual image quality assessment: a survey

    Perceptual quality assessment plays a vital role in the visual communication systems owing to the existence of quality degradations introduced in various stages of visual signal acquisition, compression, trans...

    Guangtao Zhai, **ongkuo Min in Science China Information Sciences (2020)

  18. No Access

    Chapter and Conference Paper

    A Reading Assistant System for Blind People Based on Hand Gesture Recognition

    A reading assistant system for blind people based on hand gesture recognition is proposed in this paper. This system consists of seven modules: camera input module, page adjustment module, page information ret...

    Qiang Lu, Guangtao Zhai, **ongkuo Min in Digital TV and Wireless Multimedia Communi… (2020)

  19. Article

    Open Access

    Assessment of eye fatigue caused by head-mounted displays using eye-tracking

    Head-mounted displays (HMDs) and virtual reality (VR) have been frequently used in recent years, and a user’s experience and computation efficiency could be assessed by mounting eye-trackers. However, in addit...

    Yan Wang, Guangtao Zhai, Sichao Chen, **ongkuo Min in BioMedical Engineering OnLine (2019)

  20. No Access

    Chapter and Conference Paper

    Terahertz Security Image Quality Assessment by No-reference Model Observers

    To provide the possibility of develo** objective image quality assessment (IQA) algorithms for THz security images, we constructed the THz security image database (THSID) including a total of 181 THz securit...

    Menghan Hu, **ongkuo Min, Wenhan Zhu in Digital TV and Wireless Multimedia Communi… (2018)

previous disabled Page of 2