Search Results - Springer

Sort By Newest First Oldest First

Article

The image and ground truth dataset of Mongolian movable-type newspapers for text recognition

OCR approaches have been widely advanced in recent years thanks to the resurgence of deep learning. However, to the best of our knowledge, there is little work on Mongolian movable-type document recognition. O...

Min Lu, Feilong Bao, Hui Zhang, Guanglai Gao in International Journal on Document Analysis… (2024)
Chapter and Conference Paper

MnTTS2: An Open-Source Multi-speaker Mongolian Text-to-Speech Synthesis Dataset

Text-to-Speech (TTS) synthesis for low-resource languages is an attractive research issue in academia and industry nowadays. Mongolian is the official language of the Inner Mongolia Autonomous Region and a rep...

Kailin Liang, Bin Liu, Yifan Hu, Rui Liu, Feilong Bao… in Man-Machine Speech Communication (2023)
Chapter and Conference Paper

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion

Cyrillic and Traditional Mongolian are the two main members of the Mongolian writing system. The Cyrillic-Traditional Mongolian Bidirectional Conversion (CTMBC) task includes two conversion processes, includin...

Muhan Na, Rui Liu, Feilong Bao, Guanglai Gao in Neural Information Processing (2023)
Chapter and Conference Paper

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been pro...

Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang… in Computer Vision – ECCV 2022 Workshops (2023)
Chapter and Conference Paper

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

There has been significant progress in content-based image retrieval with the development of convolutional neural networks and visual transformers. However, there are semantic gaps between high-level semantic ...

Qing Zhang, Feilong Bao, **angdong Su… in Artificial Neural Networks and Machine Lea… (2022)
Chapter and Conference Paper

Panoptic-DLA: Document Layout Analysis of Historical Newspapers Based on Proposal-Free Panoptic Segmentation Model

In this paper, we introduce a novel historical newspaper layout analysis model named Panoptic-DLA. Different from the previous works regarding layout analysis as a separate object detection or semantic segment...

Min Lu, Feilong Bao, Guanglai Gao in Knowledge Science, Engineering and Management (2021)
Article

Impulse Noise Detector Performance Measure Based on Intensity Volume

An accurate detector performance evaluation method provides a fair comparison platform and can also support in parameter optimization for existing Impulse noise detectors in the applications of medical imaging...

Long Bao, Karen Panetta, Sos Agaian in Journal of Signal Processing Systems (2020)
Chapter and Conference Paper

Infrared Small Target Recognition with Improved Particle Filtering Based on Feature Fusion

Aiming at the problem of tracking weak targets in different scenarios, an improved particle tracking method is proposed. This paper firstly uses background prediction and extracts the gray and motion features ...

Qian Feng, Dong**g Cao, Shulong Bao, Lu Liu in Image and Graphics Technologies and Applic… (2020)
Chapter and Conference Paper

Morphological Knowledge Guided Mongolian Constituent Parsing

Mongolian constituent parsing is a challenging task due to lack of hand-annotated corpus and rich morphological varying. This paper takes a self-attention neural network to deal with Mongolian constituent pars...

Na Liu, **angdong Su, Guanglai Gao, Feilong Bao, Min Lu in Neural Information Processing (2019)
Chapter and Conference Paper

Micro Heater with Low Temperature Coefficient of Resistance for ICF Target

A micro heater with low temperature coefficient of resistance (TCR) at liquid hydrogen temperature was designed and fabricated by micro fabrication technology. The NiCr heater annealed in N₂ at 250 °C for 9 min a...

Bin Xu, Zhibiao Li, Gang Tang, Yulong Bao, Huang Wang in Human Centered Computing (2019)
Chapter and Conference Paper

Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features

In the context of text-to-speech systems (TTS), a front-end is a critical step for extracting linguistic features from given input text. In this paper, we propose a Mongolian TTS front-end which joint training...

Rui Liu, Feilong Bao, Guanglai Gao in Neural Information Processing (2019)
Chapter and Conference Paper

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning

Extracting textual information embodied in natural scenes is a very challenge task, and has a great influence on the performance of the following text recognition and understanding. It can be seen as an image...

Huali Xu, **angdong Su, Tongyang Liu, Pengcheng Guo… in Neural Information Processing (2019)
Chapter and Conference Paper

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models

Mongolian word segmentation is splitting the Mongolian words into roots and suffixes. It plays an important role in Mongolian related natural language processing tasks. To improve performance and avoid the ted...

Na Liu, **angdong Su, Guanglai Gao, Feilong Bao in Neural Information Processing (2018)
Chapter and Conference Paper

Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism

Phrase break prediction is the first and most important component in increasing naturalness and intelligibility of text-to-speech (TTS) systems. Most works rely on language specific resources, large annotated ...

Rui Liu, FeiLong Bao, Guanglai Gao… in PRICAI 2018: Trends in Artificial Intellig… (2018)
Article

Single sample per person face recognition with KPCANet and a weighted voting scheme

Most current methods of facial recognition rely on the condition of having multiple samples per person available for feature extraction. In practical applications, however, only one sample may be available for...

Chunhui Ding, Tianlong Bao, Saleem Karmoshi, Ming Zhu in Signal, Image and Video Processing (2017)
Article

A knowledge-based recognition system for historical Mongolian documents

This paper proposes a knowledge-based system to recognize historical Mongolian documents in which the words exhibit remarkable variation and character overlap**. According to the characteristics of Mongolian...

**angdong Su, Guanglai Gao, Hongxi Wei… in International Journal on Document Analysis… (2016)
Chapter and Conference Paper

Video Anomaly Detection Based on Adaptive Multiple Auto-Encoders

Anomaly detection in surveillance videos is a challenging problem in computer vision community. In this paper, a novel unsupervised learning framework is proposed to detect and localize abnormal events in real...

Tianlong Bao, Chunhui Ding, Saleem Karmoshi, Ming Zhu in Advances in Visual Computing (2016)
Chapter and Conference Paper

Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies

This paper describes recent work on integrating multiple strategies to improve the performance of the Mongolian historical document recognition system which utilize the segmentation-based scheme. We analyze th...

**angdong Su, Guanglai Gao, Hongxi Wei, Feilong Bao in Neural Information Processing (2015)
Chapter and Conference Paper

Character Segmentation for Classical Mongolian Words in Historical Documents

There are many classical Mongolian historical documents which are reserved in image form, and as a result it is inconvenient for us to search and mining the desired content. In order to facilitate the word rec...

**angdong Su, Guanglai Gao, Weihua Wang, Feilong Bao, Hongxi Wei in Pattern Recognition (2014)
Chapter and Conference Paper

A Convergent Incoherent Dictionary Learning Algorithm for Sparse Coding

Recently, sparse coding has been widely used in many applications ranging from image recovery to pattern recognition. The low mutual coherence of a dictionary is an important property that ensures the optimali...

Chenglong Bao, Yuhui Quan, Hui Ji in Computer Vision – ECCV 2014 (2014)

Download PDF (430 KB)

22 Result(s)

The image and ground truth dataset of Mongolian movable-type newspapers for text recognition

MnTTS2: An Open-Source Multi-speaker Mongolian Text-to-Speech Synthesis Dataset

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

Panoptic-DLA: Document Layout Analysis of Historical Newspapers Based on Proposal-Free Panoptic Segmentation Model

Impulse Noise Detector Performance Measure Based on Intensity Volume

Infrared Small Target Recognition with Improved Particle Filtering Based on Feature Fusion

Morphological Knowledge Guided Mongolian Constituent Parsing

Micro Heater with Low Temperature Coefficient of Resistance for ICF Target

Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models

Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism

Single sample per person face recognition with KPCANet and a weighted voting scheme

A knowledge-based recognition system for historical Mongolian documents

Video Anomaly Detection Based on Adaptive Multiple Auto-Encoders

Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies

Character Segmentation for Classical Mongolian Words in Historical Documents

A Convergent Incoherent Dictionary Learning Algorithm for Sparse Coding

Our Content

Other Sites

Help & Contacts