Search
Search Results
-
Finnish parliament ASR corpus
Public sources like parliament meeting recordings and transcripts provide ever-growing material for the training and evaluation of automatic speech...
-
Jira: a Central Kurdish speech recognition system, designing and building speech corpus and pronunciation lexicon
This paper introduces the first large vocabulary speech recognition system (LVSR) for the Central Kurdish language, named Jira. The Kurdish language...
-
Speech Recognition and Text-to-Speech Synthesis
Automatic speech recognition (ASR) and text-to-speech (TTS) synthesis are two very important modules in human-computer communication. With the... -
Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks
The Donate Speech campaign has so far succeeded in gathering approximately 3600 h of ordinary, colloquial Finnish speech into the Lahjoita puhetta ( Do...
-
Speech Emotion Recognition Using Spontaneous Children’s Corpus
Automatic recognition of human emotions is a relatively new field and is attracting significant attention in research and development areas because... -
Multilingual Speech Emotion Recognition on Japanese, English, and German
The current study focuses on human emotion recognition based on speech, and particularly on multilingual speech emotion recognition using Japanese,... -
Lexical modeling for the development of Amharic automatic speech recognition systems
Amharic is the second most spoken Semitic language after Arabic. It has its own syllabary writing system, each character representing a consonant and...
-
A Study on Far-Field Emotion Recognition Based on Deep Convolutional Neural Networks
Automatic recognition of human emotions is a relatively new field, and is attracting significant attention in research and development areas because... -
Investigating the effects of gender, dialect, and training size on the performance of Arabic speech recognition
Research in Arabic automatic speech recognition (ASR) is constrained by datasets of limited size, and of highly variable content and quality....
-
Unparalleled sarcasm: a framework of parallel deep LSTMs with cross activation functions towards detection and generation of sarcastic statements
Sarcasm is a modest kind of mockingly expressing one’s own thoughts. With the advent of social networking communication, new routes of sociability...
-
DZDC12: a new multipurpose parallel Algerian Arabizi–French code-switched corpus
Algeria’s socio-linguistic situation is known as a complex phenomenon involving several historical, cultural and technological factors. However,...
-
Modeling under-resourced languages for speech recognition
One particular problem in large vocabulary continuous speech recognition for low-resourced languages is finding relevant training data for the...
-
-
Vowelless Syllables in Moroccan Arabic
In this chapter we explore a modified version of our analysis of syllable structure in MA. In this new version, syllables with a schwa would all be... -