Search
Search Results
-
A One-class Model for Voice Replay Attack Detection
Replay attack poses a serious security concern for automatic speaker verification systems. Most of the existing replay detection methods cast the... -
INTELLIBOT - Intelligent Voice Assisted Chatbot with Sentiment Analysis, COVID Dashboard and Offensive Text Detection
Chatbot has become an essential crowd puller in the world today and is used in various domains and professions. With increasing technologies and... -
Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion
This article focuses on develo** a system for high-quality synthesized and converted speech by addressing three fundamental principles. Although...
-
Pardon? An Overview of the Current State and Requirements of Voice User Interfaces for Blind and Visually Impaired Users
People with special needs like blind and visually impaired (BVI) people can particularly benefit from using voice assistants providing spoken... -
Beyond Text-to-Speech Synthesis
In this chapter, we briefly introduce other speech tasks that are related to TTS and discuss their relationships. The closest task to text-to-speech... -
A Robust Framework for High-Quality Voice Conversion with Conditional Generative Adversarial Network
The deep neural network (DNNs) has been applied in voice conversion (VC) system successfully. DNN shows its effectiveness especially with a large... -
Audio verification in forensic investigation using light deep neural network
Recently people have difficulties distinguishing real speech from computer-generated speech so that the synthetic voice is getting closer to a...
-
Hands in Harmony: Empowering Communication Through Translation
Over the years, sign language has developed to be a remarkable advancement. Unfortunately, there are specific effects associated with this language.... -
Voice liveness detection under feature fusion and cross-environment scenario
Detecting playback spoofing attacks in speaker verification system is a big challenge. Recent studies on ASVspoof challenges show that replay attacks...
-
Audio-visual speech synthesis using vision transformer–enhanced autoencoders with ensemble of loss functions
AbstractAudio-visual speech synthesis (AVSS) has garnered attention in recent years for its utility in the realm of audio-visual learning. AVSS...
-
Transformation of Voice Signals to Spatial Domain for Code Optimization in Digital Image Processing
The paper is intended to transform the voice-signal from the frequency domain into a spatial domain in form of grayscale image and applied the image... -
Comparison of the effectiveness of cepstral coefficients for Russian speech synthesis detection
Modern speech synthesis technologies can be used to deceive voice authentication systems, phone scams, or discredit public figures. An urgent task is...
-
Conv-transformer-based Jaya Gazelle optimization for speech intelligibility with aphasia
Individual speech impairment damages a specific region of the brain, which is the main cause of aphasia. The goal is to develop a method, namely Jaya...
-
Raspberry Pi-based robust speech command recognition for normal and hearing-impaired (HI)
The speech command identification system has become a necessary tool to transcribe speech into text, for performing hands-free control of devices and...
-
Anti Noise Speech Recognition Based on Deep Learning in Wireless Communication Networks
As a new high-tech industry, the application of speech recognition technology is becoming more and more competitive, with a wide range of application... -
Conversion of NAM to Normal Speech Based on Stochastic Binary Cat Swarm Optimization Algorithm
Speech recognition plays an important role in a variety of applications for mobile communication. User communication devices for contact necessitate... -
Research on Quantitative Models and Correlation of QoE Testing for Vehiclar Voice Cloud Services
Vehicle voice cloud service can help drivers reduce the dependence on vehicle operation and improve driving safety. In the related test of automobile... -
Development and assessment of MyAccessible Math: promoting self-learning for students with vision impairment
Human–computer interaction (HCI) research aims to make systems versatile, easy to use, and accessible for most people. The abundant information on...
-
Spoofing Detection for Speaker Verification with Glottal Flow and 1D Pure Convolutional Networks
Automatic Speaker Verification Systems are subject to attacks, these attacks aim to fool the system into accepting as valid the identity of a speaker... -
NAO vs. Pepper: Speech Recognition Performance Assessment
Social robots are becoming increasingly popular due to their communication capabilities in various fields, such as schools, hospitals and other...