Search Page | SpringerLink

A One-class Model for Voice Replay Attack Detection

Replay attack poses a serious security concern for automatic speaker verification systems. Most of the existing replay detection methods cast the...

**ngliang Cheng, Lantian Li, ... Thomas Fang Zheng in Handbook of Biometric Anti-Spoofing

Chapter 2023

INTELLIBOT - Intelligent Voice Assisted Chatbot with Sentiment Analysis, COVID Dashboard and Offensive Text Detection

Chatbot has become an essential crowd puller in the world today and is used in various domains and professions. With increasing technologies and...

Gadiparthy Harika Sai, Meghna Manoj Nair, ... Shivani in Cyber Warfare, Security and Space Research

Conference paper 2022

Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion

This article focuses on develo** a system for high-quality synthesized and converted speech by addressing three fundamental principles. Although...

Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh in Multimedia Tools and Applications

Article Open access 10 September 2020

Pardon? An Overview of the Current State and Requirements of Voice User Interfaces for Blind and Visually Impaired Users

People with special needs like blind and visually impaired (BVI) people can particularly benefit from using voice assistants providing spoken...

Christina Oumard, Julian Kreimeier, Timo Götzelmann in Computers Hel** People with Special Needs

Conference paper 2022

Beyond Text-to-Speech Synthesis

In this chapter, we briefly introduce other speech tasks that are related to TTS and discuss their relationships. The closest task to text-to-speech...

Xu Tan in Neural Text-to-Speech Synthesis

Chapter 2023

A Robust Framework for High-Quality Voice Conversion with Conditional Generative Adversarial Network

The deep neural network (DNNs) has been applied in voice conversion (VC) system successfully. DNN shows its effectiveness especially with a large...

Liyang Chen, Yingxue Wang, ... Haiyong **e in Artificial Intelligence and Security

Conference paper 2020

Audio verification in forensic investigation using light deep neural network

Recently people have difficulties distinguishing real speech from computer-generated speech so that the synthetic voice is getting closer to a...

Noor D. AL-Shakarchy, Zahraa Najm Abdullah, ... Zahraa A. Harjan in International Journal of Information Technology

Article 20 April 2024

Hands in Harmony: Empowering Communication Through Translation

Over the years, sign language has developed to be a remarkable advancement. Unfortunately, there are specific effects associated with this language....

C. Manikandan, B. Keerthana, ... Grandhi Sirisha in Speech and Language Technologies for Low-Resource Languages

Conference paper 2024

Voice liveness detection under feature fusion and cross-environment scenario

Detecting playback spoofing attacks in speaker verification system is a big challenge. Recent studies on ASVspoof challenges show that replay attacks...

Sanjay Garg, Sapan H Mankad in Multimedia Tools and Applications

Article 19 July 2020

Audio-visual speech synthesis using vision transformer–enhanced autoencoders with ensemble of loss functions

Abstract

Audio-visual speech synthesis (AVSS) has garnered attention in recent years for its utility in the realm of audio-visual learning. AVSS...

Subhayu Ghosh, Snehashis Sarkar, ... Nanda Dulal Jana in Applied Intelligence

Article 27 March 2024

Transformation of Voice Signals to Spatial Domain for Code Optimization in Digital Image Processing

The paper is intended to transform the voice-signal from the frequency domain into a spatial domain in form of grayscale image and applied the image...

Akram Alsubari, Ghanshyam D. Ramteke, Rakesh J. Ramteke in Recent Trends in Image Processing and Pattern Recognition

Conference paper 2021

Comparison of the effectiveness of cepstral coefficients for Russian speech synthesis detection

Modern speech synthesis technologies can be used to deceive voice authentication systems, phone scams, or discredit public figures. An urgent task is...

Dmitry Efanov, Pavel Aleksandrov, Ilia Mironov in Journal of Computer Virology and Hacking Techniques

Article 13 August 2023

Conv-transformer-based Jaya Gazelle optimization for speech intelligibility with aphasia

Individual speech impairment damages a specific region of the brain, which is the main cause of aphasia. The goal is to develop a method, namely Jaya...

Ranjith Rajendran, Arumugam Chandrasekar in Signal, Image and Video Processing

Article 26 December 2023

Raspberry Pi-based robust speech command recognition for normal and hearing-impaired (HI)

The speech command identification system has become a necessary tool to transcribe speech into text, for performing hands-free control of devices and...

A. Revathi, N. Sasikaladevi, ... N. Raju in Multimedia Tools and Applications

Article 14 November 2023

Anti Noise Speech Recognition Based on Deep Learning in Wireless Communication Networks

As a new high-tech industry, the application of speech recognition technology is becoming more and more competitive, with a wide range of application...

Yanning Zhang, Lei Ma, ... **gyu Li in Advanced Hybrid Information Processing

Conference paper 2024

Conversion of NAM to Normal Speech Based on Stochastic Binary Cat Swarm Optimization Algorithm

Speech recognition plays an important role in a variety of applications for mobile communication. User communication devices for contact necessitate...

T. Rajesh Kumar, G. N. Balaji, ... G. R. Suresh in Distributed Computing and Optimization Techniques

Conference paper 2022

Research on Quantitative Models and Correlation of QoE Testing for Vehiclar Voice Cloud Services

Vehicle voice cloud service can help drivers reduce the dependence on vehicle operation and improve driving safety. In the related test of automobile...

Yuxin Li, Kailiang Zhang, ... ** Cui in Simulation Tools and Techniques

Conference paper 2021

Development and assessment of MyAccessible Math: promoting self-learning for students with vision impairment

Human–computer interaction (HCI) research aims to make systems versatile, easy to use, and accessible for most people. The abundant information on...

Abhishek Jariwala, Fatemeh Jamshidi, ... Richard Chapman in Universal Access in the Information Society

Article 23 November 2023

Spoofing Detection for Speaker Verification with Glottal Flow and 1D Pure Convolutional Networks

Automatic Speaker Verification Systems are subject to attacks, these attacks aim to fool the system into accepting as valid the identity of a speaker...

Antonio Camarena-Ibarrola, Karina Figueroa, Axel Plancarte Curiel in Pattern Recognition

Conference paper 2023

NAO vs. Pepper: Speech Recognition Performance Assessment

Social robots are becoming increasingly popular due to their communication capabilities in various fields, such as schools, hospitals and other...

Akshara Pande, Deepti Mishra, Bhavana Nachenahalli Bhuthegowda in Human-Computer Interaction

Conference paper 2024

Search

Filters

Search Results

Search

Navigation