Speech and Computer
25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II
Book and Conference Proceedings
25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part I
Book and Conference Proceedings
6th International Symposium, SIRS 2020, Chennai, India, October 14–17, 2020, Revised Selected Papers
Book and Conference Proceedings
5th International Symposium, SIRS 2019, Trivandrum, India, December 18–21, 2019, Revised Selected Papers
Article
Keyword spotting in continuous speech is a challenging problem and has relevance in applications like audio indexing and music retrieval. In this work, the problem of keyword spotting is addressed by utilizi...
Article
This work explores the use of phoneme level information in cohort selection to improve the performance of a speaker verification system. In speaker verification, cohort is used in score normalization to get a bet...
Article
In this paper, a novel method to localize indoor wireless sensor nodes using visible light under non-line-of-sight (NLOS) conditions is proposed. The proposed method is able to identify NLOS condition in a sensor ...
Article
Acoustic surveillance is gaining importance given the pervasive nature of multimedia sensors being deployed in all environments. In this paper, novel probabilistic detection methods using audio histograms are ...
Article
Emerging multi-modal signal processing applications require a sustained effort on the part of the developer to realize and deploy an application. A rapid prototyping platform will reduce the effort, cost, and ...
Chapter and Conference Paper
In this paper, a novel cosine similarity metric learning based on large margin nearest neighborhood (LMNN) is proposed for an i-vector based speaker verification system. Generally, in an i-vector based speaker...
Article
In this paper, an adaptive framework for audio retrieval in live teleconferencing environments with multiple participants is proposed. The framework uses a non reference anchor array (NRA) to capture the inter...
Chapter and Conference Paper
Tagging multimedia data based on who is speaking at what time is important, especially in the intelligent retrieval of recordings of meetings and conferences. In this paper an unsupervised approach to trackin...
Article
In this article the significance of a new parametric spectral ratio method that can be used to detect whispered speech segments within normally phonated speech is described. Adaptation methods based on the max...
Chapter and Conference Paper
Distant speech recognition over microphone arrays is challenging, especially in multi source environments. In this paper, a non reference anchor array (NRA) framework for distant speech recognition is proposed...
Chapter and Conference Paper
Distant speaker verification involves explicit spectral estimation of speech acquired over microphone arrays. The choice of the appropriate set of microphones is important here. In this paper we describe an im...
Article
Multimodal speech processing has been a subject of investigation to increase robustness of unimodal speech processing systems. Hard fusion of acoustic and visual speech is generally used for improving the accu...
Article
This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC. The conventional group delay fun...
Chapter and Conference Paper
Speakers are generally identified by using features derived from the Fourier transform magnitude. The modified group delay feature (MODGDF) derived from the Fourier transform phase has been used effectively for...