Search Results - Springer

Sort By Newest First Oldest First

Book and Conference Proceedings

Speech and Computer

25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II

Alexey Karpov, K. Samudravijaya, K. T. Deepak… in Lecture Notes in Computer Science (2023)
Book and Conference Proceedings

Speech and Computer

25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part I

Alexey Karpov, K. Samudravijaya, K. T. Deepak… in Lecture Notes in Computer Science (2023)
Book and Conference Proceedings

Advances in Signal Processing and Intelligent Recognition Systems

6th International Symposium, SIRS 2020, Chennai, India, October 14–17, 2020, Revised Selected Papers

Prof. Sabu M. Thampi, Sri Krishnan… in Communications in Computer and Information Science (2021)
Book and Conference Proceedings

Advances in Signal Processing and Intelligent Recognition Systems

5th International Symposium, SIRS 2019, Trivandrum, India, December 18–21, 2019, Revised Selected Papers

Prof. Sabu M. Thampi, Rajesh M. Hegde… in Communications in Computer and Information Science (2020)
Article

Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion

Keyword spotting in a continuous speech is a challenging problem and has relevance in applications like audio indexing and music retrieval. In this work, the problem of keyword spotting is addressed by utilizi...

Laxmi Pandey, Rajesh M. Hegde in Circuits, Systems, and Signal Processing (2019)
Article

Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification

This work explores the use of phoneme level information in cohort selection to improve the performance of a speaker verification system. In speaker verification, cohort is used in score normalization to get a bet...

Waquar Ahmad, Harish Karnick, Rajesh M. Hegde in Multimedia Tools and Applications (2018)
Article

Localization in Wireless Sensor Networks Using Visible Light in Non-Line of Sight Conditions

In this paper, a novel method to localize indoor wireless sensor nodes using visible light in non-line of sight (NLOS) condition is proposed. The proposed method is able to identify NLOS condition in a sensor ...

Om Jee Pandey, Richika Sharan, Rajesh M. Hegde in Wireless Personal Communications (2017)
Article

Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms

Acoustic surveillance is gaining importance given the pervasive nature of multimedia sensors being deployed in all environments. In this paper, novel probabilistic detection methods using audio histograms are ...

M. S. Shankar Reddy, Karan Nathwani… in Circuits, Systems, and Signal Processing (2015)
Article

On the Rapid Prototy** of a Portable Multi Media Acquisition System for Intelligent Meeting Capture

Emerging multi-modal signal processing applications require a sustained effort on the part of the developer to realize and deploy an application. A rapid prototy** platform will reduce the effort, cost, and ...

Pranjal Agrawal, Aseem Kushwah, Lalan Kumar… in Journal of Signal Processing Systems (2014)
Chapter and Conference Paper

Cosine Distance Metric Learning for Speaker Verification Using Large Margin Nearest Neighbor Method

In this paper, a novel cosine similarity metric learning based on large margin nearest neighborhood (LMNN) is proposed for an i-vector based speaker verification system. Generally, in an i-vector based speaker...

Waquar Ahmad, Harish Karnick… in Advances in Multimedia Information Process… (2014)
Article

An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment

In this paper, an adaptive framework for audio retrieval in live teleconferencing environments with multiple participants is proposed. The framework uses a non reference anchor array (NRA) to capture the inter...

Karan Nathwani, Arpit Shukla, Shubham Khunteta… in Journal of Signal Processing Systems (2014)
Chapter and Conference Paper

An Unsupervised Approach to Multiple Speaker Tracking for Robust Multimedia Retrieval

Tagging multi media data based on who is speaking at what time, is important especially in the intelligent retrieval of recordings of meetings and conferences. In this paper an unsupervised approach to trackin...

M. Phanikumar, Lalan Kumar, Rajesh M. Hegde in The Era of Interactive Media (2013)
Article

Open Access

Significance of parametric spectral ratio methods in detection and recognition of whispered speech

In this article the significance of a new parametric spectral ratio method that can be used to detect whispered speech segments within normally phonated speech is described. Adaptation methods based on the max...

Arpit Mathur, Shankar M Reddy… in EURASIP Journal on Advances in Signal Proc… (2012)

Download PDF (4322 KB) View Article
Chapter and Conference Paper

An Adaptive Non Reference Anchor Array Framework for Distant Speech Recognition

Distant speech recognition over microphone arrays is challenging, especially in multi source environments. In this paper, a non reference anchor array (NRA) framework for distant speech recognition is proposed...

Arpit Shukla, Karan Nathwani… in Advances in Multimedia Information Process… (2012)
Chapter and Conference Paper

Distant Speaker Verification Using a Combined Family of MVDR Estimates

Distant speaker verification involves explicit spectral estimation of speech acquired over microphone arrays. The choice of the appropriate set of microphones is important here. In this paper we describe an im...

Bhargava Manevarte, Waquar Ahmad… in Advances in Multimedia Information Process… (2012)
Article

Open Access

On the Soft Fusion of Probability Mass Functions for Multimodal Speech Processing

Multimodal speech processing has been a subject of investigation to increase robustness of unimodal speech processing systems. Hard fusion of acoustic and visual speech is generally used for improving the accu...

D. Kumar, P. Vimal, Rajesh M. Hegde in EURASIP Journal on Advances in Signal Processing (2011)

Download PDF (6078 KB) View Article
Article

Open Access

Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing

This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC. The conventional group delay fun...

Rajesh M. Hegde, Hema A. Murthy… in EURASIP Journal on Audio, Speech, and Musi… (2006)

Download PDF (1858 KB) View Article
Chapter and Conference Paper

Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

Speakers are generally identified by using features derived from the Fourier transform magnitude. The Modified group delay feature(MODGDF) derived from the Fourier transform phase has been used effectively for...

Rajesh M. Hegde, Hema A. Murthy in Neural Information Processing (2004)

18 Result(s)

Speech and Computer

Speech and Computer

Advances in Signal Processing and Intelligent Recognition Systems

Advances in Signal Processing and Intelligent Recognition Systems

Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion

Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification

Localization in Wireless Sensor Networks Using Visible Light in Non-Line of Sight Conditions

Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms

On the Rapid Prototy** of a Portable Multi Media Acquisition System for Intelligent Meeting Capture

Cosine Distance Metric Learning for Speaker Verification Using Large Margin Nearest Neighbor Method

An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment

An Unsupervised Approach to Multiple Speaker Tracking for Robust Multimedia Retrieval

Significance of parametric spectral ratio methods in detection and recognition of whispered speech

An Adaptive Non Reference Anchor Array Framework for Distant Speech Recognition

Distant Speaker Verification Using a Combined Family of MVDR Estimates

On the Soft Fusion of Probability Mass Functions for Multimodal Speech Processing

Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing

Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

Our Content

Other Sites

Help & Contacts