Skip to main content

and
  1. No Access

    Book and Conference Proceedings

    Speech and Computer

    25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II

    Alexey Karpov, K. Samudravijaya, K. T. Deepak in Lecture Notes in Computer Science (2023)

  2. No Access

    Book and Conference Proceedings

    Speech and Computer

    25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part I

    Alexey Karpov, K. Samudravijaya, K. T. Deepak in Lecture Notes in Computer Science (2023)

  3. No Access

    Book and Conference Proceedings

    Advances in Signal Processing and Intelligent Recognition Systems

    6th International Symposium, SIRS 2020, Chennai, India, October 14–17, 2020, Revised Selected Papers

    Prof. Sabu M. Thampi, Sri Krishnan in Communications in Computer and Information Science (2021)

  4. No Access

    Book and Conference Proceedings

    Advances in Signal Processing and Intelligent Recognition Systems

    5th International Symposium, SIRS 2019, Trivandrum, India, December 18–21, 2019, Revised Selected Papers

    Prof. Sabu M. Thampi, Rajesh M. Hegde in Communications in Computer and Information Science (2020)

  5. No Access

    Article

    Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion

    Keyword spotting in a continuous speech is a challenging problem and has relevance in applications like audio indexing and music retrieval. In this work, the problem of keyword spotting is addressed by utilizi...

    Laxmi Pandey, Rajesh M. Hegde in Circuits, Systems, and Signal Processing (2019)

  6. No Access

    Article

    Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification

    This work explores the use of phoneme level information in cohort selection to improve the performance of a speaker verification system. In speaker verification, cohort is used in score normalization to get a bet...

    Waquar Ahmad, Harish Karnick, Rajesh M. Hegde in Multimedia Tools and Applications (2018)

  7. No Access

    Article

    Localization in Wireless Sensor Networks Using Visible Light in Non-Line of Sight Conditions

    In this paper, a novel method to localize indoor wireless sensor nodes using visible light in non-line of sight (NLOS) condition is proposed. The proposed method is able to identify NLOS condition in a sensor ...

    Om Jee Pandey, Richika Sharan, Rajesh M. Hegde in Wireless Personal Communications (2017)

  8. No Access

    Article

    Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms

    Acoustic surveillance is gaining importance given the pervasive nature of multimedia sensors being deployed in all environments. In this paper, novel probabilistic detection methods using audio histograms are ...

    M. S. Shankar Reddy, Karan Nathwani in Circuits, Systems, and Signal Processing (2015)

  9. No Access

    Article

    On the Rapid Prototy** of a Portable Multi Media Acquisition System for Intelligent Meeting Capture

    Emerging multi-modal signal processing applications require a sustained effort on the part of the developer to realize and deploy an application. A rapid prototy** platform will reduce the effort, cost, and ...

    Pranjal Agrawal, Aseem Kushwah, Lalan Kumar in Journal of Signal Processing Systems (2014)

  10. No Access

    Chapter and Conference Paper

    Cosine Distance Metric Learning for Speaker Verification Using Large Margin Nearest Neighbor Method

    In this paper, a novel cosine similarity metric learning based on large margin nearest neighborhood (LMNN) is proposed for an i-vector based speaker verification system. Generally, in an i-vector based speaker...

    Waquar Ahmad, Harish Karnick in Advances in Multimedia Information Process… (2014)

  11. No Access

    Article

    An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment

    In this paper, an adaptive framework for audio retrieval in live teleconferencing environments with multiple participants is proposed. The framework uses a non reference anchor array (NRA) to capture the inter...

    Karan Nathwani, Arpit Shukla, Shubham Khunteta in Journal of Signal Processing Systems (2014)

  12. No Access

    Chapter and Conference Paper

    An Unsupervised Approach to Multiple Speaker Tracking for Robust Multimedia Retrieval

    Tagging multi media data based on who is speaking at what time, is important especially in the intelligent retrieval of recordings of meetings and conferences. In this paper an unsupervised approach to trackin...

    M. Phanikumar, Lalan Kumar, Rajesh M. Hegde in The Era of Interactive Media (2013)

  13. Article

    Open Access

    Significance of parametric spectral ratio methods in detection and recognition of whispered speech

    In this article the significance of a new parametric spectral ratio method that can be used to detect whispered speech segments within normally phonated speech is described. Adaptation methods based on the max...

    Arpit Mathur, Shankar M Reddy in EURASIP Journal on Advances in Signal Proc… (2012)

  14. No Access

    Chapter and Conference Paper

    An Adaptive Non Reference Anchor Array Framework for Distant Speech Recognition

    Distant speech recognition over microphone arrays is challenging, especially in multi source environments. In this paper, a non reference anchor array (NRA) framework for distant speech recognition is proposed...

    Arpit Shukla, Karan Nathwani in Advances in Multimedia Information Process… (2012)

  15. No Access

    Chapter and Conference Paper

    Distant Speaker Verification Using a Combined Family of MVDR Estimates

    Distant speaker verification involves explicit spectral estimation of speech acquired over microphone arrays. The choice of the appropriate set of microphones is important here. In this paper we describe an im...

    Bhargava Manevarte, Waquar Ahmad in Advances in Multimedia Information Process… (2012)

  16. Article

    Open Access

    On the Soft Fusion of Probability Mass Functions for Multimodal Speech Processing

    Multimodal speech processing has been a subject of investigation to increase robustness of unimodal speech processing systems. Hard fusion of acoustic and visual speech is generally used for improving the accu...

    D. Kumar, P. Vimal, Rajesh M. Hegde in EURASIP Journal on Advances in Signal Processing (2011)

  17. Article

    Open Access

    Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing

    This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC. The conventional group delay fun...

    Rajesh M. Hegde, Hema A. Murthy in EURASIP Journal on Audio, Speech, and Musi… (2006)

  18. No Access

    Chapter and Conference Paper

    Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

    Speakers are generally identified by using features derived from the Fourier transform magnitude. The Modified group delay feature(MODGDF) derived from the Fourier transform phase has been used effectively for...

    Rajesh M. Hegde, Hema A. Murthy in Neural Information Processing (2004)