Skip to main content

and
  1. No Access

    Chapter and Conference Paper

    Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

    Speakers are generally identified by using features derived from the Fourier transform magnitude. The Modified group delay feature(MODGDF) derived from the Fourier transform phase has been used effectively for...

    Rajesh M. Hegde, Hema A. Murthy in Neural Information Processing (2004)

  2. No Access

    Chapter and Conference Paper

    An Adaptive Non Reference Anchor Array Framework for Distant Speech Recognition

    Distant speech recognition over microphone arrays is challenging, especially in multi source environments. In this paper, a non reference anchor array (NRA) framework for distant speech recognition is proposed...

    Arpit Shukla, Karan Nathwani in Advances in Multimedia Information Process… (2012)

  3. No Access

    Chapter and Conference Paper

    Distant Speaker Verification Using a Combined Family of MVDR Estimates

    Distant speaker verification involves explicit spectral estimation of speech acquired over microphone arrays. The choice of the appropriate set of microphones is important here. In this paper we describe an im...

    Bhargava Manevarte, Waquar Ahmad in Advances in Multimedia Information Process… (2012)

  4. No Access

    Chapter and Conference Paper

    An Unsupervised Approach to Multiple Speaker Tracking for Robust Multimedia Retrieval

    Tagging multi media data based on who is speaking at what time, is important especially in the intelligent retrieval of recordings of meetings and conferences. In this paper an unsupervised approach to trackin...

    M. Phanikumar, Lalan Kumar, Rajesh M. Hegde in The Era of Interactive Media (2013)

  5. No Access

    Chapter and Conference Paper

    Cosine Distance Metric Learning for Speaker Verification Using Large Margin Nearest Neighbor Method

    In this paper, a novel cosine similarity metric learning based on large margin nearest neighborhood (LMNN) is proposed for an i-vector based speaker verification system. Generally, in an i-vector based speaker...

    Waquar Ahmad, Harish Karnick in Advances in Multimedia Information Process… (2014)

  6. No Access

    Article

    Client-wise cohort set selection by combining speaker- and phoneme-specific I-vectors for speaker verification

    This work explores the use of phoneme level information in cohort selection to improve the performance of a speaker verification system. In speaker verification, cohort is used in score normalization to get a bet...

    Waquar Ahmad, Harish Karnick, Rajesh M. Hegde in Multimedia Tools and Applications (2018)

  7. No Access

    Book and Conference Proceedings

    Advances in Signal Processing and Intelligent Recognition Systems

    5th International Symposium, SIRS 2019, Trivandrum, India, December 18–21, 2019, Revised Selected Papers

    Prof. Sabu M. Thampi, Rajesh M. Hegde in Communications in Computer and Information Science (2020)

  8. No Access

    Book and Conference Proceedings

    Advances in Signal Processing and Intelligent Recognition Systems

    6th International Symposium, SIRS 2020, Chennai, India, October 14–17, 2020, Revised Selected Papers

    Prof. Sabu M. Thampi, Sri Krishnan in Communications in Computer and Information Science (2021)

  9. No Access

    Book and Conference Proceedings

    Speech and Computer

    25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part II

    Alexey Karpov, K. Samudravijaya, K. T. Deepak in Lecture Notes in Computer Science (2023)

  10. No Access

    Book and Conference Proceedings

    Speech and Computer

    25th International Conference, SPECOM 2023, Dharwad, India, November 29 – December 2, 2023, Proceedings, Part I

    Alexey Karpov, K. Samudravijaya, K. T. Deepak in Lecture Notes in Computer Science (2023)