Skip to main content

and
  1. No Access

    Article

    Video based exercise recognition and correct pose detection

    Human pose estimation has gained significant attention from researchers of the present era. Personal exercise sessions can be monitored and supervised with the help of pose recognition. Existing work on exerci...

    Tushar Rangari, Sudhanshu Kumar, Partha Pratim Roy in Multimedia Tools and Applications (2022)

  2. No Access

    Article

    3D word spotting using leap motion sensor

    Leap motion sensor provides a new way of interaction with computers or mobile devices. With this sensor, users can write in air by moving palm or finger, thus, avoiding traditional pen and paper for writing. T...

    Partha Pratim Roy, Pradeep Kumar, Shweta Patidar in Multimedia Tools and Applications (2021)

  3. No Access

    Article

    Logo detection using weakly supervised saliency map

    Box level annotation of a large number of logo images for training purpose of typical deep learning architecture is highly challenging. Thus, a method that can detect the logo with the help of training to remo...

    Gautam Kumar, Prateek Keserwani, Partha Pratim Roy in Multimedia Tools and Applications (2021)

  4. No Access

    Article

    Fast Griffin Lim based waveform generation strategy for text-to-speech synthesis

    The performance of text-to-speech (TTS) systems heavily depends on spectrogram to waveform generation, also known as the speech reconstruction phase. The time required for the same is known as synthesis delay....

    Ankit Sharma, Puneet Kumar, Vikas Maddukuri in Multimedia Tools and Applications (2020)

  5. No Access

    Article

    Zone-based keyword spotting in Bangla and Devanagari documents

    In this paper, we present a word spotting system in text lines for offline Indic scripts such as Bangla (Bengali) and Devanagari. Recently, it was shown that the zone-wise recognition method improves word reco...

    Ayan Kumar Bhunia, Partha Pratim Roy, Aneeshan Sain in Multimedia Tools and Applications (2020)

  6. No Access

    Article

    Fractional Local Neighborhood Intensity Pattern for Image Retrieval using Genetic Algorithm

    In this paper, a new texture descriptor named “Fractional Local Neighborhood Intensity Pattern” (FLNIP) has been proposed for content-based image retrieval (CBIR). It is an extension of an earlier work involvi...

    Shuvozit Ghose, Abhirup Das, Ayan Kumar Bhunia in Multimedia Tools and Applications (2020)

  7. No Access

    Article

    A study of EEG for enterprise multimedia security

    In this era of technological advancement the security of one’s own identity to access multimedia content have become a major concern for big enterprises. The traditional security mechanisms like PIN numbers, I...

    Bar**der Kaur, Dinesh Singh, Partha Pratim Roy in Multimedia Tools and Applications (2020)

  8. No Access

    Article

    An intelligent recommendation system using gaze and emotion detection

    Recently, recommendation system has become popular in many e-commerce websites. It helps users by suggesting products which they could buy. Existing work till now uses past feedback of user, similarity of othe...

    Saurabh Jaiswal, Shubham Virmani, Vishal Sethi in Multimedia Tools and Applications (2019)

  9. No Access

    Article

    Word searching in scene image and video frame in multi-script scenario using dynamic shape coding

    Retrieval of text information from natural scene images and video frames is a challenging task due to its inherent problems like complex character shapes, low resolution, background noise, etc. Available OCR s...

    Partha Pratim Roy, Ayan Kumar Bhunia in Multimedia Tools and Applications (2019)

  10. No Access

    Article

    Analysis of 3D signatures recorded using leap motion sensor

    Signature recognition is identifying the signature’s owner, whereas verification is the process to find whether a signature is genuine or forged. Though, both are important in the field of forensic sciences, h...

    Santosh Kumar Behera, Debi Prosad Dogra in Multimedia Tools and Applications (2018)

  11. No Access

    Article

    A position and rotation invariant framework for sign language recognition (SLR) using Kinect

    Sign language is the only means of communication for speech and hearing impaired people. Using machine translation, Sign Language Recognition (SLR) systems provide medium of communication between speech and he...

    Pradeep Kumar, Rajkumar Saini, Partha Pratim Roy in Multimedia Tools and Applications (2018)

  12. No Access

    Article

    Text recognition in scene image and video frame using Color Channel selection

    In recent years, recognition of text from natural scene image and video frame has got increased attention among the researchers due to its various complexities and challenges. Because of low resolution, blurri...

    Ayan Kumar Bhunia, Gautam Kumar, Partha Pratim Roy in Multimedia Tools and Applications (2018)

  13. No Access

    Article

    Frame selection for OCR from video stream of book flip**

    Optical Character Recognition (OCR) in video stream of flip** pages is a challenging task because flip** at random speed causes difficulties in identifying the frames that contain the open page image (OPI)...

    Dibyayan Chakraborty, Partha Pratim Roy in Multimedia Tools and Applications (2018)

  14. No Access

    Article

    A Novel framework of EEG-based user identification by analyzing music-listening behavior

    This paper introduces a novel framework for user identification by analyzing neuro-signals. Studies regarding Electroencephalography (EEG) revealed that such bio-signals are sensitive, hard to forge, confident...

    Bar**der Kaur, Dinesh Singh, Partha Pratim Roy in Multimedia Tools and Applications (2017)

  15. No Access

    Article

    Analysis of EEG signals and its application to neuromarketing

    Marketing and promotions of various consumer products through advertisement campaign is a well known practice to increase the sales and awareness amongst the consumers. This essentially leads to increase in pr...

    Mahendra Yadava, Pradeep Kumar, Rajkumar Saini in Multimedia Tools and Applications (2017)

  16. No Access

    Article

    3D text segmentation and recognition using leap motion

    In this paper, we present a method of Human-Computer-Interaction (HCI) through 3D air-writing. Our proposed method includes a natural way of interaction without pen and paper. The online texts are drawn on air...

    Pradeep Kumar, Rajkumar Saini, Partha Pratim Roy in Multimedia Tools and Applications (2017)

  17. No Access

    Article

    A multimodal biometric watermarking system for digital images in redundant discrete wavelet transform

    The traditional watermarking algorithms prove the rightful ownership via embedding of independent watermarks like copyright logos, random noise sequences, text etc into the cover images. Coupling biometrics wi...

    Priyanka Singh, Balasubramanian Raman in Multimedia Tools and Applications (2017)