  1. No Access

    Article

    Isolated word recognition based on a hyper-tuned cross-validated CNN-BiLSTM from Mel Frequency Cepstral Coefficients

    Speech Recognition(SR) is a challenging task because existing models expect their users from different geographies to have the same level of proficiency, which is unrealistic. Existing models achieve high clas...

    Bachchu Paul, Santanu Phadikar, Somnath Bera in Multimedia Tools and Applications (2024)

  2. No Access

    Article

    LIFA: Language identification from audio with LPCC-G features

    In Western countries, speech recognition-based technologies have significantly developed compared to the countries of the South Asian subcontinent like India. India is a multilingual country (22 scheduled lang...

    Himadri Mukherjee, Ankita Dhar, Sk Md Obaidullah in Multimedia Tools and Applications (2024)

  3. No Access

    Article

    RAttSR: A Novel Low-Cost Reconstructed Attention-Based End-to-End Speech Recognizer

    People are curious about voice commands for the next generation of interaction. It will play a dominant role in communicating with smart devices in the future. However, language remains a significant barrier t...

    Bachchu Paul, Santanu Phadikar in Circuits, Systems, and Signal Processing (2024)

  4. No Access

    Article

    Biomedical term extraction using fuzzy association

    Automatic term extraction from a biomedical text is a well-known problem in the area of natural language processing. It is carried out by employing four kinds of measures: linguistic and rule-based, dictionary...

    Bidyut Das, Mukta Majumder, Santanu Phadikar, Arif Ahmed Sekh in Soft Computing (2024)

  5. No Access

    Article

    Prediction of spirometry parameters of adult Indian population using machine learning technology

    Spirometry is one of the important non-invasive, sensitive, easy-to-perform, reproducible, and objective biomedical screening and diagnostic procedures in healthcare for the assessment of lung function. To dat...

    Arkaprabha Sau, Santanu Phadikar, Ishita Bhakta in Multimedia Tools and Applications (2024)

  6. No Access

    Article

    Machine learning approach of speech emotions recognition using feature fusion technique

    In advancement of machine learning aspect, speech based emotional states identification must have a profound impact on artificial intelligence. Proper feature selection performs a vital role on such emotion re...

    Bachchu Paul, Somnath Bera, Tanushree Dey in Multimedia Tools and Applications (2024)

  7. No Access

    Article

    A hybrid feature-extracted deep CNN with reduced parameters substitutes an End-to-End CNN for the recognition of spoken Bengali digits

    Speech Recognition (SR) is an emerging field in the native language nowadays. Recognizing isolated words in the local language helps people use smartphones and electronic gadgets without technical or education...

    Bachchu Paul, Santanu Phadikar in Multimedia Tools and Applications (2024)

  8. No Access

    Article

    A Visual Attention-Based Model for Bengali Image Captioning

    Image caption or description generation is a fundamental task that involves computer vision (CV) and natural language processing (NLP) ideas to recognize an image-context and produces description(s) using a na...

    Bidyut Das, Ratnabali Pal, Mukta Majumder, Santanu Phadikar in SN Computer Science (2023)

  9. No Access

    Article

    A novel plant disease prediction model based on thermal images using modified deep convolutional neural network

    With the advancement of deep learning and thermal imaging technology, prediction of plant disease before the appearance of any visual symptoms gains attention. Studies showed that before the appearance of any ...

    Ishita Bhakta, Santanu Phadikar, Koushik Majumder in Precision Agriculture (2023)

  10. No Access

    Article

    A novel pre-processing technique of amplitude interpolation for enhancing the classification accuracy of Bengali phonemes

    In linguistics, phonemes are the atomic sound, called word segmentor play an important role to recognize the word properly. A novel approach of seven Bengali vowels and ten diphthongs (a syllable for the pronu...

    Bachchu Paul, Santanu Phadikar in Multimedia Tools and Applications (2023)

  11. No Access

    Chapter

    Isolated Bangla Spoken Digit and Word Recognition Using MFCC and DTW

    Digit recognition is one of the elegant research topics in modern world. Scientists had already got an excellent output in their research work on this topic for English and Chinese like languages. However, ver...

    Bachchu Paul, Rakesh Paul, Somnath Bera in Engineering Mathematics and Computing (2023)

  12. No Access

    Chapter and Conference Paper

    Application of Machine Learning Technology for Screening of Mental Health Disorder

    Mental health disorders are one of the most significant public health problems worldwide. Currently, one in every eight individuals is suffering from some kind of mental health issue. Anxiety and depression ar...

    Arkaprabha Sau, Santanu Phadikar, Ishita Bhakta in Intelligent Human Centered Computing (2023)

  13. No Access

    Chapter and Conference Paper

    Recognition of Infant Footprint: A Review of Advanced Techniques

    “In spite of RFID tags, NICU baby-swapping cannot be prevented.” Across decades, this question has daunted nursing supervisors, pediatric mentors, and newborns’ mothers, thus inspiring a thread of research in ...

    Enakshmi Ghosh, Ishani Roy, Rahul Modak in Advanced Communication and Intelligent Sys… (2023)

  14. No Access

    Article

    An efficient IDS in cloud environment using feature selection based on DM algorithm

    Cloud Computing provides the use of a wide array of applications to a designated server outside one’s personal computer. In the current technological era with the evolution of the Internet, it is being used on...

    Partha Ghosh, Shashwat Sinha in Journal of Computer Virology and Hacking T… (2022)

  15. No Access

    Article

    An efficient SGM based IDS in cloud environment

    Cloud computing is the sharing of remote access resources over the Internet. But with this comes an extensive risk of unauthorized access. Hence, for the security and privacy of the data, intrusion detection s...

    Partha Ghosh, Zaid Alam, Ritu Raj Sharma, Santanu Phadikar in Computing (2022)

  16. No Access

    Chapter and Conference Paper

    Agricultural Image Augmentation with Generative Adversarial Networks GANs

    Deep Learning gains popularity in almost every field of research currently and agricultural industry is not the exception in this. One of the main challenge in deep learning is the requirement of lots of data ...

    Sayan De, Ishita Bhakta, Santanu Phadikar in Computational Intelligence in Pattern Reco… (2022)

  17. No Access

    Chapter and Conference Paper

    Automatic Sign Language Identification Using Convolutional Neural Network

    Sign language is the mode of communication for such people who are not blessed with the gift of hearing and speech. It involves the use of hands and facial expressions. Understanding sign language is diffi...

    Himadri Mukherjee, Ankita Dhar in Computational Intelligence in Pattern Reco… (2022)

  18. No Access

    Chapter and Conference Paper

    Thermal Image Augmentation with Generative Adversarial Network for Agricultural Disease Prediction

    Nowadays deep neural networks have radically changed the scenery of the recent research field of computer vision. The deep learning based method gains attention to the researcher for their astonishing performa...

    Ishita Bhakta, Santanu Phadikar in Computational Intelligence in Pattern Reco… (2022)

  19. No Access

    Chapter and Conference Paper

    ABID: Attention-Based Bengali Image Description

    Image caption or description generation is a fundamental problem of artificial intelligence. It requires both knowledge, natural language processing, and computer vision together. It automatically produces des...

    Bidyut Das, Arif Ahmed Sekh, Mukta Majumder in Proceedings of the 3rd International Confe… (2022)

  20. No Access

    Article

    Can deep learning solve a preschool image understanding problem?

    Automatic assessment of learning is a process where the computer system automatically generates test items and evaluates the responses. Image is one of the major media to assess learning capabilities. In this ...

    Bidyut Das, Arif Ahmed Sekh, Mukta Majumder in Neural Computing and Applications (2021)
