Skip to main content

and
  1. No Access

    Article

    LWSINet: A deep learning-based approach towards video script identification

    Videos – a high volume of texts – broadcast via different media, such as television and the internet. Since Optical Character Recognition (OCR) engines are script-dependent, script identification is a precurso...

    Mridul Ghosh, Himadri Mukherjee, Sk Md Obaidullah in Multimedia Tools and Applications (2021)

  2. No Access

    Article

    STDNet: A CNN-based approach to single-/mixed-script detection

    Script identification serves as a guide to the detection of the text of the scene through optical character recognition (OCR). But this is not a principal concern for the OCR engine. Until script identificatio...

    Mridul Ghosh, Himadri Mukherjee in Innovations in Systems and Software Engine… (2021)

  3. No Access

    Article

    Understanding movie poster: transfer-deep learning approach for graphic-rich text recognition

    Graphic-rich texts are common in posters. In a movie poster, information, such as movie title, tag lines, and names of the actors, director, and production house, is available. Graphic-rich texts in movie titl...

    Mridul Ghosh, Sayan Saha Roy, Himadri Mukherjee, Sk Md Obaidullah in The Visual Computer (2022)

  4. Article

    LWSNet - a novel deep-learning architecture to segregate Covid-19 and pneumonia from x-ray imagery

    Automatic detection of lung diseases using AI-based tools became very much necessary to handle the huge number of cases occurring across the globe and support the doctors. This paper proposed a novel deep lear...

    Asifuzzaman Lasker, Mridul Ghosh, Sk Md Obaidullah in Multimedia Tools and Applications (2023)

  5. No Access

    Article

    Scene text understanding: recapitulating the past decade

    Computational perception has indeed been dramatically modified and reformed from handcrafted feature-based techniques to the advent of deep learning. Scene text identification and recognition have inexorably b...

    Mridul Ghosh, Himadri Mukherjee, Sk Md Obaidullah in Artificial Intelligence Review (2023)

  6. No Access

    Article

    MOPO-HBT: A movie poster dataset for title extraction and recognition

    Real-world images often encompass embedded texts that adhere to disparate disciplines like business, education, and amusement, to name a few. Such images are graphically rich in terms of font attributes, color...

    Mridul Ghosh, Sayan Saha Roy, Bivan Banik in Multimedia Tools and Applications (2024)