Skip to main content

and
  1. Article

    Open Access

    Finnish parliament ASR corpus

    Public sources like parliament meeting recordings and transcripts provide ever-growing material for the training and evaluation of automatic speech recognition (ASR) systems. In this paper, we publish and anal...

    Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo in Language Resources and Evaluation (2023)

  2. Article

    Open Access

    Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks

    The Donate Speech campaign has so far succeeded in gathering approximately 3600 h of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus. The corpus includes over twenty thousand ...

    Anssi Moisio, Dejan Porjazovski, Aku Rouhe in Language Resources and Evaluation (2023)

  3. No Access

    Chapter and Conference Paper

    An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR

    Standard end-to-end training of attention-based ASR models only uses transcribed speech. If they are compared to HMM/DNN systems, which additionally leverage a large corpus of text-only data and expert-crafted...

    Aku Rouhe, Astrid Van Camp, Mittul Singh, Hugo Van Hamme in Speech and Computer (2021)

  4. Article

    Open Access

    Multimodal machine translation through visuals and speech

    Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data. The most...

    Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe in Machine Translation (2020)