Skip to main content

and
  1. Article

    Open Access

    Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks

    The Donate Speech campaign has so far succeeded in gathering approximately 3600 h of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus. The corpus includes over twenty thousand ...

    Anssi Moisio, Dejan Porjazovski, Aku Rouhe in Language Resources and Evaluation (2023)

  2. No Access

    Chapter and Conference Paper

    Attention-Based End-to-End Named Entity Recognition from Speech

    Named entities are heavily used in the field of spoken language understanding, which uses speech as an input. The standard way of doing named entity recognition from speech involves a pipeline of two systems, ...

    Dejan Porjazovski, Juho Leinonen, Mikko Kurimo in Text, Speech, and Dialogue (2021)