Skip to main content

and
  1. Article

    Open Access

    A taxonomy and review of generalization research in NLP

    The ability to generalize well is one of the primary desiderata for models of natural language processing (NLP), but what ‘good generalization’ entails and how it should be evaluated is not well understood. In...

    Dieuwke Hupkes, Mario Giulianelli, Verna Dankers in Nature Machine Intelligence (2023)

  2. No Access

    Chapter and Conference Paper

    One World - Seven Thousand Languages (Best Paper Award, Third Place)

    We present a large scale multilingual lexical resource, the Universal Knowledge Core (UKC), which is organized like a Wordnet with, however, a major design difference. In the UKC the meaning of words is represent...

    Fausto Giunchiglia, Khuyagbaatar Batsuren in Computational Linguistics and Intelligent … (2023)

  3. Article

    Open Access

    A large and evolving cognate database

    We present CogNet, a large-scale, automatically-built database of sense-tagged cognates—words of common origin and meaning across languages. CogNet is continuously evolving: its current version contains over 8 mi...

    Khuyagbaatar Batsuren, Gábor Bella, Fausto Giunchiglia in Language Resources and Evaluation (2022)

  4. No Access

    Chapter and Conference Paper

    A Database and Visualization of the Similarity of Contemporary Lexicons

    Lexical similarity data, quantifying the “proximity” of languages based on the similarity of their lexicons, has been increasingly used to estimate the cross-lingual reusability of language resources, for task...

    Gábor Bella, Khuyagbaatar Batsuren, Fausto Giunchiglia in Text, Speech, and Dialogue (2021)

  5. Article

    Open Access

    Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations

    Chemical and biomedical Named Entity Recognition (NER) is an essential prerequisite task before effective text mining can begin for biochemical-text data. Exploiting unlabeled text data to leverage system perf...

    Tsendsuren Munkhdalai, Mei**g Li, Khuyagbaatar Batsuren in Journal of Cheminformatics (2015)