![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Article
Open AccessA taxonomy and review of generalization research in NLP
The ability to generalize well is one of the primary desiderata for models of natural language processing (NLP), but what ‘good generalization’ entails and how it should be evaluated is not well understood. In...
-
Chapter and Conference Paper
One World - Seven Thousand Languages (Best Paper Award, Third Place)
We present a large scale multilingual lexical resource, the Universal Knowledge Core (UKC), which is organized like a Wordnet with, however, a major design difference. In the UKC the meaning of words is represent...
-
Article
Open AccessA large and evolving cognate database
We present CogNet, a large-scale, automatically-built database of sense-tagged cognates—words of common origin and meaning across languages. CogNet is continuously evolving: its current version contains over 8 mi...
-
Chapter and Conference Paper
A Database and Visualization of the Similarity of Contemporary Lexicons
Lexical similarity data, quantifying the “proximity” of languages based on the similarity of their lexicons, has been increasingly used to estimate the cross-lingual reusability of language resources, for task...
-
Article
Open AccessIncorporating domain knowledge in chemical and biomedical named entity recognition with word representations
Chemical and biomedical Named Entity Recognition (NER) is an essential prerequisite task before effective text mining can begin for biochemical-text data. Exploiting unlabeled text data to leverage system perf...