Search
Search Results
-
Russian Web Tables: A Public Corpus of Web Tables for Russian Language Based on Wikipedia
AbstractCorpora that contain tabular data such as WebTables are a vital resource for the academic community. Essentially, they are the backbone of...
-
Named Entity Recognition in Russian Using Multi-Task LSTM-CRF
Named entity recognition (NER) is aimed at obtaining the important information from the unstructured data presented in the form of natural language...
-
Syntax-based transfer learning for the task of biomedical relation extraction
BackgroundTransfer learning aims at enhancing machine learning performance on a problem by reusing labeled data originally designed for a related,...
-
An annotated corpus of clinical trial publications supporting schema-based relational information extraction
BackgroundThe evidence-based medicine paradigm requires the ability to aggregate and compare outcomes of interventions across different trials. This...
-
MedLexSp – a medical lexicon for Spanish medical natural language processing
BackgroundMedical lexicons enable the natural language processing (NLP) of health texts. Lexicons gather terms and concepts from thesauri and...
-
The Software System LingvoDoc and the Possibilities It Offers for Documentation and Analysis of Ob-Ugric Languages
Abstract —The LingvoDoc system ( http://lingvodoc.ispras.ru ) provides a service for collaborative language documentation and computations on the...
-
SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks
BackgroundThe high volume of research focusing on extracting patient information from electronic health records (EHRs) has led to an increase in the...
-
Speech corpora subset selection based on time-continuous utterances features
An extremely large corpus with rich acoustic properties is very useful for training new speech recognition and semantic analysis models. However, it...
-
Extraction of Requirement Bases from Domain Normative Documents and Classifiers with Application to the Russian Building Code
AbstractIn this research, we present a method for automated construction (extraction) of knowledge bases of requirements (requirement bases) from...
-
Data
The analyses in this book use a large quantity of data, all of which is publicly available from data resource consortia. The data is categorized into... -
Assessment of Text Coherence by Constructing the Graph of Semantic, Lexical, and Grammatical Consistancy of Phrases of Sentences
The graph-based method of coherence assessment of texts based on the analysis of semantic, grammatical, and lexical consistency of sentence phrases...
-
CAS: corpus of clinical cases in French
BackgroundTextual corpora are extremely important for various NLP applications as they provide information necessary for creating, setting and...
-
We are not ready yet: limitations of state-of-the-art disease named entity recognizers
BackgroundIntense research has been done in the area of biomedical natural language processing. Since the breakthrough of transfer learning-based...
-
Articulation of Elements
The previous two parts of this book considered statistical universals of language. Sequences were input to specific analysis methods to examine the... -
Bias in Rank-Frequency Relation
As shown at the end of the previous chapter, the rank-frequency relation of Moby Dick almost follows a power law -> with an η value close to 1. The... -
On the Issue of Optimum Machine Learning Methods for Filling and Updating Nuclear Knowledge Graphs
AbstractThe paper deals with the issues of finding and researching optimum algorithms for classification and semantic annotation of textual network...
-
Recovering Word Forms by Context for Morphologically Rich Languages
In this work, we focus on “sentence-level unlemmatization,” the task of generating a grammatical sentence given a lemmatized one; this task is...
-
Temporal information extraction from mental health records to identify duration of untreated psychosis
BackgroundDuration of untreated psychosis (DUP) is an important clinical construct in the field of mental health, as longer DUP can be associated...
-
Method for Generating Interpretable Embeddings Based on Superconcepts
AbstractThis paper presents an approach to creating interpretable word embeddings, in which each component of the vector corresponds to some...
-
Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection
BackgroundThe semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty,...