Search Page | SpringerLink

Russian Web Tables: A Public Corpus of Web Tables for Russian Language Based on Wikipedia

Abstract

Corpora that contain tabular data such as WebTables are a vital resource for the academic community. Essentially, they are the backbone of...

P. E. Fedorov, A. V. Mironov, G. A. Chernishev in Lobachevskii Journal of Mathematics

Article 01 January 2023

Named Entity Recognition in Russian Using Multi-Task LSTM-CRF

Named entity recognition (NER) is aimed at obtaining the important information from the unstructured data presented in the form of natural language...

D. Mazitov, I. Alimova, E. Tutubalina in Journal of Mathematical Sciences

Article 22 June 2023

Syntax-based transfer learning for the task of biomedical relation extraction

Background

Transfer learning aims at enhancing machine learning performance on a problem by reusing labeled data originally designed for a related,...

Joël Legrand, Yannick Toussaint, ... Adrien Coulet in Journal of Biomedical Semantics

Article Open access 18 August 2021

An annotated corpus of clinical trial publications supporting schema-based relational information extraction

Background

The evidence-based medicine paradigm requires the ability to aggregate and compare outcomes of interventions across different trials. This...

Olivia Sanchez-Graillet, Christian Witte, ... Philipp Cimiano in Journal of Biomedical Semantics

Article Open access 23 May 2022

MedLexSp – a medical lexicon for Spanish medical natural language processing

Background

Medical lexicons enable the natural language processing (NLP) of health texts. Lexicons gather terms and concepts from thesauri and...

Leonardo Campillos-Llanos in Journal of Biomedical Semantics

Article Open access 02 February 2023

The Software System LingvoDoc and the Possibilities It Offers for Documentation and Analysis of Ob-Ugric Languages

Abstract

The LingvoDoc system (http://lingvodoc.ispras.ru) provides a service for collaborative language documentation and computations on the...

Yu. V. Normanskaja, O. D. Borisenko, ... A. I. Avetisyan in Doklady Mathematics

Article Open access 01 June 2022

SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks

Background

The high volume of research focusing on extracting patient information from electronic health records (EHRs) has led to an increase in the...

Lucas Emanuel Silva e Oliveira, Ana Carolina Peters, ... Claudia Maria Cabral Moro in Journal of Biomedical Semantics

Article Open access 08 May 2022

Speech corpora subset selection based on time-continuous utterances features

An extremely large corpus with rich acoustic properties is very useful for training new speech recognition and semantic analysis models. However, it...

Luobing Dong, Qiumin Guo, Weili Wu in Journal of Combinatorial Optimization

Article 21 September 2018

Extraction of Requirement Bases from Domain Normative Documents and Classifiers with Application to the Russian Building Code

Abstract

In this research, we present a method for automated construction (extraction) of knowledge bases of requirements (requirement bases) from...

I. Baimuratov, D. Turygin, ... D. Mouromtsev in Lobachevskii Journal of Mathematics

Article 01 January 2023

Data

The analyses in this book use a large quantity of data, all of which is publicly available from data resource consortia. The data is categorized into...

Kumiko Tanaka-Ishii in Statistical Universals of Language

Chapter 2021

Assessment of Text Coherence by Constructing the Graph of Semantic, Lexical, and Grammatical Consistancy of Phrases of Sentences

The graph-based method of coherence assessment of texts based on the analysis of semantic, grammatical, and lexical consistency of sentence phrases...

S. D. Pogorilyy, A. A. Kramov in Cybernetics and Systems Analysis

Article 25 November 2020

CAS: corpus of clinical cases in French

Background

Textual corpora are extremely important for various NLP applications as they provide information necessary for creating, setting and...

Natalia Grabar, Clément Dalloux, Vincent Claveau in Journal of Biomedical Semantics

Article Open access 06 August 2020

We are not ready yet: limitations of state-of-the-art disease named entity recognizers

Background

Intense research has been done in the area of biomedical natural language processing. Since the breakthrough of transfer learning-based...

Lisa Kühnel, Juliane Fluck in Journal of Biomedical Semantics

Article Open access 27 October 2022

Articulation of Elements

The previous two parts of this book considered statistical universals of language. Sequences were input to specific analysis methods to examine the...

Kumiko Tanaka-Ishii in Statistical Universals of Language

Chapter 2021

Bias in Rank-Frequency Relation

As shown at the end of the previous chapter, the rank-frequency relation of Moby Dick almost follows a power law with an η value close to 1. The...

Kumiko Tanaka-Ishii in Statistical Universals of Language

Chapter 2021

On the Issue of Optimum Machine Learning Methods for Filling and Updating Nuclear Knowledge Graphs

Abstract

The paper deals with the issues of finding and researching optimum algorithms for classification and semantic annotation of textual network...

V. P. Telnov, Y. A. Korovin, K. V. Odintsov in Lobachevskii Journal of Mathematics

Article 01 January 2023

Recovering Word Forms by Context for Morphologically Rich Languages

In this work, we focus on “sentence-level unlemmatization,” the task of generating a grammatical sentence given a lemmatized one; this task is...

A. M. Alekseev, S. I. Nikolenko in Journal of Mathematical Sciences

Article 22 June 2023

Temporal information extraction from mental health records to identify duration of untreated psychosis

Background

Duration of untreated psychosis (DUP) is an important clinical construct in the field of mental health, as longer DUP can be associated...

Natalia Viani, Joyce Kam, ... Sumithra Velupillai in Journal of Biomedical Semantics

Article Open access 10 March 2020

Method for Generating Interpretable Embeddings Based on Superconcepts

Abstract

This paper presents an approach to creating interpretable word embeddings, in which each component of the vector corresponds to some...

M. M. Tikhomirov, N. V. Loukachevitch in Lobachevskii Journal of Mathematics

Article 01 August 2023

Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection

Background

The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty,...

Abdullateef I. Almudaifer, Whitney Covington, ... John D. Osborne in Journal of Biomedical Semantics

Article Open access 07 June 2024
