Search Page | SpringerLink

Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus

This paper stems from the project A World of Possibilities. Modal pathways over an extra-long period of time: the diachrony of modality in the Latin...

Helena Bermúdez-Sabel, Francesca Dell’Oro, Paola Marongiu in Language Resources and Evaluation

Article 06 January 2024

Annotation of scientific uncertainty using linguistic patterns

Scientific uncertainty is an integral part of the research process and inherent to the construction of new knowledge. In this paper, we investigate...

Panggih Kusuma Ningrum, Iana Atanassova in Scientometrics

Article 18 May 2024

A comprehensive examination of emoji usage in Mexican Spanish WhatsApp corpus: a mixed-methods Linguistic approach

The surge of emojis in computer-mediated communication (CMC) since 2011 presents a significant analytical challenge across various disciplines, such...

Monica López-Vázquez, Samuel López-Ruiz in Quality & Quantity

Article 21 June 2024

Linguistic annotation of Byzantine book epigrams

In this paper, we explore the feasibility of develo** a part-of-speech tagger for not-normalised, Byzantine Greek epigrams. Hence, we compared...

Colin Swaelens, Ilse De Vos, Els Lefever in Language Resources and Evaluation

Article 13 December 2023

NewsCom-TOX: a corpus of comments on news articles annotated for toxicity in Spanish

In this article, we present the NewsCom-TOX corpus, a new corpus manually annotated for toxicity in Spanish. NewsCom-TOX consists of 4359 comments in...

Mariona Taulé, Montserrat Nofre, ... Xavier Bonet in Language Resources and Evaluation

Article Open access 17 January 2024

A corpus of Persian literary text

Persian poetry has profoundly affected all periods of Persian literature and the literature of other countries as well. It is a fundamental vehicle...

Shahab Raji, Malihe Alikhani, ... Matthew Stone in Language Resources and Evaluation

Article Open access 23 November 2023

Automatic annotation method of VR speech corpus based on artificial intelligence

With the rapid development of the Internet and artificial intelligence, the demand for data annotation becomes more and more urgent. In order to meet...

Shanshan Yang, Ding Liu in International Journal of Speech Technology

Article 08 February 2022

Temporal Relations at the Sentence and Text Genre Level: The Role of Linguistic Cueing and Non-linguistic Biases—An Annotation Study of a Bilingual Corpus

This study investigates the role of non-linguistic biases in the obligatory (verb tenses) and optional (discourse connectives) linguistic marking for...

Cristina Grisot, Joanna Blochowiak in Corpus Pragmatics

Article Open access 30 April 2021

Slovenian parliamentary corpus siParl

Parliamentary debates represent an essential part of democratic discourse and provide insights into various socio-demographic and linguistic...

Katja Meden, Tomaž Erjavec, Andrej Pančur in Language Resources and Evaluation

Article Open access 02 June 2024

FinnSentiment: a Finnish social media corpus for sentiment polarity annotation

Sentiment analysis and opinion mining are essential tasks with many prominent application areas, e.g., when researching popular opinions on products...

Krister Lindén, Tommi Jauhiainen, Sam Hardwick in Language Resources and Evaluation

Article Open access 03 March 2023

Cross-linguistically consistent semantic and syntactic annotation of child-directed speech

Corpora of child speech and child-directed speech (CDS) have enabled major contributions to the study of child language acquisition, yet semantic...

Ida Szubert, Omri Abend, ... Mark Steedman in Language Resources and Evaluation

Article Open access 15 May 2024

Design and construction of Guayaquil radio speech corpus (CHARG)

The present paper aims to describe the process of creating CHARG—Corpus de Habla Radiofónica de Guayaquil (the Guayaquil Radiophonic Speech Corpus)....

Brygida Sawicka-Stępińska in Language Resources and Evaluation

Article Open access 25 March 2023

Using Semi-automatic Annotation Platform to Create Corpus for Argumentative Zoning

Argumentative Zoning (AZ) is a tool to extract salient information from scientific texts for further Natural Language Processing (NLP) tasks, e.g....

Alaa El-Ebshihy, Annisa Maulida Ningtyas, ... Mira Kania Sabariah in Linking Theory and Practice of Digital Libraries

Conference paper 2023

A Corpus of Quotation Element Annotation for Chinese Novels: Construction, Extraction and Application

Quotations or dialogues are important for literary works, like novels. In the famous ** Yong’s novels, about a half of all sentences contain...

**ge **e, Yuchen Yan, ... Hongying Zan in Neural Information Processing

Conference paper 2024

A morphologically annotated longitudinal corpus of spoken Czech child–adult interactions

The paper presents a longitudinal corpus of transcribed spontaneous child–adult interactions in Czech. It consists of 99,388 tokens in 42,103...

Anna Chromá, Jakub Sláma, ... Jolana Treichelová in Language Resources and Evaluation

Article 30 March 2024

The Visual Language Research Corpus (VLRC): an annotated corpus of comics from Asia, Europe, and the United States

The Visual Language Research Corpus (VLRC) is a dataset of annotations of 376 stories from comics from the United States, northwestern Europe, and...

Neil Cohn, Bruno Cardoso, ... Irmak Hacımusaoğlu in Language Resources and Evaluation

Article Open access 14 July 2023

Syntactic annotation for Portuguese corpora: standards, parsers, and search interfaces

In the last two decades, four Portuguese syntactically annotated corpora were built along the lines initially defined for the Penn Parsed Historical...

Pablo Faria, Charlotte Galves, Catarina Magro in Language Resources and Evaluation

Article 26 December 2023

Analyzing learner language: the case of the Hebrew Learner Essay Corpus

We present the Hebrew Learner Essay Corpus (HELEECS): an annotated corpus of Hebrew language argumentative essays authored by prospective...

Chen Gafni, Livnat Herzig Sheinfux, ... Shuly Wintner in Language Resources and Evaluation

Article Open access 15 May 2024

The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect

This paper presents a new corpus for a dialect of Arabic spoken in the central region of Saudi Arabia: the Najdi Arabic Corpus. This is the first...

Rukayah Alhedayani in Language Resources and Evaluation

Article 02 July 2024

Evolving linguistic divergence on polarizing social media

Language change is influenced by many factors, but often starts from synchronic variation, where multiple linguistic patterns or forms coexist, or...

Andres Karjus, Christine Cuskley in Humanities and Social Sciences Communications

Article Open access 15 March 2024

Search

Filters

Search Results

Search

Navigation