Search
Search Results
-
Improving NLP Techniques by Integrating Linguistic Input to Detect Hate Speech in CMC Corpora
Hate speech detection research relies heavily on automatic detection models that make use of machine learning (ML), opinion mining, sentiment... -
Automatic consistency assurance for literature-based gene ontology annotation
BackgroundLiterature-based gene ontology (GO) annotation is a process where expert curators use uniform expressions to describe gene functions...
-
Investigating annotation noise for named entity recognition
Recent studies revealed that even the most widely used benchmark dataset still contains more than 5% sample-level annotation noise in Named Entity...
-
A New Set of Linguistic Resources for Ukrainian
We have constructed a Ukrainian set of linguistic resources that have allowed us to construct various NLP applications on Ukrainian, including... -
Towards a Useful Chinese Annotation Tool: An Examination of Annotators’ Practice and Needs
With the rapid development of digital humanities research and the digitization of ancient Chinese books, annotation has become a critical... -
Corpus Linguistics in Legal Discourse
There are many different ways in which modern Corpus Linguistics can be used to enrich and broaden our understanding of legal discourse. Based on the...
-
Concepts of Europe in Danish and German Social Media: A Corpus-Linguistic Study
This chapter examines the various concepts of Europe found in Danish social media discourse, set against a comparative background of corresponding... -
Demystifying Corpus Linguistics for English Language Teaching
The aim of this Introduction to the volume Demystifying Corpus Linguistics for English Language Teaching is to introduce the reader to key approaches... -
The Interplay of Laughter and Communicative Purpose in Conversational Discourse: A Corpus-Based Study of British English
Laughter is used strategically in conversational discourse to accomplish pragmatic functions. While other researchers have investigated how...
-
Distinguishing Online Hate Speech from Aggressive Speech: A Five-Factor Annotation Model
This chapter is of an introductory nature, offering a definitional and programmatic reflection about the object under focus in the book. How can hate... -
Part-of-Speech and Pragmatic Tagging of a Corpus of Film Dialogue: A Pilot Study
This article presents how a pilot study for automatically POS-tagging a corpus of orthographic transcriptions of film dialogues (Pavia Corpus of Film...
-
Corpus Annotation
In this chapter, we provide an overview of the main concepts relating to corpus annotation, along with some discussion of the practical aspects of... -
SYN2020: A New Corpus of Czech with an Innovated Annotation
The paper introduces the SYN2020 corpus, a newly released representative corpus of written Czech following the tradition of the Czech National Corpus... -
A Typometrical Study of Greenberg’s Linguistic Universal 1
An interdisciplinary study that combines linguistic typology, corpus linguistics, and computational linguistics to quantitatively review Greenberg’s... -
Constructing a cross-document event coreference corpus for Dutch
Event coreference resolution is a task in which different text fragments that refer to the same real-world event are automatically linked together....
-
A Linguistic Approach to English Phrasal Verbs
This presentation shows how a lexicon grammar dictionary of English phrasal verbs (PV) can be transformed into an electronic dictionary, in order to... -
The Multimedia Corpus of Russian Ironic Speech for Phonetic Analysis
This paper presents a detailed description of the multimedia corpus that was built for the phonetic analysis of Russian ironic speech. The corpus was... -
NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links
This paper describes NEREL—a Russian news dataset suited for three tasks: nested named entity recognition, relation extraction, and entity linking....
-
PRAUTOCAL corpus: a corpus for the study of Down syndrome prosodic aspects
Oral productions of speakers with Down syndrome exhibit special characteristics that have been the target of study for decades. In spite of this...