![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Chapter and Conference Paper
Overview of Touché 2022: Argument Retrieval
The goal of the Touché lab on argument retrieval is to foster and support the development of technologies for argument mining and argument analysis. In the third edition of Touché, we organize three shared tas...
-
Chapter and Conference Paper
Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection
The paper gives a brief overview of the four shared tasks to be organized at the PAN 2022 lab on digital text forensics and stylometry hosted at the CLEF 2022 conference. The tasks include authorship verificat...
-
Chapter and Conference Paper
The Power of Anchor Text in the Neural Retrieval Era
In the early days of web search, a study by Craswell et al. [11] showed that anchor texts are particularly helpful ranking features for navigational queries and a study by Eiron and McCurley [24] showed that anch...
-
Chapter and Conference Paper
Shared Tasks on Authorship Analysis at PAN 2020
The paper gives a brief overview of the four shared tasks that are to be organized at the PAN 2020 lab on digital text forensics and stylometry, hosted at CLEF conference. The tasks include author profiling, ...
-
Chapter and Conference Paper
The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines
Current best practices for the evaluation of search engines do not take into account duplicate documents. Dependent on their prevalence, not discounting duplicates during evaluation artificially inflates perfo...
-
Chapter and Conference Paper
A Search Engine for Police Press Releases to Double-Check the News
Many people have doubts about the factual accuracy of online news, while still trusting the press releases of police departments. To enable an easy corroboration of online news about police-related events, we ...
-
Chapter and Conference Paper
Touché: First Shared Task on Argument Retrieval
Technologies for argument mining and argumentation processing are maturing continuously, giving rise to the idea of retrieving arguments in search scenarios. We introduce Touché, the first lab on Argument Retrie...
-
Chapter and Conference Paper
A Decade of Shared Tasks in Digital Text Forensics at PAN
Digital text forensics aims at examining the originality and credibility of information in electronic documents and, in this regard, to extract and analyze information about the authors of these documents. The...
-
Chapter and Conference Paper
Wikipedia Text Reuse: Within and Without
We study text reuse related to Wikipedia at scale by compiling the first corpus of text reuse cases within Wikipedia as well as without (i.e., reuse of Wikipedia text in a sample of the Common Crawl). To disco...
-
Chapter and Conference Paper
Predicting Retrieval Success Based on Information Use for Writing Tasks
This paper asks to what extent querying, clicking, and text editing behavior can predict the usefulness of the search results retrieved during essay writing. To render the usefulness of a search result directl...
-
Chapter and Conference Paper
Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl
Elastic ChatNoir (Search:www.chatnoir.eu Code:www.github.com/chatnoir-eu) is an Elasticsearch-based se...
-
Chapter and Conference Paper
Algorithms and Corpora for Persian Plagiarism Detection
The task of plagiarism detection is to find passages of text-reuse in a suspicious document. This task is of increasing relevance, since scholars around the world take advantage of the fact that information ab...
-
Chapter and Conference Paper
Overview of PAN’17
The PAN 2017 shared tasks on digital text forensics were held in conjunction with the annual CLEF conference. This paper gives a high-level overview of each of the three shared tasks organized this year, namel...
-
Chapter and Conference Paper
Clickbait Detection
This paper proposes a new model for the detection of clickbait, i.e., short messages that lure readers to click a link. Clickbait is primarily used by online content publishers to increase their readership, wh...
-
Chapter and Conference Paper
Overview of PAN’16
This paper presents an overview of the PAN/CLEF evaluation lab. During the last decade, PAN has been established as the main forum of digital text forensic research. PAN 2016 comprises three shared tasks: (i) aut...
-
Chapter and Conference Paper
Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval
In this paper, we revisit author identification research by conducting a new kind of large-scale reproducibility study: we select 15 of the most influential papers for author identification and recruit a group...
-
Chapter and Conference Paper
Overview of the PAN/CLEF 2015 Evaluation Lab
This paper presents an overview of the PAN/CLEF evaluation lab. During the last decade, PAN has been established as the main forum of text mining research focusing on the identification of personal traits of a...
-
Chapter and Conference Paper
Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores
We reproduce three classification approaches with diverse feature sets for the task of classifying the sentiment expressed in a given tweet as either positive, neutral, or negative. The reproduced approaches a...
-
Chapter and Conference Paper
Improving the Reproducibility of PAN’s Shared Tasks:
This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks on plagiarism detection, author identification, and author profiling. To improve the reproducibility of shared tasks in general,...
-
Chapter and Conference Paper
Recent Trends in Digital Text Forensics and Its Evaluation
This paper outlines the concepts and achievements of our evaluation lab on digital text forensics, PAN 13, which called for original research and development on plagiarism detection, author identification, and...