
  1. Chapter and Conference Paper

    Overview of Touché 2022: Argument Retrieval

    The goal of the Touché lab on argument retrieval is to foster and support the development of technologies for argument mining and argument analysis. In the third edition of Touché, we organize three shared tas...

    Alexander Bondarenko, Maik Fröbe, Johannes Kiesel in Advances in Information Retrieval (2022)

  2. Chapter and Conference Paper

    Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection

    The paper gives a brief overview of the four shared tasks to be organized at the PAN 2022 lab on digital text forensics and stylometry hosted at the CLEF 2022 conference. The tasks include authorship verificat...

    Janek Bevendorff, Berta Chulvi, Elisabetta Fersini in Advances in Information Retrieval (2022)

  3. Chapter and Conference Paper

    The Power of Anchor Text in the Neural Retrieval Era

    In the early days of web search, a study by Craswell et al. [11] showed that anchor texts are particularly helpful ranking features for navigational queries and a study by Eiron and McCurley [24] showed that anch...

    Maik Fröbe, Sebastian Günther, Maximilian Probst in Advances in Information Retrieval (2022)

  4. Chapter and Conference Paper

    Shared Tasks on Authorship Analysis at PAN 2020

    The paper gives a brief overview of the four shared tasks that are to be organized at the PAN 2020 lab on digital text forensics and stylometry, hosted at the CLEF conference. The tasks include author profiling, ...

    Janek Bevendorff, Bilal Ghanem, Anastasia Giachanou in Advances in Information Retrieval (2020)

  5. Chapter and Conference Paper

    The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines

    Current best practices for the evaluation of search engines do not take into account duplicate documents. Dependent on their prevalence, not discounting duplicates during evaluation artificially inflates perfo...

    Maik Fröbe, Jan Philipp Bittner, Martin Potthast in Advances in Information Retrieval (2020)

  6. Chapter and Conference Paper

    A Search Engine for Police Press Releases to Double-Check the News

    Many people have doubts about the factual accuracy of online news, while still trusting the press releases of police departments. To enable an easy corroboration of online news about police-related events, we ...

    Maik Fröbe, Nina Schwanke, Matthias Hagen in Advances in Information Retrieval (2020)

  7. Chapter and Conference Paper

    Touché: First Shared Task on Argument Retrieval

    Technologies for argument mining and argumentation processing are maturing continuously, giving rise to the idea of retrieving arguments in search scenarios. We introduce Touché, the first lab on Argument Retrie...

    Alexander Bondarenko, Matthias Hagen, Martin Potthast in Advances in Information Retrieval (2020)

  8. Chapter and Conference Paper

    A Decade of Shared Tasks in Digital Text Forensics at PAN

    Digital text forensics aims at examining the originality and credibility of information in electronic documents and, in this regard, to extract and analyze information about the authors of these documents. The...

    Martin Potthast, Paolo Rosso, Efstathios Stamatatos in Advances in Information Retrieval (2019)

  9. Chapter and Conference Paper

    Wikipedia Text Reuse: Within and Without

    We study text reuse related to Wikipedia at scale by compiling the first corpus of text reuse cases within Wikipedia as well as without (i.e., reuse of Wikipedia text in a sample of the Common Crawl). To disco...

    Milad Alshomary, Michael Völske, Tristan Licht in Advances in Information Retrieval (2019)

  10. Chapter and Conference Paper

    Predicting Retrieval Success Based on Information Use for Writing Tasks

    This paper asks to what extent querying, clicking, and text editing behavior can predict the usefulness of the search results retrieved during essay writing. To render the usefulness of a search result directl...

    Pertti Vakkari, Michael Völske, Martin Potthast in Digital Libraries for Open Knowledge (2018)

  11. Chapter and Conference Paper

    Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl

    Elastic ChatNoir (Search: www.chatnoir.eu, Code: www.github.com/chatnoir-eu) is an Elasticsearch-based se...

    Janek Bevendorff, Benno Stein, Matthias Hagen in Advances in Information Retrieval (2018)

  12. Chapter and Conference Paper

    Algorithms and Corpora for Persian Plagiarism Detection

    The task of plagiarism detection is to find passages of text-reuse in a suspicious document. This task is of increasing relevance, since scholars around the world take advantage of the fact that information ab...

    Habibollah Asghari, Salar Mohtaj, Omid Fatemi, Heshaam Faili in Text Processing (2018)

  13. Chapter and Conference Paper

    Overview of PAN’17

    The PAN 2017 shared tasks on digital text forensics were held in conjunction with the annual CLEF conference. This paper gives a high-level overview of each of the three shared tasks organized this year, namel...

    Martin Potthast, Francisco Rangel in Experimental IR Meets Multilinguality, Mul… (2017)

  14. Chapter and Conference Paper

    Clickbait Detection

    This paper proposes a new model for the detection of clickbait, i.e., short messages that lure readers to click a link. Clickbait is primarily used by online content publishers to increase their readership, wh...

    Martin Potthast, Sebastian Köpsel, Benno Stein in Advances in Information Retrieval (2016)

  15. Chapter and Conference Paper

    Overview of PAN’16

    This paper presents an overview of the PAN/CLEF evaluation lab. During the last decade, PAN has been established as the main forum of digital text forensic research. PAN 2016 comprises three shared tasks: (i) aut...

    Paolo Rosso, Francisco Rangel in Experimental IR Meets Multilinguality, Mul… (2016)

  16. Chapter and Conference Paper

    Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval

    In this paper, we revisit author identification research by conducting a new kind of large-scale reproducibility study: we select 15 of the most influential papers for author identification and recruit a group...

    Martin Potthast, Sarah Braun, Tolga Buz in Advances in Information Retrieval (2016)

  17. Chapter and Conference Paper

    Overview of the PAN/CLEF 2015 Evaluation Lab

    This paper presents an overview of the PAN/CLEF evaluation lab. During the last decade, PAN has been established as the main forum of text mining research focusing on the identification of personal traits of a...

    Efstathios Stamatatos, Martin Potthast in Experimental IR Meets Multilinguality, Mul… (2015)

  18. Chapter and Conference Paper

    Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores

    We reproduce three classification approaches with diverse feature sets for the task of classifying the sentiment expressed in a given tweet as either positive, neutral, or negative. The reproduced approaches a...

    Matthias Hagen, Martin Potthast, Michel Büchner in Advances in Information Retrieval (2015)

  19. Chapter and Conference Paper

    Improving the Reproducibility of PAN’s Shared Tasks:

    This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks on plagiarism detection, author identification, and author profiling. To improve the reproducibility of shared tasks in general,...

    Martin Potthast, Tim Gollub in Information Access Evaluation. Multilingua… (2014)

  20. Chapter and Conference Paper

    Recent Trends in Digital Text Forensics and Its Evaluation

    This paper outlines the concepts and achievements of our evaluation lab on digital text forensics, PAN 13, which called for original research and development on plagiarism detection, author identification, and...

    Tim Gollub, Martin Potthast, Anna Beyer in Information Access Evaluation. Multilingua… (2013)
