Skip to main content

and
  1. Article

    Open Access

    NERO: a biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding

    Machine reading (MR) is essential for unlocking valuable knowledge contained in millions of existing biomedical documents. Over the last two decades1,2, the most dramatic advances in MR have followed in the wake ...

    Kanix Wang, Robert Stevens, Halima Alachram, Yu Li in npj Systems Biology and Applications (2021)

  2. No Access

    Chapter and Conference Paper

    Within-Ejaculate Sperm Selection and Its Implications for Assisted Reproduction Technologies

    In most animals, males produce large numbers of sperm in each ejaculate, but only very few end up fertilising an egg. This bottleneck in sperm numbers from ejaculation to fertilisation offers an intuitive oppo...

    Ghazal Alavioon, Daniel Marcu, Simone Immler in XIIIth International Symposium on Spermato… (2021)

  3. No Access

    Article

    Incident-Driven Machine Translation and Name Tagging for Low-resource Languages

    We describe novel approaches to tackling the problem of natural language processing for low-resource languages. The approaches are embodied in systems for name tagging and machine translation (MT) that we cons...

    Ulf Hermjakob, Qiang Li, Daniel Marcu, Jonathan May in Machine Translation (2018)

  4. Chapter and Conference Paper

    Building and Using a Knowledge Graph to Combat Human Trafficking

    There is a huge amount of data spread across the web and stored in databases that we can use to build knowledge graphs. However, exploiting this data to build knowledge graphs is difficult due to the heterogen...

    Pedro Szekely, Craig A. Knoblock, Jason Slepicka in The Semantic Web - ISWC 2015 (2015)

  5. No Access

    Chapter

    Exploiting Comparable Corpora

    Comparable corpora exhibit various degrees of parallelism. Fung and Cheung [3] describe corpora ranging from noisy parallel, to comparable, and finally to very non-parallel. The last category contains corpora com...

    Dragos Stefan Munteanu, Daniel Marcu in Building and Using Comparable Corpora (2013)

  6. Article

    Search-based structured prediction

    We present Searn, an algorithm for integrating search and learning to solve complex structured prediction problems such as those that occur in natural language, speech, computational biology, and vision. Searn is...

    Hal Daumé III, John Langford, Daniel Marcu in Machine Learning (2009)

  7. No Access

    Chapter

    How To Select An Answer String?

    Given a question Q and a sentence/paragraph SP that is likely to contain the answer to Q, an answer selection module is supposed to select the “exact” answer sub-string A ⊂ SP. We study three distinct approach...

    Abdessamad Echihabi, Ulf Hermjakob in Advances in Open Domain Question Answering (2008)

  8. No Access

    Chapter and Conference Paper

    Unsupervised Learning of Verb Argument Structures

    We present a statistical generative model for unsupervised learning of verb argument structures. The model was used to automatically induce the argument structures for the 1,500 most frequent verbs of English....

    Thiago Alexandre Salgueiro Pardo in Computational Linguistics and Intelligent … (2006)

  9. No Access

    Chapter and Conference Paper

    Towards Develo** Probabilistic Generative Models for Reasoning with Natural Language Representations

    Probabilistic generative models have been applied successfully in a wide range of applications that range from speech recognition and part of speech tagging, to machine translation and information retrieval, b...

    Daniel Marcu in Computational Linguistics and Intelligent Text Processing (2005)

  10. No Access

    Chapter and Conference Paper

    Cross-Language Question Answering at the USC Information Sciences Institute

    The TextMap-TMT cross-language question answering system at USC-ISI was designed to answer Spanish questions from English documents. The system is fully automatic, including question translation from Spanish t...

    Abdessamad Echihabi, Douglas W. Oard in Comparative Evaluation of Multilingual Inf… (2004)

  11. No Access

    Chapter and Conference Paper

    Text Simplification for Information-Seeking Applications

    This paper addresses the issue of simplifying natural language texts in order to ease the task of accessing factual information contained in them. We define the notion of Easy Access Sentence – a unit of text ...

    Beata Beigman Klebanov, Kevin Knight in On the Move to Meaningful Internet Systems… (2004)

  12. No Access

    Article

    A Machine Learning Approach for Identification Thesis and Conclusion Statements in Student Essays

    This study describes and evaluates twoessay-based discourse analysis systems thatidentify thesis and conclusion statements fromstudent essays written on six different essaytopics. Essays used to train and eval...

    Jill Burstein, Daniel Marcu in Computers and the Humanities (2003)

  13. No Access

    Chapter

    Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory

    We describe our experience in develo** a discourse-annotated corpus for community-wide use. Working in the framework of Rhetorical Structure Theory, we were able to create a large annotated resource with ver...

    Lynn Carlson, Daniel Marcu in Current and New Directions in Discourse an… (2003)

  14. No Access

    Article

    Translation with Scarce Bilingual Resources

    Machine translation of human languages is a field almost as old as computers themselves. Recent approaches to this challenging problem aim at learning translation knowledge automatically (or semi-automatically...

    Yaser Al-Onaizan, Ulrich Germann, Ulf Hermjakob, Kevin Knight in Machine Translation (2002)

  15. No Access

    Chapter and Conference Paper

    Using a Large Monolingual Corpus to Improve Translation Accuracy

    The existence of a phrase in a large monolingual corpus is very useful information, and so is its frequency. We introduce an alternative approach to automatic translation of phrases/sentences that operationali...

    Radu Soricut, Kevin Knight in Machine Translation: From Research to Real Users (2002)

  16. No Access

    Chapter and Conference Paper

    Translation by the Numbers: Language Weaver

    Pre-market prototype - to be available commercially in the second or third quarter of 2003.

    Bryce Benjamin, Kevin Knight, Daniel Marcu in Machine Translation: From Research to Real… (2002)

  17. No Access

    Chapter and Conference Paper

    Foundations of a logical approach to agent programming

    This paper describes a novel approach to high-level agent programming based on a highly developed logical theory of action. The user provides a specification of the agents' basic actions (preconditions and eff...

    Yves Lespérance, Hector J. Levesque in Intelligent Agents II Agent Theories, Arch… (1996)