Enhancing Anaphora Resolution for Czech

Author

  • Vasek Nemcík
Abstract

Resolution of anaphoric reference is one of the most important challenges in natural language processing (NLP). The functionality of most NLP systems crucially relies on an accurate mechanism for determining which expressions in the input refer to the same entity in the real world. The immense complexity of this issue has led the research community to adopt predominantly knowledge-poor methods, despite the fact that these are known to be incapable of solving this task reliably. This paper suggests several ways of extending such methods with further resources and mechanisms in order to arrive at a more adequate anaphora resolution procedure.


Similar resources

Saara: Anaphora Resolution on Free Text in Czech

Anaphora resolution is one of the key parts of modern NLP systems, and not addressing it usually means a notable performance drop. Despite the abundance of theoretical studies published in the previous decades, real systems for resolving anaphora are rather rare. In this article we present, to our knowledge, the first practical anaphora resolution system applicable to Czech free text. We descri...


The Saara Framework: An Anaphora Resolution System for Czech

Determining reference and referential links in discourse is one of the biggest and most important challenges in natural language understanding. In particular, computing coreference classes over the set of referring expressions in text is crucial for its further syntactic and semantic processing. We present a system for automatic anaphora resolution that can be used on arbitrary texts in Czech. ...


Anaphora in Czech: Large Data and Experiments with Automatic Anaphora Resolution

The aim of this paper is two-fold. First, we want to present a part of the annotation scheme of the Prague Dependency Treebank 2.0 related to the annotation of coreference on the tectogrammatical layer of sentence representation (more than 45,000 textual and grammatical coreference links in almost 50,000 manually annotated Czech sentences). Second, we report a new pronoun resolution system deve...


The Saara Framework

The determination of reference and referential links in discourse is one of the important challenges in natural language understanding. The first commonly adopted step towards this objective is to determine coreference classes over the set of referring expressions. We present a modular framework for automatic anaphora resolution which makes it possible to specify various anaphora resolution alg...


Comparison of Classification and Ranking Approaches to Pronominal Anaphora Resolution in Czech

In this paper we compare two Machine Learning approaches to the task of pronominal anaphora resolution: a conventional classification system based on C5.0 decision trees, and a novel perceptron-based ranker. We use coreference links annotated in the Prague Dependency Treebank 2.0 for training and evaluation purposes. The perceptron system achieves f-score 79.43% on recognizing coreference of pe...




Publication date: 2007