Benchmarking of Statistical Dependency Parsers for French

نویسندگان

Marie Candito

Joakim Nivre

Pascal Denis

Enrique Henestroza Anguiano

چکیده

We compare the performance of three statistical parsing architectures on the problem of deriving typed dependency structures for French. The architectures are based on PCFGs with latent variables, graph-based dependency parsing and transition-based dependency parsing, respectively. We also study the influence of three types of lexical information: lemmas, morphological features, and word clusters. The results show that all three systems achieve competitive performance, with a best labeled attachment score over 88%. All three parsers benefit from the use of automatically derived lemmas, while morphological features seem to be less important. Word clusters have a positive effect primarily on the latent variable parser.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View

The treatment of medieval texts is a particular challenge for parsers. I compare how two dependency parsers, one graph-based, the other transition-based, perform on Old French, facing some typical problems of medieval texts: graphical variation, relatively free word order, and syntactic variation of several parameters over a diachronic period of about 300 years. Both parsers were trained and ev...

متن کامل

Parsing Any Domain English text to CoNLL dependencies

It is well known that accuracies of statistical parsers trained over Penn treebank on test sets drawn from the same corpus tend to be overestimates of their actual parsing performance. This gives rise to the need for evaluation of parsing performance on corpora from different domains. Evaluating multiple parsers on test sets from different domains can give a detailed picture about the relative ...

متن کامل

Statistical French Dependency Parsing: Treebank Conversion and First Results

We first describe the automatic conversion of the French Treebank (Abeillé and Barrier, 2004), a constituency treebank, into typed projective dependency trees. In order to evaluate the overall quality of the resulting dependency treebank, and to quantify the cases where the projectivity constraint leads to wrong dependencies, we compare a subset of the converted treebank to manually validated d...

متن کامل

Improving the Usability of Statistical Parsers by Incorporating Linguistic Constraints

Statistical systems with high accuracy are very useful in real-world applications. If these systems can capture basic linguistic information, then the usefulness of these statistical systems improve a lot. This paper is an attempt at incorporating linguistic constraints in statistical dependency parsing. We consider a simple linguistic constraint that a verb should not have multiple subjects/ob...

متن کامل

Evaluation of Dependency Parsers on Unbounded Dependencies

We evaluate two dependency parsers, MSTParser and MaltParser, with respect to their capacity to recover unbounded dependencies in English, a type of evaluation that has been applied to grammarbased parsers and statistical phrase structure parsers but not to dependency parsers. The evaluation shows that when combined with simple post-processing heuristics, the parsers correctly recall unbounded ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Benchmarking of Statistical Dependency Parsers for French

نویسندگان

چکیده

منابع مشابه

Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View

Parsing Any Domain English text to CoNLL dependencies

Statistical French Dependency Parsing: Treebank Conversion and First Results

Improving the Usability of Statistical Parsers by Incorporating Linguistic Constraints

Evaluation of Dependency Parsers on Unbounded Dependencies

عنوان ژورنال:

اشتراک گذاری