A Hybrid Architecture for Robust Parsing of German
Abstract
This paper provides an overview of current research on a hybrid and robust parsing architecture for the morphological, syntactic and semantic annotation of German text corpora. The novel contribution of this research lies not in the individual parsing modules, each of which relies on state-of-the-art algorithms and techniques, but in the combination of these modules into a single architecture. This combination provides a means to significantly improve the performance of each component, resulting in more accurate annotation.
Similar resources
Robust Syntactic Annotation of Corpora and Memory-based Parsing
This talk provides an overview of current work in my research group on the syntactic annotation of the Tübingen corpus of spoken German and of the German Reference Corpus (Deutsches Referenzkorpus: DEREKO) of written texts. Morpho-syntactic and syntactic annotation as well as annotation of function-argument structure for these corpora is performed automatically by a hybrid architecture that com...
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies
In this paper, we report on first attempts and findings in analyzing German patient records, using a hybrid parsing architecture and a combination of two relation extraction strate...
A Robust And Hybrid Deep-Linguistic Theory Applied To Large-Scale Parsing
Modern statistical parsers are robust and quite fast, but their output is relatively shallow when compared to formal grammar parsers. We suggest to extend statistical approaches to a more deep-linguistic analysis while at the same time keeping the speed and low complexity of a statistical parser. The resulting parsing architecture suggested, implemented and evaluated here is highly robust and h...
Integrated Shallow and Deep Parsing: TopP Meets HPSG
We present a novel, data-driven method for integrated shallow and deep parsing. Mediated by an XML-based multi-layer annotation architecture, we interleave a robust, but accurate stochastic topological field parser of German with a constraint-based HPSG parser. Our annotation-based method for dovetailing shallow and deep phrasal constraints is highly flexible, allowing targeted and fine-grained ...
Explanation of Facade Patterns in Buildings Constructed by German Architects in Iran (Pahlavi Period)
The external shell and physical characteristics of a building have always played an essential role in Iranian architecture; their function has varied across periods and has always been shaped by cultural and social conditions. The diversity of this layer of architecture is more visible in the first Pahlavi period than in other periods. Because by employing...
Publication date: 2002