Random Sentences from a Generalized Phrase-Structure Grammar Interpreter
نویسنده
چکیده
In numerous domains in cognitive science it is often useful to have a source for randomly generated corpora. These corpora may serve as a foundation for artificial stimuli in a learning experiment (e.g., Ellefson & Christiansen, 2000), or as input into computational models (e.g., Christiansen & Dale, 2001). The following compact and general C program interprets a phrasestructure grammar specified in a text file. It follows parameters set at a Unix or Unix-based command-line and generates a corpus of random sentences from that grammar.
منابع مشابه
Constraint Grammar As A Framework For Parsing Running Text
1. Outline Grammars which are used in parsers are often directly imported from autonomous grammar theory and descriptive practice that were not exercised for the explicit purpose of parsing. Parsers have been designed for English based on e.g. Government and Binding Theory, Generalized Phrase Structure Grammar, and LexicaI-Functional Grammar. We present a formalism to be used for parsing where ...
متن کاملFeature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کاملTHE X - BAR THEORY OF PHRASE STRUCTUREAndr
X-bar theory is widely regarded as a substantive theory of phrase structure properties in natural languages. In this paper we will demonstrate that a formalization of its content reveals very little substance in its claims. We state and discuss six conditions that encapsulate the claims of X-bar theory: Lexicality | each nonterminal is a projection of a preterminal; Succession | each X n+1 domi...
متن کاملProbabilistic Language Model for Analyzing Korean Sentences
In this paper, we introduce a restricted form of phrase structure grammar to handle the characteristics of Korean more eeciently. Based on this restricted form of the grammar, we propose a probabilistic parser for Korean sentences. To show usefulness of the parser proposed in this paper, we made a preliminary experiment. We extract a set of rules from about 1,682 tree annotated sentences. The e...
متن کاملConstraint - Based Lexica
As the field of generative linguistics has developed, the lexicon has taken on an increasingly important role in the description of both idiosyncratic and regular properties of language. Always viewed as a natural home for exceptions, the lexicon was given relatively little work in the early years of transformational grammar. Then Chomsky proposed in 1970 (Chomsky, 1970) that similarities in th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/cs/0702081 شماره
صفحات -
تاریخ انتشار 2007