Inductive Inference of Context-free Languages - Context-free Expression Method
نویسنده
چکیده
An inductive inference problem of context-free languages is investigated. There have been many attempts to this problem, and most of them are based on a problem setting in which a representation space for hypotheses is a class of context-free grammars. An inference algorithm given in this paper , on the contrary, employs a kind of extensions of regular expressions called context-free expressions as a representation space for context-free languages. The algorithm, based on the notion of an identification in the l imit, is significantly concise when compared with existing algorithms. 1. I n t r o d u c t i o n We consider the following model of inductive inference problem: Given an object L of inference, an inductive inference device (IID) tries to infer a representation H for the object from examples. It is assumed that IID has an enumeration mechanism by which any possible hypothesis from the representation space can be eventually enumerated at least once. It is also assumed that we can utilize an oracle for presenting examples concerning the object. IID asks the oracle for an example, and computes hypothesis and outputs i t , and again asks another example for the next step, and this process is cycled. In a sequence of hypotheses H1, H2 , . . . I ID is said to identify L in the l imit if there exists a positive integer n such that Hn represents L and Hn + i equals to Hn for all i >0. A simple algorithm for identification in the l imit is the one based on the notion of identification by enumeration. Let H1, H2,... be an effective enumeration of the possible hypotheses, and suppose a set of examples e1,e2,...,ek are presented. Then, ED provides as its next output the first hypothesis which is compatible with all these examples. Under the assumption of a perfect oracle, the sequence of hypotheses converges in the limit.([Gold 1967]) In this paper, we deal with the inductive inference problem for context-free languages, and employ a representation space for hypotheses different from the ones in the existing methods. This enables us to make an elegant discussion on the problem and to obtain a simple algorithm for solving the problem.
منابع مشابه
Context-Free Language Induction by Evolution of Deterministic Push-Down Automata Using Genetic Programming<
The process of learning often consists of Inductive Inference, making generalizations from samples. The problem here is finding generalizations (Grammars) for Formal Languages from finite sets of positive and negative sample sentences. The focus of this paper is on Context-Free Languages (CFL’s) as defined by Context-Free Grammars (CFG’s), some of which are accepted by Deterministic Push-Down A...
متن کاملA Polynomial Algorithm for the Inference of Context Free Languages
We present a polynomial algorithm for the inductive inference of a large class of context free languages, that includes all regular languages. The algorithm uses a representation which we call Binary Feature Grammars based on a set of features, capable of representing richly structured context free languages as well as some context sensitive languages. More precisely, we focus on a particular c...
متن کاملUsing Contextual Representations to Efficiently Learn Context-Free Languages
We present a polynomial update time algorithm for the inductive inference of a large class of context-free languages using the paradigm of positive data and a membership oracle. We achieve this result by moving to a novel representation, called Contextual Binary Feature Grammars (CBFGs), which are capable of representing richly structured context-free languages as well as some context sensitive...
متن کاملAdvances in Learning Formal Languages
we present an overview in the advances related to the learning of formal languages i.e. development in the grammatical inference research. The problem of learning correct grammars for the unknown languages is known as grammatical inference. It is considered a main subject of inductive inference, and grammars are important representations to be investigated in machine learning from both theoreti...
متن کاملErweiterung des EMILE-Verfahrens zum induktiven Lernen von kontext-freien Grammatiken für natürliche Sprache. Extension of the EMILE algorithm for inductive learning of context-free grammars for natural languages
Extension of the EMILE algorithm for inductive learning of context-free grammars for natural languages.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1987