Unsupervised learning of natural languages
نویسندگان
چکیده
منابع مشابه
Unsupervised learning of natural languages.
We address the problem, fundamental to linguistics, bioinformatics, and certain other disciplines, of using corpora of raw symbolic sequential data to infer underlying rules that govern their production. Given a corpus of strings (such as text, transcribed speech, chromosome or protein sequence data, sheet music, etc.), our unsupervised algorithm recursively distills from it hierarchically stru...
متن کاملUnsupervised learning of graph languages using kernels
In most domains, machine learning involves discovering the relationship between objects. Such relationships are naturally represented as graphs. Up until now, most work on learning graphs consisted of extending well-known approaches from the field of machine learning to deal with more complex structures. For both technical and conceptual reasons, borrowing techniques from Grammar Induction (GI)...
متن کاملExperiments in Unsupervised Learning of Natural Language
Linguistics has invented and discarded many theories of language, and there are currently many competitors to the basic idea of phrase structure grammars as capturing the syntactic structure of language. Computational Linguistics has proven to be a testing ground for theories and grammars, and is similarly diverse. Moreover recently we have learnt that the similar principles and techniques may ...
متن کاملUnsupervised Learning for Natural Language Processing
Given the abundance of text data, unsupervised approaches are very appealing for natural language processing. We present three latent variable systems which achieve state-of-the-art results in domains previously dominated by fully supervised systems. For syntactic parsing, we describe a grammar induction technique which begins with coarse syntactic structures and iteratively refines them in an ...
متن کاملLearning to Transform Natural to Formal Languages
This paper presents a method for inducing transformation rules that map natural-language sentences into a formal query or command language. The approach assumes a formal grammar for the target representation language and learns transformation rules that exploit the non-terminal symbols in this grammar. The learned transformation rules incrementally map a natural-language sentence or its syntact...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the National Academy of Sciences
سال: 2005
ISSN: 0027-8424,1091-6490
DOI: 10.1073/pnas.0409746102