Unsupervised Chunking Based on Graph Propagation from Bilingual Corpus
نویسندگان
چکیده
منابع مشابه
Unsupervised Chunking Based on Graph Propagation from Bilingual Corpus
This paper presents a novel approach for unsupervised shallow parsing model trained on the unannotated Chinese text of parallel Chinese-English corpus. In this approach, no information of the Chinese side is applied. The exploitation of graph-based label propagation for bilingual knowledge transfer, along with an application of using the projected labels as features in unsupervised model, contr...
متن کاملUnsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections
We describe a novel approach for inducing unsupervised part-of-speech taggers for languages that have no labeled training data, but have translated text in a resource-rich language. Our method does not assume any knowledge about the target language (in particular no tagging dictionary is assumed), making it applicable to a wide array of resource-poor languages. We use graph-based label propagat...
متن کاملA Graph-based Bilingual Corpus Selection Approach for SMT
In statistical machine translation, the number of sentence pairs in the bilingual corpus is very important to the quality of translation. However, when the quantity reaches some extent, enlarging the corpus has less effect on the translation quality; whereas increasing greatly the time and space complexity to train the translation model, which hinders the development of statistical machine tran...
متن کاملStructure Alignment Using Bilingual Chunking
A new statistical method called “bilingual chunking” for structure alignment is proposed. Different with the existing approaches which align hierarchical structures like sub-trees, our method conducts alignment on chunks. The alignment is finished through a simultaneous bilingual chunking algorithm. Using the constrains of chunk correspondence between source language (SL)1 and target language (...
متن کاملTitle of Thesis: Textual Representations for Corpus-based Bilingual Retrieval Title of Thesis: Textual Representations for Corpus-based Bilingual Retrieval Textual Representations for Corpus-based Bilingual Retrieval
Title of Thesis: Textual Representations for Corpus-Based Bilingual Retrieval Paul McNamee, Doctor of Philosophy, 2008 Thesis directed by: Charles K. Nicholas, Professor Department of Computer Science and Electrical Engineering The traditional approach to information retrieval is based on using words as the indexing and search terms for documents. However, word-based representations have diffic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Scientific World Journal
سال: 2014
ISSN: 2356-6140,1537-744X
DOI: 10.1155/2014/401943