lexical segmentation

نتایج جستجو برای: lexical segmentation

تعداد نتایج: 95920 فیلتر نتایج به سال:

The Segmentation of Sub-Lexical Morphemes in English-Learning 15-Month-Olds

Journal: :Frontiers in Psychology 2013

متن کامل

Constraining Models of Jacques Mehler , Lexical Access : The Onset of Emmanuel Dupoux , and Word Recognition Juan Segui

2016

Juan Segui

at many distinguished institutions may license the conclusion that continuous speech has no obvious cues that facilitate the segmentation process es. However , even if cues indicating boundaries or natural segments have not been found , it would be rash to claim that they cannot be found. The lexicon is acquired, and infants are provided with little information about words pronounced in isolati...

متن کامل

Amplitude fluctuations in a masker influence lexical segmentation in cochlear implant users

Journal: :The Journal of the Acoustical Society of America 2015

متن کامل

Rules Design in Word Segmentation of Chinese Micro-Blog

2012

Hao Zong Derek F. Wong Lidia S. Chao

This paper proposed a Hidden Markov Model (HMM) based tokenizer for Chinese micro-blog texts. Comparing with normal Chinese texts, micro-blog texts contain more uncertainties. These uncertainties are generally aroused by the irregular use of bloggers (such as network words, dialect words, wrong written characters, mixture of foreign words and symbols, etc.). Besides the lack of the annotated tr...

متن کامل

HHMM-based Chinese Lexical Analyzer ICTCLAS

2003

Huaping Zhang Hongkui Yu Deyi Xiong Qun Liu

This document presents the results from Inst. of Computing Tech., CAS in the ACLSIGHAN-sponsored First International Chinese Word Segmentation Bakeoff. The authors introduce the unified HHMM-based frame of our Chinese lexical analyzer ICTCLAS and explain the operation of the six tracks. Then provide the evaluation results and give more analysis. Evaluation on ICTCLAS shows that its performance ...

متن کامل

TrTok: A Fast and Trainable Tokenizer for Natural Languages

Journal: :Prague Bull. Math. Linguistics 2012

Jirí Marsík Ondrej Bojar

We present a universal data-driven tool for segmenting and tokenizing text. The presented tokenizer lets the user define where token and sentence boundaries should be considered. These instances are then judged by a classifier which is trained from provided tokenized data. The features passed to the classifier are also defined by the user making, e.g., the inclusion of abbreviation lists trivia...

متن کامل

A Joint Segmenting and Labeling Approach for Chinese Lexical Analysis

2008

Xinhao Wang Jiazhong Nie Dingsheng Luo Xihong Wu

This paper introduces an approach which jointly performs a cascade of segmentation and labeling subtasks for Chinese lexical analysis, including word segmentation, named entity recognition and partof-speech tagging. Unlike the traditional pipeline manner, the cascaded subtasks are conducted in a single step simultaneously, therefore error propagation could be avoided and the information could b...

متن کامل

Building a Syllable Database to Solve the Problem of Khmer Word Segmentation

Journal: :CoRR 2017

Nam Tran Van

Word segmentation is a basic problem in natural language processing. With the languages having the complex writing system like the Khmer language in Southern of Vietnam, this problem really very intractable, posing the significant challenges. Although there are some experts in Vietnam as well as international having deeply researched this problem, there are still no reasonable results meeting t...

متن کامل

Top-Down versus Bottom-Up Theories of Phonological Acquisition: A Big Data Approach

2017

Christina Bergmann Sho Tsuji Alejandrina Cristià

Recent work has made available a number of standardized metaanalyses bearing on various aspects of infant language processing. We utilize data from two such meta-analyses (discrimination of vowel contrasts and word segmentation, i.e., recognition of word forms extracted from running speech) to assess whether the published body of empirical evidence supports a bottom-up versus a top-down theory ...

متن کامل

Japanese-to-English Patent Translation System based on Domain-adapted Word Segmentation and Post-ordering

2014

Katsuhito Sudoh Masaaki Nagata Shinsuke Mori Tatsuya Kawahara

This paper presents a Japanese-to-English statistical machine translation system specialized for patent translation. Patents are practically useful technical documents, but their translation needs different efforts from general-purpose translation. There are two important problems in the Japanese-to-English patent translation: long distance reordering and lexical translation of many domain-spec...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید