Poor Estimates of Context are Worse than None
نویسندگان
چکیده
It is difficult to estimate the probability of a word's context because of sparse data problems. If appropriate care is taken, we find that it is possible to make useful estimates of contextual probabilities that improve performance in a spelling correction application. In contrast, less careful estimates are found to be useless. Specifically, we will show that the Good-Turing method makes the use of contextual information practical for a spelling corrector, while attempts to use the maximum likelihood estimator (MLE) or expected Idcellhood estimator (ELE) fail. Spelling correction was selected as an application domain because it is analogous to many important recognition applications based on a noisy channel model (such as speech recognition), though somewhat simpler and therefore possibly more amenable to detailed statistical analysis.
منابع مشابه
Comparison of Estimates Using Record Statistics from Lomax Model: Bayesian and Non Bayesian Approaches
This paper address the problem of Bayesian estimation of the parameters, reliability and hazard function in the context of record statistics values from the two-parameter Lomax distribution. The ML and the Bayes estimates based on records are derived for the two unknown parameters and the survival time parameters, reliability and hazard functions. The Bayes estimates are obtained based on conju...
متن کاملO-26: Importance of IL-18 in Serum and Follicle Fluid in The Context of Fertility Treatment
Background: Some authors detected significantly higher IL-18 levels in serum, peritoneal, and pleural fluids of patients with severe OHSS as compared with control groups and suggest a role of IL- 18 as a marker of OHSS. Lower levels of IL-18 have been found to characterize unexplained infertility. In this study, we analyzed the importance of IL-18 levels in serum and follicle fluid in response ...
متن کاملاندازه گیری آسیب پذیری کودکان کشور در مقابل فقر
Objective: The aim of measuring vulnerability of children to poverty is to estimate the probability 01 being poor according to the household’s head socioeconomic characteristics. The estimates of the vulnerability to poverty can be used as a guideline to the policymakers to allocate the public subsidies to the poor children and their families. Methodology: Children are at a higher risk of ...
متن کاملاندازه گیری آسیب پذیری کودکان کشور در مقابل فقر
Objective: The aim of measuring vulnerability of children to poverty is to estimate the probability 01 being poor according to the household’s head socioeconomic characteristics. The estimates of the vulnerability to poverty can be used as a guideline to the policymakers to allocate the public subsidies to the poor children and their families. Methodology: Children are at a higher risk of ...
متن کاملبررسی کیفیت خواب دانشجویان ساکن در خوابگاههای دانشگاه علوم پزشکی تهران در سال 1390
<p Background & Objectives: Sleep quality is an important factor in student life and affects in their learning process. Sleep problems are related to increased health concerns, irritability, depression, fatigue, attention and concentration difficulties, along with poor academic performance. The aim of this paper is to conduct a survey based on a questionnaire that would characterize the q...
متن کامل