Regression with n→1 by Expert Knowledge Elicitation
Abstract
We consider regression under the “extremely small n large p” condition. In particular, we focus on problems where the sample size n is so small relative to the dimensionality p, even n → 1, that predictors cannot be estimated without prior knowledge. Furthermore, we assume that all prior knowledge that can be automatically extracted from databases has already been taken into account. This setup occurs in personalized medicine, for instance, when predicting treatment outcomes for an individual patient based on noisy high-dimensional genomics data. A remaining source of information is expert knowledge, which has received relatively little attention in recent years. We formulate the inference problem of asking for expert feedback on features under a budget, present experimental results for two setups, “small n” and “n=1 with similar data available”, and derive conditions under which the elicitation strategy is optimal. Experiments with simulated experts, on both simulated and genomics data, demonstrate that the proposed strategy can drastically improve prediction accuracy.
Similar resources
Probabilistic Expert Knowledge Elicitation of Feature Relevances in Sparse Linear Regression
In this extended abstract, we consider the “small n, large p” prediction problem, where the number of available samples n is much smaller than the number of covariates p. This challenging setting is common in many applications, such as precision medicine, where obtaining additional samples can be extremely costly or even impossible. Extensive research effort has recently been dedic...
Elicitator: An expert elicitation tool for regression in ecology
Expert elicitation is the process of retrieving and quantifying expert knowledge in a particular domain. Such information is of particular value when the empirical data is expensive, limited or unreliable. This paper describes a new software tool, called Elicitator, which assists in quantifying expert knowledge in a fo...
Designing Elicitor: Software to Graphically Elicit Expert Priors for Logistic Regression Models in Ecology. 2006
ELICITOR is graphical elicitation software created to elicit normal prior distributions for a Bayesian logistic regression model. Motivated by a real need to include expert knowledge in presence–absence models in ecology, this research describes a synthesis of theory from statistics, psychology and ecology. The aim was to build elicitation software that would be user friendly to environmental s...
Eliciting expert knowledge in conservation science.
Expert knowledge is used widely in the science and practice of conservation because of the complexity of problems, relative lack of data, and the imminent nature of many conservation decisions. Expert knowledge is substantive information on a particular topic that is not widely known by others. An expert is someone who holds this knowledge and who is often deferred to in its interpretation. We ...
Knowledge Elicitation for Design Task Sequencing Knowledge
There are many types of knowledge involved in producing a design (the process of specifying a description of an artifact that satisfies a collection of constraints [Brown, 1992]). Of these, one of the most crucial is the design plan: the sequence of steps taken to create the design (or a portion of the design). A number of knowledge elicitation methods can be used to obtain this knowledge from ...
Publication date: 2016