Discovering Causal Knowledge by Design
نویسندگان
چکیده
Causal knowledge is frequently pursued by researchers in many fields, such as medicine, economics, and social science, yet very little research in knowledge discovery focuses on discovering causal knowledge. Those researchers rely on a set of methods, called experimental and quasiexperimental designs, that exploit the ontological structure of the world to limit the set of possible statistical models that can produce observed correlations among variables. As a result, designs are powerful techniques for drawing conclusions about cause-and-effect relationships. However, designs are almost never used explicitly by knowledge discovery algorithms. In this work, we provide explicit evidence that designs have the potential to be highly useful as part of algorithms to discover causal knowledge. We first formalize the basic elements of experimental and quasiexperimental designs to characterize a design search space. We then quantify the range and diversity of designs that can be applied to examine the central questions associated with a large and complex domain (Wikipedia). Finally, we show that explicit consideration of designs can substantially improve the accuracy of causal inference and increase the statistical power of algorithms for learning the structure of graphical models.
منابع مشابه
Discovering Knowledge from Medical Databases
We investigate new approaches for knowledge discovery from two medical databases. Two different kinds of knowledge, namely rules and causal structures, are learned. Rules capture interesting patterns and regularities in the database. Causal structures represented by Bayesian networks capture the causality relationships among the attributes. We employ advanced evolutionary algorithms for these d...
متن کاملDiscovering Causal Chains by Integrating Plan Recognition and Sequential Pattern Mining
In this paper we define the notion of causal chains. Causal chains are a particular kind of sequential patterns that reflect causality relations according to background knowledge. We also present an algorithm for mining causal chains from a collection of action traces. We run this algorithm on a realworld domain and observe that causal chains can be computed efficiently by quickly identifying i...
متن کاملDiscovering Causal Relations by Experimentation: Causal Trees
The Controlled Lesion Method (CLM) is a set of principles for inferring the causal structure of deterministic mechanisms by experimentation. CLM formulates an important part of the common-sense logic of causation and experimentation. Previous work showed that CLM could be used to discover the structure of deterministic chains of binary variables more accurately than statistical methods; however...
متن کاملLearning Causal Models of Relational Domains
Methods for discovering causal knowledge from observational data have been a persistent topic of AI research for several decades. Essentially all of this work focuses on knowledge representations for propositional domains. In this paper, we present several key algorithmic and theoretical innovations that extend causal discovery to relational domains. We provide strong evidence that effective le...
متن کاملDiscovering Prerequisite Relationships Among Knowledge Components
Knowing the prerequisite structure among the knowledge components in a domain is crucial for designing instruction and for assessing mastery. Treating KCs as latent variables, we investigate how data on the items that test these skills can be used to discover the prerequisite structure among such skills. Our method assumes that we know or have discovered the Q-matrix (the measurement model) tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009