bayeclass: an R package for learning Bayesian network classifiers. Applications to neuroscience
نویسندگان
چکیده
Machine learning provides tools for automatized analysis of data. The most commonly used form of machine learning is supervised classification. A supervised classifier learns a mapping from the descriptive features of an object to the set of possible classes, from a set of features-class pairs. Once learned, it is used to predict the class for novel data instances. The Bayesian network-based supervised classifiers are particularly useful. They have a solid theoretical basis in probability theory and provide competitive predictive performance. Many algorithms for learning Bayesian network classifiers exist. However, only two are provided in freely available software. The R software environment is the leading open-source system for statistical computing. We provide an implementation of state-of-the-art Bayesian network classifiers for the R environment, in the form of an add-on package called bayesclass. The best-known Bayesian network classifier is the naive Bayes. It assumes that the features are independent given the class. In many domains, more accurate classification is obtained by relaxing these assumptions. The semi-naive Bayes model removes all assumptions of conditional independence within disjoint subsets of features. Its state-of-the-art learning algorithm, the backward sequential elimination and joining (BSEJ) algorithm, tends to produce semi-naive Bayes models with small subsets of related features. Such a model removes only a few of naive Bayes’ independence assumptions. We extend the BSEJ algorithm with a second step which removes some of its unwarranted independence assumptions. Our classifier outperforms the BSEJ and five other Bayesian network classifiers on a set of benchmark databases, although the difference in performance is not statistically significant. Classification of neurons is an important problem in neuroscience. Previously, 42 experts classified a set of interneurons according to a proposed taxonomy. They disagreed on many of the terms of the taxonomy. The gathered data allows for constructing an objective computational model that could resolve the conflicts by assigning the definitive class labels. A supervised classifier can learn the required mapping from quantitative neuronal descriptors to the experts’ taxonomical choices. A challenge is that there are up to 42 class labels per neuron and the most voted label is not always reliable (i.e. has many votes). We use only the cells with reliable class labels to build the classifiers and obtain high predictive accuracy.
منابع مشابه
A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملBDgraph: An R Package for Bayesian Structure Learning in Graphical Models
We introduce an R package BDgraph which performs Bayesian structure learning in high-dimensional graphical models with either continuous or discrete variables. The package efficiently implements recent improvements in the Bayesian literature, including Mohammadi and Wit (2015b) and Mohammadi, Abegaz Yazew, van den Heuvel, and Wit (2015). The core of the BDgraph package consists of two main MCMC...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملLearning Bayesian Networks with the bnlearn Package
bnlearn is an R package (R Team 2009) which includes several algorithms for learning the structure of Bayesian networks with either discrete or continuous variables. Both constraint-based and score-based algorithms are implemented, and can use the functionality provided by the snow package (Tierney et al. 2008) to improve their performance via parallel computing. Several network scores and cond...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013