نتایج جستجو برای: linear regression models perform better on unseen data
تعداد نتایج: 9911205 فیلتر نتایج به سال:
ABSTRACT This paper applies a decision tree model and logistic regression models to a real transportation problem, compares results of these two methods and presents model building procedures as well. The data set is partitioned into train, validation and test data. Due to the skewness of some variables, the variable transformation technique has been conducted and a transformed logistic regre...
Optical character recognition (OCR) systems for machine-printed documents typically require large numbers of font styles and character models to work well. When given a document printed in an unseen font, the performance of those systems degrade even in the absence of noise. In this paper, we perform OCR in an unsupervised fashion without using any character models by using a cryptogram decodin...
Gaussian processes (GPs) are a powerful tool for probabilistic inference over functions. They have been applied to both regression and non-linear dimensionality reduction, and offer desirable properties such as uncertainty estimates, robustness to over-fitting, and principled ways for tuning hyper-parameters. However the scalability of these models to big datasets remains an active topic of res...
We use a convolutional neural network to perform authorship identification on a very homogeneous dataset of scientific publications. In order to investigate the effect of domain biases, we obscure words below a certain frequency threshold, retaining only their POS-tags. This procedure improves test performance due to better generalization on unseen data. Using our method, we are able to predict...
Fruits and citrus wastes are generated in the food industry in large quantities. Their management in Iran, as one of the major hubs of fruits and citrus production, is of great importance. In this study, the biochar samples were prepared from pomegranate, orange and lemon peel waste produced in a juice factory using the pyrolysis process in the range of 400-500 °C; then their efficiency for zin...
This paper transforms fuzzy number into clear number using the centroid method, thus we can research the traditional linear regression model which is transformed from the fuzzy linear regression model. The model’s input and output are fuzzy numbers, and the regression coefficients are clear numbers. This paper considers the parameter estimation and impact analysis based on data deletion. Throug...
The process of pollutant adsorption from industrial wastewaters is a multivariate problem. This process is affected by many factors including the contact time (T), pH, adsorbent weight (m), and solution concentration (ppm). The main target of this work is to model and evaluate the process of pollutant adsorption from industrial wastewaters using the non-linear multivariate regression and intell...
solar radiation data play an important role in solar energy relevant researches. these data are not available for some locations due to the absence of the meteorological stations. therefore, solar radiation data have to be predicted by using solar radiation estimation models. this study presents an integrated artificial neural network (ann) approach for estimating solar radiation potential over...
Abstract Negative binomial regression model (NBR) is a popular approach for modeling overdispersed count data with covariates. Several parameterizations have been performed for NBR, and the two well-known models, negative binomial-1 regression model (NBR-1) and negative binomial-2 regression model (NBR-2), have been applied. Another parameterization of NBR is negative binomial-P regression mode...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید