Evaluation of Imputation Techniques for Genotypic Data of Soybean Crop under Missing Completely at Random Mechanism

نویسندگان

چکیده

Background: The issue of missing data is prevalent in all type research work, which can diminish statistical power and lead to inaccurate results if not managed correctly. Missing cannot be ignored because every piece data, no matter how small, affects the outcome significantly. Imputation a key component dealing with data; however, best way impute values has yet been identified. Methods: Our goal this paper compare four more recently developed imputation techniques - MICE, MI, missForest Amelia. In order examine performance various techniques, non-missing were deleted from genotypic soybean crop varied frequency missingness by completely at random mechanism. study compared different for solving using root mean square error absolute error. Result: To fill dataset’s values, technique producing lowest value RMSE MAE will taken into consideration. Finally it observed that performs on proportion missingness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Missing value imputation on missing completely at random data using multilayer perceptrons

Data mining is based on data files which usually contain errors in the form of missing values. This paper focuses on a methodological framework for the development of an automated data imputation model based on artificial neural networks. Fifteen real and simulated data sets are exposed to a perturbation experiment, based on the random generation of missing values. These data set sizes range fr...

متن کامل

Comparative Study of Four Methods in Missing Value Imputations under Missing Completely at Random Mechanism

In analyzing data from clinical trials and longitudinal studies, the issue of missing values is always a fundamental challenge since the missing data could introduce bias and lead to erroneous statistical inferences. To deal with this challenge, several imputation methods have been developed in the literature to handle missing values where the most commonly used are complete case method, mean i...

متن کامل

Imputation methods for quantile estimation under missing at random

Imputation is frequently used to handle missing data for which multiple imputation is a popular technique. We propose a fractional hot deck imputation which produces a valid variance estimator for quantiles. In the proposed method, the imputed values are chosen from the set of respondents and are assigned with proper fractional weights that use a density function for the working model. In addit...

متن کامل

Comparison of missing value imputation methods for crop yield data

Most ecological data sets contain missing values, a fact which can cause problems in the analysis and limit the utility of resulting inference. However, ecological data also tend to be spatially correlated, which can aid in estimating and imputing missing values. We compared four existing methods of estimating missing values: regression, kernel smoothing, universal kriging, and multiple imputat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Indian journal of agricultural research

سال: 2023

ISSN: ['0367-8245', '0976-058X']

DOI: https://doi.org/10.18805/ijare.a-6094