In healthcare research, data contains many types of categorical variables, such as race, country, zip code, from low to high number levels in each category. However, aforementioned features need encode into numeric forms before applying machine learning algorithms. Therefore, it is critical find a suitable encoding method for coefficient estimation and prediction. this study, we investigate thr...