Machine learning-based classification of rock discontinuity trace: SMOTE oversampling integrated with GBT ensemble learning

نویسندگان

چکیده

This paper presents a hybrid ensemble classifier combined synthetic minority oversampling technique (SMOTE), random search (RS) hyper-parameters optimization algorithm and gradient boosting tree (GBT) to achieve efficient accurate rock trace identification. A thirteen-dimensional database consisting of basic, vector, discontinuity features is established from image samples. All data points are classified as either “trace” or “non-trace” divide the ultimate results into candidate It found that SMOTE technology can effectively improve classification performance by recommending an optimized imbalance ratio 1:5 1:4. Then, sixteen classifiers generated four basic machine learning (ML) models applied for comparison. The reveal proposed RS-SMOTE-GBT outperforms other fifteen ML algorithms both non-trace classifications. Finally, discussions on feature importance, generalization ability error conducted classifier. experimental indicate more critical affecting primarily features. Besides, cleaning up sedimentary pumice reducing area fractured contribute improving overall performance. method provides new alternative approach identification 3D trace.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Oversampling for Imbalanced Learning Based on K-Means and SMOTE

Learning from class-imbalanced data continues to be a common and challenging problem in supervised learning as standard classification algorithms are designed to handle balanced class distributions. While different strategies exist to tackle this problem, methods which generate artificial data to achieve a balanced class distribution are more versatile than modifications to the classification a...

متن کامل

Geometric SMOTE: Effective oversampling for imbalanced learning through a geometric extension of SMOTE

Classification of imbalanced datasets is a challenging task for standard algorithms. Although many methods exist to address this problem in different ways, generating artificial data for the minority class is a more general approach compared to algorithmic modifications. SMOTE algorithm and its variations generate synthetic samples along a line segment that joins minority class instances. In th...

متن کامل

Voting based Extreme Learning Machine with Accuracy based ensemble Pruning

Extreme Learning Machine is a fast single layer feed forward neural network for real valued classification. It suffers from the problem of instability and over fitting. Voting based Extreme Learning Machine, VELM reduces this performance variation in Extreme Learning Machine by employing majority voting based ensembling technique. VELM improves the performance of ELM at the cost of increased re...

متن کامل

Ensemble machine learning on gene expression data for cancer classification.

Whole genome RNA expression studies permit systematic approaches to understanding the correlation between gene expression profiles to disease states or different developmental stages of a cell. Microarray analysis provides quantitative information about the complete transcription profile of cells that facilitate drug and therapeutics development, disease diagnosis, and understanding in the basi...

متن کامل

A novel ensemble machine learning for robust microarray data classification

Microarray data analysis and classification has demonstrated convincingly that it provides an effective methodology for the effective diagnosis of diseases and cancers. Although much research has been performed on applying machine learning techniques for microarray data classification during the past years, it has been shown that conventional machine learning techniques have intrinsic drawbacks...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International journal of mining science and technology

سال: 2022

ISSN: ['2095-2686', '2589-062X']

DOI: https://doi.org/10.1016/j.ijmst.2021.08.004