Machine learning-based classification of rock discontinuity trace: SMOTE oversampling integrated with GBT ensemble learning
نویسندگان
چکیده
This paper presents a hybrid ensemble classifier combined synthetic minority oversampling technique (SMOTE), random search (RS) hyper-parameters optimization algorithm and gradient boosting tree (GBT) to achieve efficient accurate rock trace identification. A thirteen-dimensional database consisting of basic, vector, discontinuity features is established from image samples. All data points are classified as either “trace” or “non-trace” divide the ultimate results into candidate It found that SMOTE technology can effectively improve classification performance by recommending an optimized imbalance ratio 1:5 1:4. Then, sixteen classifiers generated four basic machine learning (ML) models applied for comparison. The reveal proposed RS-SMOTE-GBT outperforms other fifteen ML algorithms both non-trace classifications. Finally, discussions on feature importance, generalization ability error conducted classifier. experimental indicate more critical affecting primarily features. Besides, cleaning up sedimentary pumice reducing area fractured contribute improving overall performance. method provides new alternative approach identification 3D trace.
منابع مشابه
Oversampling for Imbalanced Learning Based on K-Means and SMOTE
Learning from class-imbalanced data continues to be a common and challenging problem in supervised learning as standard classification algorithms are designed to handle balanced class distributions. While different strategies exist to tackle this problem, methods which generate artificial data to achieve a balanced class distribution are more versatile than modifications to the classification a...
متن کاملGeometric SMOTE: Effective oversampling for imbalanced learning through a geometric extension of SMOTE
Classification of imbalanced datasets is a challenging task for standard algorithms. Although many methods exist to address this problem in different ways, generating artificial data for the minority class is a more general approach compared to algorithmic modifications. SMOTE algorithm and its variations generate synthetic samples along a line segment that joins minority class instances. In th...
متن کاملVoting based Extreme Learning Machine with Accuracy based ensemble Pruning
Extreme Learning Machine is a fast single layer feed forward neural network for real valued classification. It suffers from the problem of instability and over fitting. Voting based Extreme Learning Machine, VELM reduces this performance variation in Extreme Learning Machine by employing majority voting based ensembling technique. VELM improves the performance of ELM at the cost of increased re...
متن کاملEnsemble machine learning on gene expression data for cancer classification.
Whole genome RNA expression studies permit systematic approaches to understanding the correlation between gene expression profiles to disease states or different developmental stages of a cell. Microarray analysis provides quantitative information about the complete transcription profile of cells that facilitate drug and therapeutics development, disease diagnosis, and understanding in the basi...
متن کاملA novel ensemble machine learning for robust microarray data classification
Microarray data analysis and classification has demonstrated convincingly that it provides an effective methodology for the effective diagnosis of diseases and cancers. Although much research has been performed on applying machine learning techniques for microarray data classification during the past years, it has been shown that conventional machine learning techniques have intrinsic drawbacks...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal of mining science and technology
سال: 2022
ISSN: ['2095-2686', '2589-062X']
DOI: https://doi.org/10.1016/j.ijmst.2021.08.004