نتایج جستجو برای: class imbalance problem

تعداد نتایج: 1244703  

Journal: :Computational Intelligence 2004
Andrew Estabrooks Taeho Jo Nathalie Japkowicz

Re-Sampling methods are commonly used for dealing with the class-imbalance problem. Their advantage over other methods is that they are external and thus, easily transportable. Although such approaches can be very simple to implement, tuning them most effectively is not an easy task. In particular, it is unclear whether oversampling is more effective than undersampling and which oversampling or...

Journal: :Briefings in bioinformatics 2013
Wei-Jiun Lin James J. Chen

A class-imbalanced classifier is a decision rule to predict the class membership of new samples from an available data set where the class sizes differ considerably. When the class sizes are very different, most standard classification algorithms may favor the larger (majority) class resulting in poor accuracy in the minority class prediction. A class-imbalanced classifier typically modifies a ...

2012
Madhuri Agrawal Gajendra Singh Ravindra Kumar Gupta

The paper addresses some theoretical and practical aspects of data mining, focusing on predictive data mining, where two central types of prediction problems are discussed: classification and regression. Further accent is made on predictive data mining, where the time-stamped data greatly increase the dimensions and complexity of problem solving. The main goal is through processing of data (rec...

2006
Manolis Maragoudakis Katia Kermanidis Aristogiannis Garbis Nikos Fakotakis

For the present work, we deal with the significant problem of high imbalance in data in binary or multi-class classification problems. We study two different linguistic applications. The former determines whether a syntactic construction (environment) co-occurs with a verb in a natural text corpus consists a subcategorization frame of the verb or not. The latter is called Name Entity Recognitio...

Journal: :The Journal of Supercomputing 2023

Abstract Network intrusion detection systems (NIDS) are the most common tool used to detect malicious attacks on a network. They help prevent ever-increasing different and provide better security for NIDS classified into signature-based anomaly-based detection. The type of is which based machine learning models able with high accuracy. However, in recent years, has achieved even results detecti...

Journal: :The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2020

2016
Ali Braytee Wei Liu Paul J. Kennedy

In this paper, novel cost-sensitive principal component analysis (CSPCA) and cost-sensitive non-negative matrix factorization (CSNMF) methods are proposed for handling the problem of feature extraction from imbalanced data. The presence of highly imbalanced data misleads existing feature extraction techniques to produce biased features, which results in poor classi cation performance especially...

2011
Eleftherios Spyromitros Xioufis Myra Spiliopoulou Grigorios Tsoumakas Ioannis P. Vlahavas

Streams of objects that are associated with one or more labels at the same time appear in many applications. However, stream classification of multi-label data is largely unexplored. Existing approaches try to tackle the problem by transferring traditional single-label stream classification practices to the multi-label domain. Nevertheless, they fail to consider some of the unique properties of...

2014
José A. Sáez Julián Luengo Jerzy Stefanowski Francisco Herrera

Imbalance data constitutes a great difficulty for most algorithms learning classifiers. However, as recent works claim, class imbalance is not a problem in itself and performance degradation is also associated with other factors related to the distribution of the data as the presence of noisy and borderline examples in the areas surrounding class boundaries. This contribution proposes to extend...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید