Sampling strategies for mining in data-scarce domains

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Sampling for Data Mining

Data mining is a complex process that aims to derive an accurate predictive model starting from a collection of data. Traditional approaches assume that data are given in advance and their quality, size and structure are independent parameters. In this paper we argue that an extended vision of data mining should include the step of data acquisition as part of the overall process. Moreover the s...

متن کامل

Database Sampling for Data Mining

In data mining, sampling may be used as a technique for reducing the amount of data presented to a data mining algorithm. Other strategies for data reduction include dimension reduction, data compression, and discretisation. For sampling, the aim is to draw, from a database, a random sample, which has the same characteristics as the original database. This chapter looks at the sampling methods ...

متن کامل

Strategies for Parallelizing Data Mining

We classify parallelization strategies for data mining algorithms, concentrating on those techniques in which training data is partitioned, and extracted properties shared between processors at the end of each phase. This approach has been extensively investigated for association rules and decision trees. We sketch some similar work for supervised and unsupervised neural networks, and for the k...

متن کامل

Strategies for parallel data mining

We present a set of cost measures that can be applied to parallel algorithms to predict their computation, data access, and communication performance. These measures make it possible to compare di erent possible parallel implementation strategies for data mining techniques without the necessity to benchmark each one. We give general cost expressions for three common parallelizing strategies, an...

متن کامل

Sizing Strategies in Scarce Environments

Competition is fierce and often the first to act has an advantage, especially in environments where there are excess resources. However, expanding quickly to absorb excess resources creates requirements that might be unmet in future conditions of scarcity. Different patterns of scarcity call for different strategies. We define a model of interacting specialists (entities) to analyze which sizin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computing in Science & Engineering

سال: 2002

ISSN: 1521-9615

DOI: 10.1109/mcise.2002.1014978