Sampling strategies for mining in data-scarce domains
Similar resources
Active Sampling for Data Mining
Data mining is a complex process that aims to derive an accurate predictive model starting from a collection of data. Traditional approaches assume that data are given in advance and their quality, size and structure are independent parameters. In this paper we argue that an extended vision of data mining should include the step of data acquisition as part of the overall process. Moreover the s...
Database Sampling for Data Mining
In data mining, sampling may be used as a technique for reducing the amount of data presented to a data mining algorithm. Other strategies for data reduction include dimension reduction, data compression, and discretisation. For sampling, the aim is to draw, from a database, a random sample, which has the same characteristics as the original database. This chapter looks at the sampling methods ...
Strategies for Parallelizing Data Mining
We classify parallelization strategies for data mining algorithms, concentrating on those techniques in which training data is partitioned, and extracted properties shared between processors at the end of each phase. This approach has been extensively investigated for association rules and decision trees. We sketch some similar work for supervised and unsupervised neural networks, and for the k...
Strategies for parallel data mining
We present a set of cost measures that can be applied to parallel algorithms to predict their computation, data access, and communication performance. These measures make it possible to compare different possible parallel implementation strategies for data mining techniques without the necessity to benchmark each one. We give general cost expressions for three common parallelizing strategies, an...
Sizing Strategies in Scarce Environments
Competition is fierce and often the first to act has an advantage, especially in environments where there are excess resources. However, expanding quickly to absorb excess resources creates requirements that might be unmet in future conditions of scarcity. Different patterns of scarcity call for different strategies. We define a model of interacting specialists (entities) to analyze which sizin...
Journal
Journal title: Computing in Science & Engineering
Year: 2002
ISSN: 1521-9615
DOI: 10.1109/mcise.2002.1014978