نتایج جستجو برای: categorical data

تعداد نتایج: 2420747  

2015
Adel Benaissa Salima Benbernou Mourad Ouziri Soror Sahri

Today, a large amount of uncertain data is produced by several applications where the management systems of traditional databases including indexing methods are not suitable to handle such type of data. In this paper, we propose an inverted based index method for efficiently searching uncertain categorical data over distributed environments. We address two kinds of query over the distributed un...

ژورنال: محاسبات نرم 2017

Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...

Journal: :Statistical Methods in Medical Research 2012

Journal: :JORS 2003
Antonella Basso Stefania Funari

The ethical constraints forced on an investment fund satisfy the fulfilment of humanitarian aims but may lower the investment profitability. Hence, when we measure the performance of ethical mutual funds we cannot disregard the ethical component. In this contribution we propose a performance indicator which considers the expected return, the investment risk, the ethical component and the subscr...

Journal: :International Journal of Data Mining & Knowledge Management Process 2012

2002

The dataset you’ll examine is from a study by the California Department of Corrections (CDC) on the effectiveness of prisoner placement, and the likelihood of misconduct while incarcerated. Upon admission to a California prison, an inmate is given a questionnaire. The score on this questionnaire determines which level of security (Level I is the lightest security, Level IV the heaviest) to whic...

2016
Ying Wen Jun Wang Tianyao Chen Weinan Zhang

This paper presents a method of learning distributed representation for multi-field categorical data, which is a common data format with various applications such as recommender systems, social link prediction, and computational advertising. The success of non-linear models, e.g., factorisation machines, boosted trees, has proved the potential of exploring the interactions among inter-field cat...

2002
Jan de Leeuw

The dataset you’ll examine is from a study by the California Department of Corrections (CDC) on the effectiveness of prisoner placement, and the likelihood of misconduct while incarcerated. Upon admission to a California prison, an inmate is given a questionnaire. The score on this questionnaire determines which level of security (Level I is the lightest security, Level IV the heaviest) to whic...

2012
Arno Siebes René Kersten

Global models of a dataset reflect not only the large scale structure of the data distribution, they also reflect small(er) scale structure. Hence, if one wants to see the large scale structure, one should somehow subtract this smaller scale structure from the model. While for some kinds of model – such as boosted classifiers – it is easy to see the “important” components, for many kind of mode...

1995
FRANK PIESSENS ERIC STEEGMANS Jiri Rosicky

We introduce MD sketches which are a particular kind of Finite Sum sketches Two interesting results about MD sketches are proved First we show that given two MD sketches it is algorithmically decidable whether their model categories are equivalent Next we show that data speci cations as used in database design and software engineering can be translated to MD sketches As a corollary we obtain th...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید