Search results for: pattern discovery problem

Number of results: 1318153

2012
Hugo Alatrista Salas Sandra Bringay Frédéric Flouvat Nazha Selmaoui-Folcher Maguelonne Teisseire

Health risk management, such as the study of epidemics, produces large quantities of spatio-temporal data. Developing new methods able to handle the specific characteristics of such data becomes crucial. To tackle this problem, we define a theoretical framework for extracting spatio-temporal patterns (sequences representing the evolution of locations and their neighborhoods over time). Classical frequency suppor...

Journal: RAIRO - Operations Research 1984

1997
Heikki Mannila Hannu Toivonen

One of the basic problems in knowledge discovery in databases (KDD) is the following: given a data set r, a class L of sentences for defining subgroups of r, and a selection predicate, find all sentences of L deemed interesting by the selection predicate. We analyze the simple levelwise algorithm for finding all such descriptions. We give bounds for the number of database accesses that the algorithm...

Journal: Theoretical Computer Science 2008

2014
Jonathan H. Huggins Cynthia Rudin

This paper formalizes a latent variable inference problem we call supervised pattern discovery, the goal of which is to find sets of observations that belong to a single “pattern.” We discuss two versions of the problem and prove uniform risk bounds for both. In the first version, collections of patterns can be generated in an arbitrary manner and the data consist of multiple labeled collection...

Journal: CoRR 2014

2002
Alberto Apostolico

Many tasks of contemporary Molecular Biology rely increasingly on automated techniques for the discovery of interesting patterns and associations among them, both in individual sequences and across sequence families. A number of computational models and tools have been set up in recent years in response to these needs. This paper concentrates on approaches based on discrete combinatorial algori...

2014
Sheehan Khan Russell Greiner

In this paper we present the budgeted biomarker discovery problem as an alternative to the association studies traditionally used to identify biomarkers. We present several strong arguments to show why adopting this new problem will help solve issues in reproducibility and understanding of association studies. Additionally, we present several algorithms for this problem and show their performance...

1992
Clayton Scott Gowtham Bellala Rebecca Willett

Abstract: The false discovery rate (FDR) and false nondiscovery rate (FNDR) have received considerable attention in the literature on multiple testing. These performance measures are also appropriate for classification, and in this work we develop generalization error analyses for FDR and FNDR when learning a classifier from labeled training data. Unlike more conventional classification perform...

2007
Guihua Sun Xiaohua Liu Gao Cong Ming Zhou Zhongyang Xiong John Lee Chin-Yew Lin

This paper studies the problem of identifying erroneous/correct sentences. The problem has important applications, e.g., providing feedback for writers of English as a Second Language, controlling the quality of parallel bilingual sentences mined from the Web, and evaluating machine translation results. In this paper, we propose a new approach to detecting erroneous sentences by integrating pat...
