Classification Using Censored Functional Data

Author

  • Aurore Delaigle
Abstract

We consider classification of functional data. This problem has received a lot of attention in the literature in the case where the curves are all observed on the same interval. A difficulty in applications is that the functional curves can be supported on quite different intervals, in which case standard methods of analysis cannot be used. We are interested in constructing classifiers for curves of this type. More precisely, we consider classification of functions supported on a compact interval, in cases where the training sample consists of functions observed on other intervals, which may differ among the training curves. We propose several methods, depending on whether or not the observable intervals overlap by a significant amount. In the case where these intervals differ a lot, our procedure involves extending the curves outside the interval where they were observed. We suggest a new nonparametric approach for doing this. We also introduce flexible ways of combining potential differences in shapes of the curves from different populations, and potential differences between the endpoints of the intervals where the curves from each population are observed. We suggest a fully data-driven approach, and illustrate the performance of our classifier on some real and simulated data. If time permits, we shall talk about some asymptotic properties.
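The case where the observation intervals overlap substantially can be illustrated with a minimal sketch. This is not the classifier proposed in the talk, whose details the abstract does not give; it is a plain nearest-class-mean rule, assuming every curve is sampled on one common grid and unobserved grid points are stored as NaN. The function names `class_means` and `classify` are hypothetical.

```python
import numpy as np

def class_means(curves, labels):
    """Pointwise mean curve of each class, using only observed (non-NaN) points.

    curves: array of shape (n_curves, n_grid); NaN marks unobserved grid points.
    labels: array of shape (n_curves,) with the class label of each curve.
    """
    means = {}
    for c in np.unique(labels):
        block = curves[labels == c]
        observed = ~np.isnan(block)
        counts = observed.sum(axis=0)
        sums = np.where(observed, block, 0.0).sum(axis=0)
        # Grid points never observed in this class stay NaN in the mean.
        means[c] = np.where(counts > 0, sums / np.maximum(counts, 1), np.nan)
    return means

def classify(new_curve, means):
    """Return the class whose mean is closest to new_curve on shared grid points.

    Distance is the mean squared difference over points observed in both
    the new curve and the class mean; classes with no shared points are skipped.
    """
    best, best_dist = None, np.inf
    for c, m in means.items():
        shared = ~np.isnan(new_curve) & ~np.isnan(m)
        if not shared.any():
            continue
        dist = np.mean((new_curve[shared] - m[shared]) ** 2)
        if dist < best_dist:
            best, best_dist = c, dist
    return best
```

Averaging the squared difference, rather than summing it, keeps the distances comparable when the shared region differs in length from one class to another. When the intervals barely overlap, this rule has little or nothing to compare, which is the situation the abstract addresses by extending the curves beyond their observed intervals.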


Similar resources

Analysis of Censored Survival Data with Dimension Reduction Methods: Tehran Lipid and Glucose Study

Cardiovascular diseases (CVDs) are the leading cause of death worldwide. To specify an appropriate model to determine the risk of CVD and predict survival rate, users are required to specify a functional form which relates the outcome variables to the input ones. In this paper, we propose a dimension reduction method using a general model, which includes many widely used survival m...


Increasing the accuracy of the classification of diabetic patients in terms of functional limitation using linear and nonlinear combinations of biomarkers: Ramp AUC method

The Area under the ROC Curve (AUC) is a common index for evaluating the ability of the biomarkers for classification. In practice, a single biomarker has limited classification ability, so to improve the classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of...


Bayesian Analysis of Censored Spatial Data Based on a Non-Gaussian Model

In this paper, we suggest using a skew Gaussian-log Gaussian model for the analysis of censored spatial data from a Bayesian point of view. This approach extends the skew log Gaussian model to accommodate skewness, heavy tails, and censoring, three pervasive features of spatial data. We utilize data augme...


Bayesian Estimation of Reliability of the Electronic Components Using Censored Data from Weibull Distribution: Different Prior Distributions

The Weibull distribution has been widely used in survival and engineering reliability analysis. In life-testing experiments it is fairly common practice to terminate the experiment before all the items have failed, which means the data are censored. Thus, the main objective of this paper is to estimate the reliability function of the Weibull distribution with uncensored and censored data by using B...


Tracking Interval for Doubly Censored Data with Application of Plasma Droplet Spread Samples

A doubly censoring scheme, which includes left as well as right censored observations, is frequently encountered in practical studies. In this paper we introduce a new interval, called the tracking interval, for comparing two rival models when the data are doubly censored. We obtain the asymptotic properties of the maximum likelihood estimator under doubly censored data and derive a statistic for testing the ...


Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer

Breast cancer is one of the leading causes of cancer death and specifically accounts for 10.4% of all cancer incidences among women. The prediction of breast cancer recurrence has been a challenging research problem. Data mining techniques have recently received considerable attention, especially when used for the construction of prognosis models from survival data. However, exist...


Publication date: 2013