FuncICA for Time Series Pattern Discovery
نویسندگان
چکیده
We introduce FuncICA, a new independent component analysis method for pattern discovery in inherently functional data, such as time series data. We show how applying the dual of temporal ICA to temporal data, and likewise applying the dual of spatiotemporal ICA to spatiotemporal data, enables independent component regularization not afforded by the primal forms applied to their original domains. We call this family of regularized dual ICA algorithms FuncICA. FuncICA can be considered an analog to functional principal component analysis, where instead of extracting components to minimize L2 reconstruction error, we maximize independence of the components over the functional observations. In this work, we develop an algorithm for extracting independent component curves, derive a method for optimally smoothing the curves, and validate this method on both synthetic and real datasets. Results for synthetic, gene expression, and electroencephalographic event-related potential data indicate that FuncICA can recover well-known scientific phenomena and improve classification accuracy, highlighting its utility for unsupervised learning in continuous data. We conclude this work with a forward-looking, novel framework for fMRI data analysis by making use of the functional dual of spatiotemporal ICA.
منابع مشابه
Pattern Discovery for Locating Motifs in Multivariate, Real-valued Time-series Data
The problem of locating motifs in multivariate, real-valued time series data concerns the discovery of sets of recurring patterns embedded in the time series. Each set is composed of several nonoverlapping subsequences and constitutes a motif because all of the subsequences are similar. This task is a natural extension of univariate motif discovery in both the symbolic and real-valued domains a...
متن کاملA Novel Fuzzy Based Method for Heart Rate Variability Prediction
Abstract In this paper, a novel technique based on fuzzy method is presented for chaotic nonlinear time series prediction. Fuzzy approach with the gradient learning algorithm and methods constitutes the main components of this method. This learning process in this method is similar to conventional gradient descent learning process, except that the input patterns and parameters are stored in mem...
متن کاملConcept Hierarchy-Based Pattern Discovery in Time Series Database: A Case Study on Financial Database
Data Mining is the process of automatically searching large volumes of data for patterns and it is also a fairly recent and contemporary topic in computing. Nowadays, pattern discovery is a field within the area of data mining. In general, large volumes of time series data are contained in financial database and these data have some useful but not easy finding patterns in it and many financial ...
متن کاملGrammarViz 2.0: A Tool for Grammar-Based Pattern Discovery in Time Series
The problem of frequent and anomalous patterns discovery in time series has received a lot of attention in the past decade. Addressing the common limitation of existing techniques, which require a pattern length to be known in advance, we recently proposed grammar-based algorithms for efficient discovery of variable length frequent and rare patterns. In this paper we present GrammarViz 2.0, an ...
متن کاملTheorectical Analysis of Subsequence Time-Series Clustering from a Frequency-Analysis Viewpoint
Although Subsequence Time Series (STS) clustering is one of the most popular pattern discovery techniques from timeseries data, a mathematical methodology for analyzing STS clustering (or pattern discovery from time-series data) has attracted little attention. In the situation, it has had a surprising report [10] that cluster centers obtained using STS clustering closely resemble ”sine waves” w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009