Time Series Clustering for Demand Response An Online Algorithmic Approach

نویسندگان

  • Ranjan Pal
  • Charalampos Chelmis
  • Marc Frincu
  • Viktor Prasanna
چکیده

The widespread monitoring of electricity consumption due to increasingly pervasive deployment of networked sensors in urban environments has resulted in an unprecedentedly large volume of data being collected. To improve sustainability in Smart Grids, realtime data analytics challenges induced by high volume and high dimensional context-based data need to be addressed. Particularly, with the emerging Smart Grid technologies becoming more ubiquitous, analytics for discovering the underlying structure of high dimensional time series data are crucial to convert the massive amount of fine-grained energy information gathered from residential smart meters into appropriate Demand Response (DR) insights. In this paper, we propose an online time series clustering approach to systematically and efficiently manage the energy consumption data deluge, and also capture specific behavior i.e., identify households with similar lifestyle patterns. Customers can in this way be segmented into several groups that can be effectively used to enhance DR policies for real time automatic control in the cyberphysical Smart Grid system. Due to the inherent intractability of the ‘optimal clustering’ problem, we propose a novel randomized approximation clustering scheme of electricity consumption data, aiming at addressing three major issues: (i) designing a resource-constrained, online clustering technique for high volume, high dimensional time series data (ii) determining the optimal number of clusters that gives the best approximate clustering configuration, and (iii) providing strong clustering performance guarantees. By the term ‘performance guarantees’, we imply algorithm performance with respect to the best clustering possible for the given data. Our proposed online clustering algorithm is time efficient, achieves a clustering configuration that is optimal within provable worst case approximation factors, scales to large data sets, and is extensible to parallel and distributed architectures. The applicability of our algorithm goes beyond that of the Smart Grid and includes any scenario where clustering needs to be done on high volume and in real-time under space and time constraints.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Time Series Clustering For Demand Response A Theory to Break the ‘Curse of Dimensionality’

The advent of smart meters and advanced communication infrastructures catalyzes numerous smart grid applications such as dynamic demand response, and paves the way to solve challenging research problems in sustainable energy consumption. The space of solution possibilities are restricted primarily by the huge amount of generated data requiring considerable computational resources and efficient ...

متن کامل

A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach

In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...

متن کامل

Experimental Evaluation of Algorithmic Effort Estimation Models using Projects Clustering

One of the most important aspects of software project management is the estimation of cost and time required for running information system. Therefore, software managers try to carry estimation based on behavior, properties, and project restrictions. Software cost estimation refers to the process of development requirement prediction of software system. Various kinds of effort estimation patter...

متن کامل

Fuzzy clustering of time series data: A particle swarm optimization approach

With rapid development in information gathering technologies and access to large amounts of data, we always require methods for data analyzing and extracting useful information from large raw dataset and data mining is an important method for solving this problem. Clustering analysis as the most commonly used function of data mining, has attracted many researchers in computer science. Because o...

متن کامل

Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features

Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014