Distributed Structured Prediction for Big Data
Authors
Abstract
The main obstacles to learning structured predictors from big data are computation time and memory demands. In this paper, we propose to handle such problems efficiently by distributing and parallelizing the resource requirements. We present a distributed structured prediction learning algorithm for large-scale models that cannot be handled effectively by a single cluster node. Importantly, the convergence and optimality guarantees of recently developed algorithms are preserved while inter-node communication is kept low.
Similar resources
VIPOC Project Research Summary (Discussion Paper)
Predicting the output power of renewable energy production plants distributed on a wide territory is a valuable goal, both for marketing and energy management purposes. In this paper, we describe Vi-POC (Virtual Power Operating Center) – a distributed system for storing huge amounts of data, gathered from energy production plants and weather prediction services. Due to the heterogeneity and the...
Insight-driven Crisis Information - Preparing for the Unexpected using Big Data
National information and situation centers are faced with rising information needs and the question of how to prepare for unexpected situations. One promising development is the access to vastly growing data produced by distributed sensors and a socially networked society. Current emergency information systems are limited in the amount of complex data they can process and interpret in real-time...
A Study on Text Mining over Hadoop Framework
As data grows day by day, text data mining approaches play a vital role in extracting potentially useful information and associations from large amounts of text data. The term data mining is used for methods that analyze data, and data mining deals with structured data, whereas text mining deals with different formats of unstructured or semi-structured data. ...
A Study of Traditional Data Analysis and Sensor Data Analytics
The growth of smart and intelligent devices known as sensors generates a large amount of data. Over time, the generated data reaches a volume large enough to be designated as big data. The repository holds unstructured data. Traditional data analytics methods are well developed and widely used to analyze structured data and, to a limited extent, semi-structured data, which in...
Distributed Training of Structured SVM
Training structured prediction models is time-consuming. However, most existing approaches only use a single machine, thus, the advantage of computing power and the capacity for larger data sets of multiple machines have not been exploited. In this work, we propose an efficient algorithm for distributedly training structured support vector machines based on a distributed block-coordinate descen...
Journal title:
Volume, issue
Pages -
Publication date: 2012