University of London Imperial College London Department of Computing Asynchronous Iterative Solution for Dominant Eigenvectors with Applications in Performance Modelling and PageRank
نویسنده
چکیده
Performance analysis calculations, for models of any complexity, require a distributed computation effort that can easily occupy a large compute cluster for many days. Producing a simple steady-state measure involves an enormous dominant eigenvector calculation, with even modest performance models having upwards of 10 variables. Computations such as passage-time analysis are an order of magnitude more difficult, producing many hundreds of repeated linear system calculations. As models describe greater concurrency, so the state space of the model increases and with it the magnitude of any performance analysis problem that may be being attempted. The PageRank algorithm is used by Google to measure the relative importance of web pages. It does this by formulating and solving a similarly enormous dominant eigenvector problem, with one variable for every page on the web. As with performance problems, as the number of web pages grows, so the size of the underlying system calculation grows also. With the number of web pages currently estimated to exceed one trillion, the PageRank problem requires many thousands of computers running concurrently over many different clusters. Both problems share the same underlying mathematical type and also the same requirement to run effectively on large distributed clusters. Traditional iterative solution methods scale poorly over large distributed architectures. This is because of the inherent requirement to communicate and synchronise at every iteration step. While asynchronous iterative methods have been around since the 1950s, they have, as yet, not been applied to dominant eigenvector problems without some form of restriction. These methods have been shown to be very successful in other contexts when implemented across large distributed architectures. According to the current state of the art in asynchronous techniques, application to dominant eigenvector problems requires a fixed bound on how and when updates can happen, and thus effectively a bound on the asynchronous communication itself. In this thesis, we show how to apply asynchronous iterative methods to dominant eigenvector problems without any such restrictions. We do this by showing how to map homogeneous, singular linear systems to inhomogeneous, non-singular linear systems which share the same solution. We present a single asynchronous iterative solution framework for performance analysis problems. We also present three particular solution algorithms. We demonstrate analytically and empirically that asynchronous iterative methods offer significant advantages over traditional synchronous solution methods. We use the theoretical tools which we introduce in this thesis to reduce the complexity of the PageRank problem, limiting the ever-increasing impact of dangling web pages. We generate a smaller, sparser problem which may be solved using asynchronous iterative methods.
منابع مشابه
Pay for Performance: A Reflection on How a Global Perspective Could Enhance Policy and Research
Pay-for-performance (P4P) is the provision of financial incentives to healthcare providers based on pre-specified performance targets. P4P has been used as a policy tool to improve healthcare provision globally. However, researchers tend to cluster into those working on high or low- and middle-income countries (LMICs), with still limited knowledge exchange, potentially ...
متن کاملNon-intrusive reduced-order modeling for multiphase porous media flows using Smolyak sparse grids
1Applied Modelling and Computation Group, Department of Earth Science and Engineering, Imperial College London, Prince Consort Road, London, UK 2China University of Geosciences, Wuhan 430074, China 3Zhejiang University, Hangzhou, China 4Department of Scientific Computing, Florida State University, Tallahassee, FL 32306-4120, USA 5Department of Earth Science and Engineering, Imperial College Lon...
متن کاملReduced-order modelling of an adaptive mesh ocean model
Complete List of Authors: Fang, Fangxin; Imperial College London, Earth Science and Engineering Pain, ChristopherC; Imperial College London, Department of Earth Science and Engineering Navon, Michael; Florida State University, School of Computational Science Piggott, MD; Imperial College London, Earth Science and Engineering Gorman, Gerard; Imperial College London, Earth Science and Engineering...
متن کاملClosed-form analytical solution procedure for element design in D regions
This paper presents a novel procedure for solving the equations system of the rotating crack model used for reinforced concrete. It is implemented in the programme NonOPt where it is used to optimise the reinforcement design of D regions. The procedure is based on solving explicit closed-form relations without the need to incrementally increase the applied loads. The solution procedure is based...
متن کاملAsynchronous iterative solution for dominant eigenvectors with applications in performance modelling and PageRank
Performance analysis calculations, for models of any complexity, require a distributed computation effort that can easily occupy a large compute cluster for many days. Producing a simple steady-state measure involves an enormous dominant eigenvector calculation, with even modest performance models having upwards of 10 variables. Computations such as passage-time analysis are an order of magnitu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009