Recursive Shortest Path Algorithm with Application to Density-integration of Weighted Graphs

نویسندگان

Cedric E. Ginestet

Andrew Simmons

چکیده

Graph theory is increasingly commonly utilised in genetics, proteomics and neuroimaging. In such fields, the data of interest generally constitute weighted graphs. Analysis of such weighted graphs often require the integration of topological metrics with respect to the density of the graph. Here, density refers to the proportion of the number of edges present in that graph. When topological metrics based on shortest paths are of interest, such densityintegration usually necessitates the iterative application of Dijkstra’s algorithm in order to compute the shortest path matrix at each density level. In this short note, we describe a recursive shortest path algorithm based on single edge updating, which replaces the need for the iterative use of Dijkstra’s algorithm. Our proposed procedure is based on pairs of breadthfirst searches around each of the vertices incident to the edge added at each recursion. An algorithmic analysis of the proposed technique is provided. When the graph of interest is coded as an adjacency list, our algorithm can be shown to be more efficient than an iterative use of Dijkstra’s algorithm. Introduction The last ten years has seen a surge of interest in graph theory among biologists, physicists and other natural scientists. This was primarily stimulated by the seminal papers of Watts and Strogatz (1998) and Barabasi and Albert (1999). In particular, a wide range of different data types are now analyzed through systematic calculations of various topological measures, such as the characteristic path length or clustering coefficient. In systems biology and neuroscience, subject-specific networks can be constructed in order to compare several populations of networks for testing putative differences between groups of subjects (see Bullmore and Sporns, 2009, for a review). (For convenience, the terms network and graph will here be used interchangeably, as this reflects some of the recent developments in the literature.) Such biological networks, however, tend to be weighted undirected graphs, which generally correspond to some standardized covariance matrices between a set of regions of interest. By contrast, most of the topological measures introduced by Watts and Strogatz (1998) and Barabasi and Albert (1999) pertain to unweighted networks. There is currently no general consensus on how to compute or compare the topology of weighted graphs. This is a particularly arduous problem, since it requires the use of real-valued mathematical tools on objects, which are essentially discrete. One of the possible solutions to this conundrum has been advanced by He et al. (2009), who suggested integrating the topological measures of interest with respect to the density of the network (see also Achard and Bullmore, 2007, Ginestet and Simmons, 2011). The density of a network is here defined as the proportion of the number of edges in a given graph. Such integration, however, is computationally expensive, and its complexity grows quadratically with the number of nodes. A Monte Carlo scheme has been proposed in the literature to address this issue and approximate the value of such an integral (Ginestet et al., Submitted). Such Monte Carlo methods, however, also necessitates large number of simulations in order to reduce the variability of the resulting estimates. Most of the topological metrics of interest to researchers in neuroscience and systems biology tend to involve the computation of the matrix of shortest paths, denoted D. This includes, for instance, the global and local efficiency measures proposed by Latora and Marchiori (2001) (see also Latora and Marchiori, 2003). The computation of D for a given network can be done Cedric E. Ginestet & Andrew Simmons 2 Recursive Shortest Path Algorithm efficiently using the celebrated Dijstra’s algorithm (Dijkstra, 1959). However, when considering weighted networks, Dijstra’s algorithm may need to be invoked as many times as the number of edges in the graph of interest. In this short note, we address this specific problem by proposing a recursive shortest path algorithm based on applying single edge updates to D. In this setup, we only work with the shortest path matrix and compute the value of the desired topological metric at every density level. Taken together, we therefore provide an efficient algorithm for the density-integration of the topological functions of weighted networks. Density-integration of Topological Metrics In this paper, our main focus will be on undirected weighted graphs, containing no graph loops or multiple edges. However, since we also need to refer to unweighted graphs, we introduce the following notation. A graph G is here defined as a triple (V, E ,W), where V(G) is the standard vertex set, E(G) is the edge set and W(G) is a multiset of real-valued weights. Our convention generalizes to directed graphs. In addition, this also includes undirected unweighted (simple) graphs as special cases, for which the elements of W belong to {0, 1}. Such a setup may, for instance, apply to the consideration of correlation matrices or other matrices of similarity measures with real-valued entries. In addition, we will make use of the following notation, NV := |V(G)|, NE := |E(G)|, and NI := NV (NV − 1) 2 , where := signifies that the left-hand side is defined as the right-hand side. We define NI as the number of shortest paths in G. Naturally, NI here takes this value because G is undirected. For a directed network, NI would be NV (NV − 1). For convenience, we will interchangeably use the following two sets of indices to label the elements of W, W(G) = {wv1v2 , . . . , wij , . . . , wNV −1,NV } = {w1, . . . , we, . . . , wNE}. (1) Albeit we will here restrict our attention to undirected graphs, an extension of our proposed technique to directed networks will be discussed in the conclusion. A range of topological metrics necessitating the computation of the shortest path matrix have been proposed in the literature. Two popular choices of topological measures are the global and local efficiency measures introduced by Latora and Marchiori (2001). Both of these quantities can be derived from the general definition of the efficiency, E(·), of a simple graph G = (V, E ,W), which is defined as follows, E(G) := 1 NI NV ∑

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying algorithm finding shortest path in the multiple- weighted graphs to find maximal flow in extended linear multicomodity multicost network

The shortest path finding algorithm is used in many problems on graphs and networks. This article will introduce the algorithm to find the shortest path between two vertices on the extended graph. Next, the algorithm finds the shortest path between the pairs of vertices on the extended graph with multiple weights is developed. Then, the shortest path finding algorithms is used to find the maxim...

متن کامل

Shortest path distance in random k-nearest neighbor graphs

Consider a weighted or unweighted k-nearest neighbor graph that has been built on n data points drawn randomly according to some density p on R. We study the convergence of the shortest path distance in such graphs as the sample size tends to infinity. We prove that for unweighted kNN graphs, this distance converges to an unpleasant distance function on the underlying space whose properties are...

متن کامل

A new approach to all-pairs shortest paths on real-weighted graphs

We present a new all-pairs shortest path algorithm that works with real-weighted graphs in the traditional comparison-addition model. It runs in O(mn+n2 log log n) time, improving on the long-standing bound of O(mn + n2 logn) derived from an implementation of Dijkstra's algorithm with Fibonacci heaps. Here m and n are the number of edges and vertices, respectively. Our algorithm is rooted in th...

متن کامل

A Combinatorial Algorithm for All-Pairs Shortest Paths in Directed Vertex-Weighted Graphs with Applications to Disc Graphs

We consider the problem of computing all-pairs shortest paths in a directed graph with real weights assigned to vertices. For an n × n 0 − 1 matrix C, let KC be the complete weighted graph on the rows of C where the weight of an edge between two rows is equal to their Hamming distance. Let MWT (C) be the weight of a minimum weight spanning tree of KC . We show that the all-pairs shortest path p...

متن کامل

The K Shortest Paths Problem with Application to Routing

We present a simple algorithm for explicitly computing all k shortest paths bounded by length L from a fixed source to a target in O(m + kL) and O(mlogm + kL) time for unweighted and weighted directed graphs with m edges respectively. For many graphs, this outperforms existing algorithms by exploiting the fact that real world networks have short average path length. Consequently, we would like ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1104.1355 شماره

صفحات -

تاریخ انتشار 2011

Recursive Shortest Path Algorithm with Application to Density-integration of Weighted Graphs

نویسندگان

چکیده

منابع مشابه

Applying algorithm finding shortest path in the multiple- weighted graphs to find maximal flow in extended linear multicomodity multicost network

Shortest path distance in random k-nearest neighbor graphs

A new approach to all-pairs shortest paths on real-weighted graphs

A Combinatorial Algorithm for All-Pairs Shortest Paths in Directed Vertex-Weighted Graphs with Applications to Disc Graphs

The K Shortest Paths Problem with Application to Routing

عنوان ژورنال:

اشتراک گذاری