Subgraph Ensembles and Motif Discovery Using a New Heuristic for Graph Isomorphism

نویسندگان

  • Kim Baskerville
  • Maya Paczuski
چکیده

A new heuristic based on vertex invariants is developed to rapidly distinguish non-isomorphic graphs to a desired level of accuracy. The method is applied to sample subgraphs from an E.coli protein interaction network, and as a probe for discovery of extended motifs. The network’s structure is described using statistical properties of its N-node subgraphs for N ≤ 14. The Zipf plots for subgraph occurrences are robust power laws that do not change when rewiring the network while fixing the degree sequence — although the specific subgraphs may exchange ranks. However the exponent depends on N . The study of larger subgraphs highlights some striking patterns for various N . Motifs, or connected pieces that are over-abundant in the ensemble of subgraphs, have more edges, for a given number of nodes, than antimotifs and generally display a bipartite structure or tend towards a complete graph. In contrast, antimotifs, which are under-abundant connected pieces, are mostly trees or contain at most a single, small loop. The extension to directed graphs is straightforward.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neighbor-Aware Search for Approximate Labeled Graph Matching using the Chi-Square Statistics

Labeled graphs provide a natural way of representing entities, relationships and structures within real datasets such as knowledge graphs and protein interactions. Applications such as question answering, semantic search, and motif discovery entail efficient approaches for subgraph matching involving both label and structural similarities. Given the NP-completeness of subgraph isomorphism and t...

متن کامل

On Network Tools for Network Motif Finding: A Survey Study

Network motifs have been called the building blocks of networks [1]. Graph theory is used to computationally represent and search networks. Many efforts have been put into developing motif discovery tools to search for and find network motifs, patterns or subgraphs within the input network that occur more frequently in the input network than in randomized networks where patterns occur by chance...

متن کامل

A Chronological Edge-Driven Approach to Temporal Subgraph Isomorphism

Many real world networks are considered temporal networks, in which the chronological ordering of the edges has importance to the meaning of the data. Performing temporal subgraph matching on such graphs requires the edges in the subgraphs to match the order of the temporal graph motif we are searching for. Previous methods for solving this rely on the use of static subgraph matching to find po...

متن کامل

Parallel Subgraph Isomorphism

The subgraph isomorphism problem deals with determining whether a given graph H is isomorphic to some subgraph of another graph G. In this paper we attempt to parallelize a fast serial subgraph isomorphism library, VFLib, which uses backtracking search to find a solution. Our parallel solution runs on Cilk++ for efficient execution on multicore machines. In our work we examine the benefits and ...

متن کامل

Solving Hard Subgraph Problems in Parallel

We look at problems involving finding subgraphs in larger graphs, such as the maximum clique problem, the subgraph isomorphism problem, and the maximum common subgraph problem. We investigate variable and value ordering heuristics, different inference strategies, intelligent backtracking search (backjumping), and bitand thread-parallelism to exploit modern hardware.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006