Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.

نویسندگان

  • M Pellegrini
  • E M Marcotte
  • M J Thompson
  • D Eisenberg
  • T O Yeates
چکیده

Determining protein functions from genomic sequences is a central goal of bioinformatics. We present a method based on the assumption that proteins that function together in a pathway or structural complex are likely to evolve in a correlated fashion. During evolution, all such functionally linked proteins tend to be either preserved or eliminated in a new species. We describe this property of correlated evolution by characterizing each protein by its phylogenetic profile, a string that encodes the presence or absence of a protein in every known genome. We show that proteins having matching or similar profiles strongly tend to be functionally linked. This method of phylogenetic profiling allows us to predict the function of uncharacterized proteins.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An improved hypergeometric probability method for identification of functionally linked proteins using phylogenetic profiles

Predicting functions of proteins and alternatively spliced isoforms encoded in a genome is one of the important applications of bioinformatics in the post-genome era. Due to the practical limitation of experimental characterization of all proteins encoded in a genome using biochemical studies, bioinformatics methods provide powerful tools for function annotation and prediction. These methods al...

متن کامل

Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467

Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...

متن کامل

Comparative pair-wise domain-combinations for screening the clade specific domain-architectures in metazoan genomes.

In the evolution of the eukaryotic genome, exon or domain shuffling has produced a variety of proteins. On the assumption that each fusion event between two independent protein-domains occurred only once in the evolution of metazoans, we can roughly estimate when the fusion events were happened. For this purpose, we made phylogenetic profiles of pair-wise domain-combinations of metazoans. The p...

متن کامل

Extraction of Organism Groups from Whole Genome Comparisons

The availability of a growing number of fully sequenced genomes makes it possible for us to conduct a large-scale comparative genomic research. In recent years, functional prediction and genome tree construction methods based on phylogenetic profiles have been developed. The phylogenetic profile is defined as a bit pattern that encodes the presence or absence of orthologous genes in a set of or...

متن کامل

A Peer-to-Peer Environment for Annotation of Genomes: The SEED

A genome may be thought of as a set of genes that encode protein sequences. The function of each gene is determined by the activity of the protein it encodes. Genome annotation is the process of assigning functions to genes. Functions are assigned by any of several methods. The most direct form of function assignment involves determining the function of a gene by experiment. Since vastly more g...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 96 8  شماره 

صفحات  -

تاریخ انتشار 1999