Subfamily HMMS in Functional Genomics

نویسندگان

  • Duncan P. Brown
  • Nandini Krishnamurthy
  • Joseph M. Dale
  • Wayne Christopher
  • Kimmen Sjölander
چکیده

The limitations of homology-based methods for prediction of protein molecular function are well known; differences in domain structure, gene duplication events and errors in existing database annotations complicate this process. In this paper we present a method to detect and model protein subfamilies, which can be used in high-throughput, genome-scale phylogenomic inference of protein function. We demonstrate the method on a set of nine PFAM families, and show that subfamily HMMs provide greater separation of homologs and non-homologs than is possible with a single HMM for each family. We also show that subfamily HMMs can be used for functional classification with a very low expected error rate. The BETE method for identifying functional subfamilies is illustrated on a set of serotonin receptors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Protein Subfamily Identification and Classification

Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to...

متن کامل

Specialized Hidden Markov Model Databases for Microbial Genomics

As hidden Markov models (HMMs) become increasingly more important in the analysis of biological sequences, so too have databases of HMMs expanded in size, number and importance. While the standard paradigm a short while ago was the analysis of one or a few sequences at a time, it has now become standard procedure to submit an entire microbial genome. In the future, it will be common to submit l...

متن کامل

A robust methodology for inferring physiology of a protein family: application to K+-ion channel family

We are interested in the subtle variations of function among the members of a protein family. A protein family is usually subdivided into subfamilies based on functional differences. Existence of this functional diversity is essential for the successful performance of physiological roles expected of the family. This presents a unique problem: there must be preservation of the active site; simul...

متن کامل

p38 MAPK and PI3K/AKT signalling cascades in Parkinson’s disease

Parkinson's disease (PD) is a chronic neurodegenerative condition which has the second largest incidence rate among all other neurodegenerative disorders barring Alzheimer's disease (AD). Currently there is no cure and researchers continue to probe the therapeutic prospect in cell cultures and animal models of PD. Out of several factors contributing to PD prognosis, the role of p38 MAPKs (mitog...

متن کامل

Dual function of the cytochrome P450 CYP76 family from Arabidopsis thaliana in the metabolism of monoterpenols and phenylurea herbicides.

Comparative genomics analysis unravels lineage-specific bursts of gene duplications related to the emergence of specialized pathways. The CYP76C subfamily of cytochrome P450 enzymes is specific to Brassicaceae. Two of its members were recently associated with monoterpenol metabolism. This prompted us to investigate the CYP76C subfamily genetic and functional diversification. Our study revealed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره   شماره 

صفحات  -

تاریخ انتشار 2005