Practical limits of function prediction.
نویسندگان
چکیده
The widening gap between known protein sequences and their functions has led to the practice of assigning a potential function to a protein on the basis of sequence similarity to proteins whose function has been experimentally investigated. We present here a critical view of the theoretical and practical bases for this approach. The results obtained by analyzing a significant number of true sequence similarities, derived directly from structural alignments, point to the complexity of function prediction. Different aspects of protein function, including (i) enzymatic function classification, (ii) functional annotations in the form of key words, (iii) classes of cellular function, and (iv) conservation of binding sites can only be reliably transferred between similar sequences to a modest degree. The reason for this difficulty is a combination of the unavoidable database inaccuracies and the plasticity of protein function. In addition, analysis of the relationship between sequence and functional descriptions defines an empirical limit for pairwise-based functional annotations, namely, the three first digits of the six numbers used as descriptors of protein folds in the FSSP database can be predicted at an average level as low as 7.5% sequence identity, two of the four EC digits at 15% identity, half of the SWISS-PROT key words related to protein function would require 20% identity, and the prediction of half of the residues in the binding site can be made at the 30% sequence identity level.
منابع مشابه
Evaluating the suitability of prediction equations for lung function in Indian children: a practical approach.
OBJECTIVE Although several prediction equations to evaluate peak expiratory flow rate (PEFR) of Indian children are available in literature, clinicians and researchers need to make a logical choice of which equation to use as reference. The aim was to demonstrate a practical approach to making such a logical choice by using prediction equations on our study population. METHODS Eighteen linear...
متن کاملPlace Finding and Optimizing the Determination of Production Units Dynamically for Providing the Electricity and Heat in Industrial City
In this article the place and capacity of combined heat and power [CHP] prediction unit wasdetermined dynamically with use of modified particle swarm optimization (MPSO). It was done inoptimization palace and with a capacity of CHP as a production resource with the aim to increasethe reliability capacity. Decrease the loss and provide the electrical and thermal energies ofindustrial city. The f...
متن کاملThe Prediction of Surface Tension of Ternary Mixtures at Different Temperatures Using Artificial Neural Networks
In this work, artificial neural network (ANN) has been employed to propose a practical model for predicting the surface tension of multi-component mixtures. In order to develop a reliable model based on the ANN, a comprehensive experimental data set including 15 ternary liquid mixtures at different temperatures was employed. These systems consist of 777 data points generally containing hydrocar...
متن کاملPrediction-Based Portfolio Optimization Model for Iran’s Oil Dependent Stocks Using Data Mining Methods
This study applied a prediction-based portfolio optimization model to explore the results of portfolio predicament in the Tehran Stock Exchange. To this aim, first, the data mining approach was used to predict the petroleum products and chemical industry using clustering stock market data. Then, some effective factors, such as crude oil price, exchange rate, global interest rate, gold price, an...
متن کاملStiffness Prediction of Beech Wood Flour Polypropylene Composite by using Proper Fiber Orientation Distribution Function
One of the most famous methods to predict the stiffness of short fiber composites is micromechanical modeling. In this study, a Representative Volume Element (RVE) of a beech wood flour natural composite has been designed and the orientation averaging approach has been utilized to predict its stiffness tensor. The novelty of this work is in finding the proper fiber orientation distribution func...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proteins
دوره 41 1 شماره
صفحات -
تاریخ انتشار 2000