PredHydroxy: computational prediction of protein hydroxylation site locations based on the primary structure.

نویسندگان

  • Shao-Ping Shi
  • Xiang Chen
  • Hao-Dong Xu
  • Jian-Ding Qiu
چکیده

Compared to well-known and extensively studied protein phosphorylation, protein hydroxylation attracts much less attention and the molecular mechanism of the hydroxylation is still incompletely understood. And yet annotation of hydroxylation in proteomes is a first-critical step toward decoding protein function and understanding their physiological roles that have been implicated in the pathological processes and providing useful information for the drug designs of various diseases related with hydroxylation. In this work, we present a novel method called PredHydroxy to automate the prediction of the proline and lysine hydroxylation sites based on position weight amino acids composition, 8 high-quality amino acid indices and support vector machines. The PredHydroxy achieved a promising performance with an area under the receiver operating characteristic curve (AUC) of 82.72% and a Matthew's correlation coefficient (MCC) of 69.03% for hydroxyproline as well as an AUC of 87.41% and a MCC of 66.68% for hydroxylysine in jackknife cross-validation. The results obtained from both the cross validation and independent tests suggest that the PredHydroxy might be a powerful and complementary tool for further experimental investigation of protein hydroxylation. Feature analyses demonstrate that hydroxylation and non-hydroxylation have distinct location-specific differences; alpha and turn propensity is of importance for the hydroxylation of proline and lysine residues. A user-friendly server is freely available on the web at: .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase

Peroxidase enzymes are vastly applicable in industry and diagnosiss. Recently, we introduced a new kind of peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study we employed in silico tools to determine, to which group of peroxidase enzymes LDP...

متن کامل

Prediction of 3D protein Structure based on Mutation of AKAP3 and PLOD3 Gene in Case of Non-Obstructive Azoospermia

Background: The present study has been designed with the aim of evaluating A-kinase anchoring proteins 3 (AKAP3)and Procollagen-Lysine, 2-Oxoglutarate 5-Dioxygenase 3 (PLOD3) gene mutations and prediction of 3D proteinstructure for ligand binding activity in the cases of non-obstructive azoospermic male.Materials and Methods: Clinically diagnosed cases of non-obstructive azoos...

متن کامل

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...

متن کامل

Computational Prediction of the Effects of Single Nucleotide Polymorphisms of the Gene Encoding Human Endothelial Nitric Oxide Synthase

ABSTRACT           Background and Objective: Genetic variations in the gene encoding endothelial nitric oxide synthase (eNOS) enzyme affect the susceptibility to cardiovascular disease. Identification of the way these changes affect eNOS structure and function in laboratory conditions is difficult and time-consuming. Thus, it seems essential to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Molecular bioSystems

دوره 11 3  شماره 

صفحات  -

تاریخ انتشار 2015