Automated identification of putative methyltransferases from genomic open reading frames.
نویسندگان
چکیده
We have analyzed existing methodologies and created novel methodologies for the automatic assignment of S-adenosylmethionine (AdoMet)-dependent methyltransferase functionality to genomic open reading frames based on predicted protein sequences. A large class of the AdoMet-dependent methyltransferases shares a common binding motif for the AdoMet cofactor in the form of a seven-strand twisted beta-sheet; this structural similarity is mirrored in a degenerate sequence similarity that we refer to as methyltransferase signature motifs. These motifs are the basis of our assignments. We find that simple pattern matching based on the motif sequence is of limited utility and that a new method of "sensitized matrices for scoring methyltransferases" (SM2) produced with modified versions of the MEME and MAST tools gives greatly improved results for the Saccharomyces cerevisiae yeast genome. From our analysis, we conclude that this class of methyltransferases makes up approximately 0.6-1.6% of the genes in the yeast, human, mouse, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Escherichia coli genomes. We provide lists of unidentified genes that we consider to have a high probability of being methyltransferases for future biochemical analyses.
منابع مشابه
Specificities of eleven different DNA methyltransferases of Helicobacter pylori strain 26695.
Methyltransferases (MTases) of procaryotes affect general cellular processes such as mismatch repair, regulation of transcription, replication, and transposition, and in some cases may be essential for viability. As components of restriction-modification systems, they contribute to bacterial genetic diversity. The genome of Helicobacter pylori strain 26695 contains 25 open reading frames encodi...
متن کاملIdentification and characterization of putative virulence genes and gene clusters in Aeromonas hydrophila PPD134/91.
Aeromonas hydrophila is a gram-negative opportunistic pathogen of animals and humans. The pathogenesis of A. hydrophila is multifactorial. Genomic subtraction and markers of genomic islands (GIs) were used to identify putative virulence genes in A. hydrophila PPD134/91. Two rounds of genomic subtraction led to the identification of 22 unique DNA fragments encoding 19 putative virulence factors ...
متن کاملGenomic and transcriptomic landscape of Escherichia coli BL21(DE3)
Escherichia coli BL21(DE3) has long served as a model organism for scientific research, as well as a workhorse for biotechnology. Here we present the most current genome annotation of E. coli BL21(DE3) based on the transcriptome structure of the strain that was determined for the first time. The genome was annotated using multiple automated pipelines and compared to the current genome annotatio...
متن کاملSmall-fragment genomic libraries for the display of putative epitopes from clinically significant pathogens.
Taking advantage of whole genome sequences of bacterial pathogens in many thriving diseases with global impact, we developed a comprehensive screening procedure for the identification of putative vaccine candidate antigens. Importantly, this procedure relies on highly representative small-fragment genomic libraries that are expressed to display frame-selected epitope-size peptides on a bacteria...
متن کاملIdentification, Purification and Characterization of Laterosporulin, a Novel Bacteriocin Produced by Brevibacillus sp. Strain GI-9
BACKGROUND Bacteriocins are antimicrobial peptides that are produced by bacteria as a defense mechanism in complex environments. Identification and characterization of novel bacteriocins in novel strains of bacteria is one of the important fields in bacteriology. METHODOLOGY/FINDINGS The strain GI-9 was identified as Brevibacillus sp. by 16 S rRNA gene sequence analysis. The bacteriocin produ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Molecular & cellular proteomics : MCP
دوره 2 8 شماره
صفحات -
تاریخ انتشار 2003