Transcription factor and microRNA motif discovery: the Amadeus platform and a compendium of metazoan target sets.
نویسندگان
چکیده
We present a threefold contribution to the computational task of motif discovery, a key component in the effort of delineating the regulatory map of a genome: (1) We constructed a comprehensive large-scale, publicly-available compendium of transcription factor and microRNA target gene sets derived from diverse high-throughput experiments in several metazoans. We used the compendium as a benchmark for motif discovery tools. (2) We developed Amadeus, a highly efficient, user-friendly software platform for genome-scale detection of novel motifs, applicable to a wide range of motif discovery tasks. Amadeus improves upon extant tools in terms of accuracy, running time, output information, and ease of use and is the only program that attained a high success rate on the metazoan compendium. (3) We demonstrate that by searching for motifs based on their genome-wide localization or chromosomal distributions (without using a predefined target set), Amadeus uncovers diverse known phenomena, as well as novel regulatory motifs.
منابع مشابه
De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference
Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery...
متن کاملDiagnosis and Treatment B non-Hodgkin Lymphoma with System Biology Approaches
Lymphomas are solid tumors of immune system and Non-Hodgkin Lymphomas (NHL) is the most prevalent lymphomas; with wide ranges of histological and clinical features, it is so difficult to identify them. Herein, various bioinformatics tools (such as gene differential expressions, epigenetics and protein analysis) employed to find new treatment approach for NHL based on gene expression variation b...
متن کاملDevelopment of an Efficient Hybrid Method for Motif Discovery in DNA Sequences
This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...
متن کاملA compendium of Caenhorabditis elegans regulatory transcription factors: a resource for mapping transcription regulatory networks
BACKGROUND Transcription regulatory networks are composed of interactions between transcription factors and their target genes. Whereas unicellular networks have been studied extensively, metazoan transcription regulatory networks remain largely unexplored. Caenorhabditis elegans provides a powerful model to study such metazoan networks because its genome is completely sequenced and many functi...
متن کاملSimultaneously Learning DNA Motif along with Its Position and Sequence Rank Preferences through EM Algorithm
Although de novo motifs can be discovered through mining over-represented sequence patterns, this approach misses some real motifs and generates many false positives. To improve accuracy, one solution is to consider some additional binding features (i.e. position preference and sequence rank preference). This information is usually required from the user. This paper presents a de novo motif dis...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Genome research
دوره 18 7 شماره
صفحات -
تاریخ انتشار 2008