Annotating Gene Functions by Spectral Clustering for Combining Gene Expressions and Sequences
نویسندگان
چکیده
Annotating gene functions is a fundamental issue in the post-genomic era. A typical procedure for this issue is first clustering genes and then assigning functions of unknown genes by using known genes in the same cluster. A lot of genomic information are available for this issue, but two major types of data which can be measured for any genes are microarray expressions and sequences, both of which however have their own flaws. Thus a natural and promising approach for gene annotation is to combine these two data sources. We developed an efficient gene annotation method with three steps containing spectral clustering over the integrated clustering cost for each data source. We examined the performance of our proposed method from viewpoints of clustering and annotations. All experimental results indicate our performance advantage over possible clustering/classification-based approaches of gene function annotation, using expressions and/or sequences.
منابع مشابه
Annotating gene functions with integrative spectral clustering on microarray expressions and sequences.
Annotating genes is a fundamental issue in the post-genomic era. A typical procedure for this issue is first clustering genes by their features and then assigning functions of unknown genes by using known genes in the same cluster. A lot of genomic information are available for this issue, but two major types of data which can be measured for any gene are microarray expressions and sequences, b...
متن کاملClustering of a Number of Genes Affecting in Milk Production using Information Theory and Mutual Information
Information theory is a branch of mathematics. Information theory is used in genetic and bioinformatics analyses and can be used for many analyses related to the biological structures and sequences. Bio-computational grouping of genes facilitates genetic analysis, sequencing and structural-based analyses. In this study, after retrieving gene and exon DNA sequences affecting milk yield in dairy ...
متن کاملSpectral Preprocessing for Clustering Time-Series Gene Expressions
Based on gene expression profiles, genes can be partitioned into clusters, which might be associated with biological processes or functions, for example, cell cycle, circadian rhythm, and so forth. This paper proposes a novel clustering preprocessing strategy which combines clustering with spectral estimation techniques so that the time information present in time series gene expressions is ful...
متن کاملخوشهبندی دادههای بیانژنی توسط عدم تشابه جنگل تصادفی
Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data are typically involve in a large number of variables (genes), in comparison with number of samples (patients). Many clustering methods have been built based on the dissimilarity among observations that are calculated by a distance function. As increa...
متن کاملConstruction of expressing vectors including melanoma differentiation-associated gene-7 (mda-7) fused with the RGD sequences for better tumor targeting
Objective(s): Up to now, many researches have been performed to improve the antitumoral effect of melanoma differentiation-associated gene-7 (mda-7) protein. The purpose of our research was to construct 3 expression vectors producing mda-7 in fusion with RGD (Arginine-Glycine-Aspartic acid) peptide and evaluate their expression. Materials and Methods: mda-7 gene with two different RGD sequ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009