Heuristic Algorithms in Bioinformatics

نویسنده

  • Jakob Vesterstrøm
چکیده

Many bioinformatics problems are very hard to solve optimally and within polynomial time of their size. This motivates the use of heuristics in bioinformatics. The present thesis constitutes research into the design and analysis of efficient heuristic algorithms of practical usage for solving biological problems. The research made in this thesis can be related to three main objectives: 1) The design efficient tools for solving relevant bioinformatics problems, 2) The exploration of stochastic search algorithms, and 3) The use of bioinformatics tools get an increased knowledge about the world from a bioinformatics perspective. The research relates to two bioinformatics problems: Comparison of 3D protein structures also known as structural comparison, and physical mapping, which is a combinatorial problem that originates from the determination of whole genome sequences. The thesis is structured in two parts: Part I, in which the background is introduced for the algorithms used and the problem domains investigated. Part II, in which the four papers I have produced are presented. One paper has been accepted for publication in the Journal of Computational Biology, two have been published in the proceedings of the Congress on Evolutionary Computation, and one is currently in submission. The research on structural comparison resulted in a journal paper, written with William R. Taylor, and a tool, called FASE (Flexible structural Alignment from Secondary structure), for comparison, superposition, and alignment of protein structures. The chosen focus for structural comparison is reflected in the features of FASE: FASE can find non-sequential structure similarities with a novel, local application dynamic programming, FASE uses a new idea to search the space of structural alignments by making initial alignments of the protein structures according to their SSE definitions. This improves the runtime, while still enabling discovery of very subtle and remote similarities. A novel consensus scoring method, combining five measures for structural similarity, was introduced in order to increase robustness in similarity assessment. The significance of the found similarity values was determined using the SCOP classification as reference. In conclusion, FASE was shown to be robust with defaults parameters, and have a better time-complexity than competitive methods. FASE has been tested on

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

FOA: ‘Following’ Optimization Algorithm for solving Power engineering optimization problems

These days randomized-based population optimization algorithms are in wide use in different branches of science such as bioinformatics, chemical physics andpower engineering. An important group of these algorithms is inspired by physical processes or entities’ behavior. A new approach of applying optimization-based social relationships among the members of a community is investigated in this pa...

متن کامل

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

متن کامل

A heuristic managing errors for DNA sequencing

MOTIVATION A new heuristic algorithm for solving DNA sequencing by hybridization problem with positive and negative errors. RESULTS A heuristic algorithm providing better solutions than algorithms known from the literature based on tabu search method.

متن کامل

Two new heuristic algorithms for Covering Tour Problem

Covering Tour Problem (CTP) is the generalized form of Traveling Salesman Problem (TSP), which has found different applications in the designing of distribution networks, disaster relief, and transportation routing. The purpose of this problem is to determine the Hamiltoniancyclewiththe lowest costusinga subset of all the nodes, such that the other nodes would be in a distance shorter than the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005