Improving phonetic alignment by handling secondary sequence structures

نویسندگان

  • Johann-Mattis List
  • Heinrich Heine
چکیده

In traditional alignment analyses, sequences are only compared with regard to their primary structure. Here, the term primary structure refers to the order order of segments, whereby segments are understood as the smallest units of a sequence which directly correspond to the characters of the alphabet from which the sequence is drawn. Apart from the primary structure, sequences can, however, also have a secondary structure. Apart from segmentizing sequences into their primary units, one can further segmentize them into larger units of subsequences consisting of one or more primary segments. A secondary segmentation which is very common in linguistics is, e.g., the segmentation of words into syllables apart from the primary segmentation of words into phonemes. The traditional alignment modes such as global, local, or semiglobal alignment (cf. the overview in Durbin et al. 2002) align sequences only with respect to their primary structure. Thus, given the sequence "THE CATFISH HUNTS" and "THE CAT FISHES", they all yield an alignment in which the subsequence "CATFISH" of the first sequence is matched with the subsequence "CAT FISH" of the second sequence (see Table 1). In contrast to these primary alignments, a secondary alignment displays the similarities of sequences with regard to both their primary and their secondary structure, aligning letters which belong to the same word in one sequence only with those letters in the other sequence which also belong to a single word (see Table 1).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

INTERALIGN: interactive alignment editor for distantly related protein sequences

SUMMARY Improving and ascertaining the quality of a multiple sequence alignment is a very challenging step in protein sequence analysis. This is particularly the case when dealing with sequences in the 'twilight zone', i.e. sharing < 30% identity. Here we describe INTERALIGN, a dedicated user-friendly alignment editor including a view of secondary structures and a synchronized display of carbon...

متن کامل

jPHYDIT: a JAVA-based integrated environment for molecular phylogeny of ribosomal RNA sequences

jPHYDIT is a Java application designed to furnish a visual and integrated environment for molecular phylogeny. The program can be used to visualize intra-strand base-pairing information in secondary and tertiary structures of ribosomal RNA (rRNA) sequences. A function for the semi-automated alignment was included to facilitate handling of the database containing a large number of multiple-align...

متن کامل

SEQUENTIAL PENALTY HANDLING TECHNIQUES FOR SIZING DESIGN OF PIN-JOINTED STRUCTURES BY OBSERVER-TEACHER-LEARNER-BASED OPTIMIZATION

Despite comprehensive literature works on developing fitness-based optimization algorithms, their performance is yet challenged by constraint handling in various engineering tasks. The present study, concerns the widely-used external penalty technique for sizing design of pin-jointed structures. Observer-teacher-learner-based optimization is employed here since previously addressed by a number ...

متن کامل

SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures

MOTIVATION Multiple sequence alignment is an essential part of bioinformatics tools for a genome-scale study of genes and their evolution relations. However, making an accurate alignment between remote homologs is challenging. Here, we develop a method, called SPEM, that aligns multiple sequences using pre-processed sequence profiles and predicted secondary structures for pairwise alignment, co...

متن کامل

DTW-based phonetic alignment using multiple acoustic features

This paper presents the results of our effort in improving the accuracy of a DTW-based automatic phonetic aligner. The adopted model assumes that the phonetic segment sequence is already known and so the goal is only to align the spoken utterance with a reference synthetic signal produced by waveform concatenation without prosodic modifications. Instead of using a single acoustic measure to com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012