AGOUTI: improving genome assembly and annotation using transcriptome data
Similar resources
AGOUTI: improving genome assembly and annotation using transcriptome data
BACKGROUND Genomes sequenced using short-read, next-generation sequencing technologies can have many errors and may be fragmented into thousands of small contigs. These incomplete and fragmented assemblies lead to errors in gene identification, such that single genes spread across multiple contigs are annotated as separate gene models. Such biases can confound inferences about the number and id...
ChopStitch: exon annotation and splice graph construction using transcriptome assembly and whole genome sequencing data.
Motivation Sequencing studies on non-model organisms often interrogate both genomes and transcriptomes with massive amounts of short sequences. Such studies require de novo analysis tools and techniques, when the species and closely related species lack high quality reference resources. For certain applications such as de novo annotation, information on putative exons and alternative splicing m...
Machine annotation of genome and transcriptome data
One of the key research topics of post-genome study is annotation of the gene with regards to specific function and biological processes. This can help us to understand the precise role that a gene or a group of genes carries. In this thesis, I developed techniques to automatically annotate genes on single gene and a group of genes levels. It is shown that these techniques improve our understan...
Improving the ostrich genome assembly using optical mapping data
BACKGROUND The ostrich (Struthio camelus) is the tallest and heaviest living bird. Ostrich meat is considered a healthy red meat, with an annual worldwide production ranging from 12,000 to 15,000 tons. As part of the avian phylogenomics project, we sequenced the ostrich genome for phylogenetic and comparative genomics analyses. The initial Illumina-based assembly of this genome had a scaffold N...
14. Genome Assembly and Annotation Process
The primary data produced by genome sequencing projects are often highly fragmented and sparsely annotated. This is especially true for the Human Genome Project [http://www.genome.gov/page.cfm?pageID=10001772] as a result of its policy of releasing sequence data to the public sequence databases every day (1, 2). So that individual researchers do not have to piece together extended segments of ...
Journal
Journal title: GigaScience
Year: 2016
ISSN: 2047-217X
DOI: 10.1186/s13742-016-0136-3