Contamination in sequence databases
نویسندگان
چکیده
منابع مشابه
Errors, Inconsistencies and Contamination in Public Sequence Databases
Sequence databases Over approximately the past decade sequence databases have become one of the most important tools in biological research. The information content of the databases has been growing exponentially increasing their usefulness constantly. Scientist use them to identify transcripts cloned out of libraries and as templates for PCR and sequencing primer design. They are the basis for...
متن کاملAllergen sequence databases.
A number of specialized databases have been developed to facilitate studies of human allergens. These include molecular databases focused on protein sequences and structures, informational databases focused on clinical, biochemical and epidemiological data related to protein allergens, a database on allergen nomenclature, and other knowledge bases or informational websites that are peripherally...
متن کاملProtein sequence databases.
A variety of protein sequence databases exist, ranging from simple sequence repositories, which store data with little or no manual intervention in the creation of the records, to expertly curated universal databases that cover all species and in which the original sequence data are enhanced by the manual addition of further information in each sequence record. As the focus of researchers moves...
متن کاملHIV sequence databases.
Two important databases are often used in HIV genetic research, the HIV Sequence Database in Los Alamos, which collects all sequences and focuses on annotation and data analysis, and the HIV RT/Protease Sequence Database in Stanford, which collects sequences associated with the development of viral resistance against anti-retroviral drugs and focuses on analysis of those sequences. The types of...
متن کاملBioinformatic sequence identification from sequence family databases
We have developed a tool in order to identify sequences in relation to a sequence family database. This tool combines several algorithms: BLAST, multiple sequence alignment and phylogenetic tree building. After identification of the most similar gene family to the query sequence, this query sequence is added to the whole family alignment and the phylogenetic tree of the family is rebuilt includ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nature Methods
سال: 2020
ISSN: 1548-7091,1548-7105
DOI: 10.1038/s41592-020-0895-8