Biological data cleaning: a case study

نویسندگان

  • Katherine G. Herbert
  • Jason Tsong-Li Wang
چکیده

As databases become more pervasive through the biological sciences, various data quality concerns are emerging. Biological databases tend to develop data quality issues regarding data legacy, data uniformity and data duplication. Due to the nature of this data, each of these problems is non-trivial and can cause many problems for the database. For biological data to be corrected and standardised, methods and frameworks must be developed to handle both structural and traditional data. This paper discusses issues concerning biological data quality with respect to data cleaning. It presents BIO-AJAX, a framework developed to address these issues. It finally describes BIO-JAX for TreeBASE and BIO-AJAX for Lineage Path, two implementations of BIO-AJAX on phylogenetic data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building a Disordered Protein Database: A Case Study in Managing Biological Data

A huge diversity of biological databases is available via the Internet, but many of these databases have been developed in an ad hoc manner rather than in accordance with any data management principles. In addition, in the area of disordered protein databases, many of the databases have not been made publicly available. This poses challenges to researchers, since reliable protein databases are ...

متن کامل

Industrial Cleaning with ultra-clean water according to the Qlean-method – a case study of printed circuit boards

The manufacturing industry today uses many kinds of chemicals in its cleaning processes. The industrial cleaners often contain some sort of degreasing chemical to clean parts and components before the main processes, for instance assembly or surface treatment. These types of cleaning methods are often expensive and involve hazardous handling of chemicals in manufacturing, as well as in the tran...

متن کامل

SQLShare: Scientific Workflow via Relational View Sharing

We consider a case study in using a web-based query-as-a-service platform as an alternative to scriptbased scientific workflows. The context is a project in observational biological oceanography to share and process data from a ship-based continuous profiler of microbial populations called SeaFlow. The representative tasks involve aggregating and cleaning SeaFlow measurements, integrating the c...

متن کامل

Industrial cleaning with Qlean Water: a case study of printed circuit boards

Many manufacturing companies are looking for ways to substitute environmentally problematic cleaning methods for surface treatments with more environmentally friendly ones. In this paper, one potential solution is described. The Qlean method, based on cleaning with highly pure water (in this paper defined as Qlean Water), is a novel cleaning method. This method, now utilized at one plant at a l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJIQ

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2007