Search results for: data cleaning

Number of results: 2424654

Journal: CoRR 2017
El Kindi Rezig Mourad Ouzzani Ahmed K. Elmagarmid Walid G. Aref

Data Cleaning refers to the process of detecting and fixing errors in the data. Human involvement is instrumental at several stages of this process, e.g., to identify and repair errors, to validate computed repairs, etc. There is currently a plethora of data cleaning algorithms addressing a wide range of data errors (e.g., detecting duplicates, violations of integrity constraints, missing value...

2018
Hongju Cheng Danyang Feng Xiaobin Shi Chongcheng Chen

The quality of data in wireless sensor networks has a significant impact on decision support, and data cleaning is an effective way to improve data quality. However, if the data cleaning strategies are not correctly designed, it might result in an unsatisfactory cleaning effect with increased system cleaning costs. Initially, data quality evaluation indicators and their measurement methods in w...

Journal: IOSR Journal of Engineering 2013

2006
Michael Benedikt Philip Bohannon Glenn Bruns

Journal: PVLDB 2013
Floris Geerts Giansalvatore Mecca Paolo Papotti Donatello Santoro

Data-cleaning (or data-repairing) is considered a crucial problem in many database-related tasks. It consists in making a database consistent with respect to a set of given constraints. In recent years, repairing methods have been proposed for several classes of constraints. However, these methods rely on ad hoc decisions and tend to hard-code the strategy to repair conflicting values. As a con...

Journal: IOSR Journal of Computer Engineering 2012

2009
Tsuyoshi Okita

Parallel corpora are made by human beings. However, as an MT system is an aggregation of state-of-the-art NLP technologies without any intervention of human beings, it is unavoidable that quite a few sentence pairs are beyond its analysis and that will therefore not contribute to the system. Furthermore, they in turn may act against our objectives to make the overall performance worse. Possible...

2001
Helena Galhardas Daniela Florescu Dennis Shasha Eric Simon Cristian-Augustin Saita

The problem of data cleaning, which consists of removing inconsistencies and errors from original data sets, is well known in the area of decision support systems and data warehouses. However, for some applications, existing ETL (Extraction Transformation Loading) and data cleaning tools for writing data cleaning programs are insufficient. One important challenge with them is the design of a da...
