نتایج جستجو برای: record matching

تعداد نتایج: 200532  

2014
S Suguna

Record linkage is traditionally performed among the entities of same type. It can be done based on entities that may or may not share a common identifier. In this paper we propose a new linkage method that performs linkage between matching entities of different data types as well. The proposed technique is based on one-class clustering tree that characterizes the entities which are to be linked...

2004
Rainer Schnell Tobias Bachteler Stefan Bender

We developed a record-linkage toolbox in order to compare the performance of various string-similarity measures for German surnames. This ”Matching Tool-Box” (MTB) is made up by independent, highly portable JAVA-programs. MTB is currently used for prototyping pre-processing tools and the empirical comparison of string-similarity measures. Furthermore, MTB has been used successfully in sociologi...

2009
Wenfei Fan Floris Geerts Xibei Jia

Real-life date is often dirty and costs billions of pounds to businesses worldwide each year. This paper presents a promising approach to improving data quality. It effectively detects and fixes inconsistencies in real-life data based on conditional dependencies, an extension of database dependencies by enforcing bindings of semantically related data values. It accurately identifies records fro...

2016
Alfonso Murolo Moira C. Norrie

Recent trends in website design have an impact on methods used for web data extraction. Many existing methods rely on structural analysis of web pages and, with the introduction of CSS, table-based layouts are no longer used, while responsive design means that layout and presentation are dependent on browsing context which also makes the use of visual clues more complex. We present DeepDesign, ...

ژورنال: :مجله گیاهشناسی ایران 2004
مهین جانی قربان

گونه trigonella grandiflora bunge که از استان خراسان جمع آوری شده، برای اولین بار از ایران گزارش می شود.

2012
David Källberg Nikolai Leonenko Oleg Seleznjev

Numerous entropy-type characteristics (functionals) generalizing Rényi entropy are widely used in mathematical statistics, physics, information theory, and signal processing for characterizing uncertainty in probability distributions and distribution identification problems. We consider estimators of some entropy (integral) functionals for discrete and continuous distributions based on the numb...

2000
Francesco Caruso Munir Cochinwala Uma Ganapathy Gail Lalk Paolo Missier

This demonstration illustrates how a comprehensive database reconciliation tool can provide the ability to characterize data-quality and data-reconciliation issues in complex real-world applications. Telcordia’s data reconciliation and data quality analysis tool includes rapid generation of appropriate pre-processing and matching rules applied to a training set created from samples of the data....

2011
Antoine Chevrette Andrew Grant Elizabeth Sheridan Richard Pebody Indrajit Bhattacharya Marianne Winglee Xin Luna Dong Laure Berti-Equille

At Statistics Canada, matching data without unique identifiers is a common practice. The probabilistic record linkage method developed by Ivan Fellegi and Allan Sunter 1 is the primary method recommended by Statistics Canada for this type of matching. In recent decades, work began to generalize the Fellegi–Sunter algorithm in order to offer our community the opportunity to use this methodology ...

Journal: :CoRR 2014
Jeffrey Sukharev Leonid Zhukov Alexandrin Popescul

Name matching is a key component of systems for entity resolution or record linkage. Alternative spellings of the same names are a common occurrence in many applications. We use the largest collection of genealogy person records in the world together with user search query logs to build name matching models. The procedure for building a crowd-sourced training set is outlined together with the p...

Journal: :Real-Time Imaging 1996
Ming-Syan Chen Chung-Sheng Li Philip S. Yu

In this paper we examine a content-based method to download/record digital video from networks to client stations and home VCR's. The method examined is an alternative to the conventional time-based method used for recording analogue video. Various approaches to probing the video content and to triggering the VCR operations are considered, including frame signature matching, program barcode mat...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید