DIRS: Disconnected Information Retrieval System
Authors
Abstract
The World Wide Web gives individuals access to huge amounts of data, including information that is also published in traditional formats such as news copy. This study addresses a desire to blend these two media so that consumers can move transparently from a hard copy of a given article to its electronic copy. Document retrieval experiments were performed to determine the feasibility of a handheld scanning device used to mark traditional newspaper articles for subsequent online retrieval. Several thousand random articles were fetched from two popular news search services to emulate the scanning of print media that is also available online. Experiments were performed on these articles to quantify the success of searching with various article attributes. Query success is quantified by two measures: whether or not the article is found, and how deep into the query results we must parse to locate the correct article. When searching on the title of a news article, the article was retrieved correctly 98% of the time with an average depth of one. When searching on a randomly chosen 30-character string from the article, 92% of the articles were retrieved successfully with an average depth of two.
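The two success measures described in the abstract (whether the article is found, and its depth in the result list) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `random_snippet`, `result_depth`, and `summarize` are hypothetical, the search service is represented only by an already-fetched list of result identifiers, and depth is assumed to be a 1-based rank.

```python
import random
from typing import List, Optional, Sequence, Tuple


def random_snippet(text: str, length: int = 30,
                   rng: Optional[random.Random] = None) -> str:
    """Return a randomly positioned substring of `length` characters.

    Stands in for the randomly chosen 30-character string used as a query.
    If the text is shorter than `length`, the whole text is returned.
    """
    rng = rng or random.Random()
    if len(text) <= length:
        return text
    start = rng.randrange(len(text) - length + 1)
    return text[start:start + length]


def result_depth(results: Sequence[str], target_id: str) -> Optional[int]:
    """Return the 1-based rank of `target_id` in `results`, or None if absent."""
    for rank, doc_id in enumerate(results, start=1):
        if doc_id == target_id:
            return rank
    return None


def summarize(depths: List[Optional[int]]) -> Tuple[float, Optional[float]]:
    """Return (success rate, mean depth over the successful queries)."""
    found = [d for d in depths if d is not None]
    success_rate = len(found) / len(depths) if depths else 0.0
    mean_depth = sum(found) / len(found) if found else None
    return success_rate, mean_depth
```

In use, each article would contribute one depth value (or `None` when it is not found), and `summarize` would then yield figures of the same kind as the success rates and average depths quoted in the abstract.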
Similar resources
Feature Weighting for Improving Document Image Retrieval System Performance
Feature weighting is a technique used to approximate the optimal degree of influence of individual features. This paper presents a feature weighting method for Document Image Retrieval System (DIRS) based on keyword spotting. In this method, we weight the feature using coefficient of multiple correlations. Coefficient of multiple correlations can be used to describe the synthesized effects and ...
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...
An Event Drive Integration Reasoning Scheme for Handling Dynamic Threats in an Unstructured Environment
This paper presents an attempt to devise and develop a domain-independent reasoning system (DIRS) scheme for handling dynamic threats, and uses the scheme for automated route planning of military vehicles in an unstructured environment. Automated route planning is a very important branch in applications of artificial intelligence. In a dynamic unstructured environment, instead of simply using s...
PCA-Based Relevance Feedback in Document Image Retrieval
Research has been devoted in the past few years to relevance feedback as an effective solution to improve performance of information retrieval systems. Relevance feedback refers to an interactive process that helps to improve the retrieval performance. In this paper we propose the use of relevance feedback to improve document image retrieval System (DIRS) performance. This paper compares a vari...
Improved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function, and query analyzer are the main components of an information retrieval system. Document retrieval systems are fundamentally based on Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
Publication date: 2002