text retrieval

Search results for: text retrieval

Number of results: 238516

TPIRS: A System for Document Indexing Reduction on WebCLEF

2005

David Pinto Héctor Jiménez-Salazar Paolo Rosso Emilio Sanchis Arnal

In this paper we present the results of BUAP/UPV universities in WebCLEF, a particular task of CLEF 2005. Particularly, we evaluate our information retrieval system in the bilingual English to Spanish track. Our system uses a term reduction process based on the Transition Point technique. Our results show that it is possible to reduce the number of terms to index, thereby improving the performa...

Recall is the Proper Evaluation Metric for Word Segmentation

2017

Yan Shao Christian Hardmeier Joakim Nivre

We extensively analyse the correlations and drawbacks of conventionally employed evaluation metrics for word segmentation. Unlike in standard information retrieval, precision favours under-splitting systems and therefore can be misleading in word segmentation. Overall, based on both theoretical and experimental analysis, we propose that precision should be excluded from the standard evaluation ...
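
As a small illustration of the metrics this abstract discusses, the sketch below computes word-level precision and recall for two toy segmentations. It assumes the common convention that a predicted word counts as correct only when its character span matches a gold word exactly; the function names and the toy strings are hypothetical and are not taken from the paper.

```python
def word_spans(words):
    """Map a list of words to the set of (start, end) character spans they cover."""
    spans, start = set(), 0
    for word in words:
        spans.add((start, start + len(word)))
        start += len(word)
    return spans


def precision_recall(gold_words, pred_words):
    """Word-level precision and recall under exact span matching (assumed convention)."""
    gold, pred = word_spans(gold_words), word_spans(pred_words)
    correct = len(gold & pred)
    return correct / len(pred), correct / len(gold)


# Toy data (hypothetical): each system makes exactly one boundary error.
gold = ["ab", "cd", "e", "f"]
under_split = ["ab", "cd", "ef"]          # one gold boundary is missed
over_split = ["a", "b", "cd", "e", "f"]   # one spurious boundary is added

p_under, r_under = precision_recall(gold, under_split)
p_over, r_over = precision_recall(gold, over_split)

print(f"under-splitting: P={p_under:.3f} R={r_under:.3f}")
print(f"over-splitting:  P={p_over:.3f} R={r_over:.3f}")
```

In this toy case the under-splitting output gets the higher precision (2/3 against 3/5) but the lower recall (1/2 against 3/4), even though both outputs contain a single boundary error. This is only one constructed example, not a reproduction of the paper's analysis.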

BUAP-UPV TPIRS: A System for Document Indexing Reduction at WebCLEF

2005

David Pinto Héctor Jiménez-Salazar Paolo Rosso Emilio Sanchis Arnal

In this paper we present the results of BUAP/UPV universities in WebCLEF, a particular task of CLEF 2005. Particularly, we evaluate our information retrieval system at the bilingual “English to Spanish” task. Our system uses a term reduction process based on the Transition Point technique. Our results show that it is possible to reduce the number of terms to index, thereby improving the perform...

Information Retrieval on Noisy Text

2004

David Grangier Alessandro Vinciarelli Hervé Bourlard

Stylistic Variation in an Information Retrieval Experiment

Journal: CoRR 1994

Jussi Karlgren

Texts exhibit considerable stylistic variation. This paper reports an experiment where a corpus of documents (N = 75 000) is analyzed using various simple stylistic metrics. A subset (n = 1000) of the corpus has been previously assessed to be relevant for answering given information retrieval queries. The experiment shows that this subset differs significantly from the rest of the corpus in term...

The Study on Key Technology of Mongolian Full-Text Retrieval

2011

S. Loglo Sarula

With the development of the Mongolian corpus and website, an increasing number of people have focused their attention on the accurate, complete and fast retrieval of the information that they need. In this paper, such key technological issues in Mongolian full-text retrieval as character shape indexing, drawing the Mongolian verb stem and the automatic recognition of the Mongolian homographic w...

Content Based Image and Video Retrieval Using Embedded Text

2006

Chinmaya Misra Shamik Sural

Extraction of text from image and video is an important step in building efficient indexing and retrieval systems for multimedia databases. We adopt a hybrid approach for such text extraction by exploiting a number of characteristics of text blocks in color images and video frames. Our system detects both caption text as well as scene text of different font, size, color and intensity. We have d...

Overview of the Eighth Text REtrieval Conference (TREC-8)

1999

Ellen M. Voorhees Donna K. Harman

s of U.S. DOE publications 184 226,087 111 120.4

Evaluation of Web Retrieval Methods Using Anchor Text

2002

Kenji Tateishi Hideki Kawai Susumu Akamine Katsushi Matsuda Toshikazu Fukushima

In this paper, we evaluate two types of anchor texts: a page anchor and a site anchor. Since the anchor text tends to summarize information referred ahead, it can be expected that the terms appearing there have important meaning in information retrieval. We introduce a retrieval method to give high priority to the terms in the anchor text. In the experiment, we compared the proposed method with...

Text Mining Promise and Reality

2006

Antonina Durfee

This paper provides a taxonomy of common text mining tasks and approaches. We surveyed the market of modern text mining tools, compared their features and grouped them into information retrieval, standard or intelligent text mining categories in order to examine how theoretical promises materialized in modern technologies. The study is the first one in a series of studies trying to provide an und...
