نتایج جستجو برای: tfidf vector space model

تعداد نتایج: 2616913  

2000
Eiichiro Sumita

Building a bilingual dictionary for transfer in a machine translation system is conventionally done by hand and is very time-consuming. In order to overcome this bottleneck, we propose a new mechanism for lexical transfer, which is simple and suitable for learning from bilingual corpora. It exploits a vector-space model developed in information retrieval research. We present a preliminary resul...

2007
Yannis Tzitzikas Yannis Theoharis

The Vector Space Model (VSM) is probably the most widely used model for retrieving information from text collections (and recently from over other kinds of corpi). Assuming this model, we study the problem of finding the best query that ”names” (or describes) a given (unordered or ordered) set of objects. We formulate several variations of this problem and we provide methods and algorithms for ...

2003
Anca Doloc-Mihu Vijay V. Raghavan Peter Bollmann-Sdorra

Many applications involving similarity search use the QBIC Euclidian distance to match two color histograms. To alleviate certain problems associated with this approach, which is based on a distance metric, in this paper, we propose a Color-Color Similarity Retrieval Approach to compute the similarities between images. This approach, based on the similarity matrix between feature vectors, leads...

Journal: :Journal of Natural Language Processing 2003

1995
Amit Singhal Gerard Salton

Vast amounts of text are now available in machine-readable form and can be processed electronically. The vector space model of text processing has been widely used and has consistently produced superior retrieval results for the last thirty years. Traditionally , information retrieval research has concentrated on improving on-demand retrieval of useful textual information. Often a user has no p...

Journal: :IEEE Software 1997
Dik Lun Lee Huei Chuang Kent E. Seamons

Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings. fficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a d...

Journal: :CoRR 2017
Barathi Ganesh H. B. M. Anand Kumar K. P. Soman

In this era of digitization, knowing the user’s sociolect aspects have become essential features to build the user specific recommendation systems. These sociolect aspects could be found by mining the user’s language sharing in the form of text in social media and reviews. This paper describes about the experiment that was performed in PAN Author Profiling 2017 shared task. The objective of the...

2013
Jacob Andreas Zoubin Ghahramani

We present a novel compositional, generative model for vector space representations of meaning. This model reformulates earlier tensor-based approaches to vector space semantics as a top-down process, and provides efficient algorithms for transformation from natural language to vectors and from vectors to natural language. We describe procedures for estimating the parameters of the model from p...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید