نتایج جستجو برای: text classification rocchio

تعداد نتایج: 641860  

Journal: :Jurnal Ilmu Komputer dan Agri-Informatika 2018

2003
Fan Li Yiming Yang

This paper presents a formal analysis of popular text classification methods, focusing on their loss functions whose minimization is essential to the optimization of those methods, and whose decomposition into the trainingset loss and the model complexity enables cross-method comparisons on a common basis from an optimization point of view. Those methods include Support Vector Machines, Linear ...

2005
Marco Degemmis Pasquale Lops Stefano Ferilli Nicola Di Mauro Teresa Maria Altomare Basile Giovanni Semeraro

Exploring digital collections to find information relevant to a user’s interests is a challenging task. Algorithms designed to solve this relevant information problem base their relevance computations on user profiles in which representations of the users’ interests are maintained. This paper presents a new method, based on the classical Rocchio algorithm for text categorization, able to discov...

Journal: :Int. J. Systems Science 2006
Marco Degemmis Pasquale Lops Stefano Ferilli Nicola Di Mauro Teresa Maria Altomare Basile Giovanni Semeraro

Exploring digital collections to find information relevant to a user’s interests is a challenging task. Algorithms designed to solve this relevant information problem base their relevance computations on user profiles in which representations of the users’ interests are maintained. This article presents a new method, based on the classic Rocchio algorithm for text categorization, able to discov...

2012
Amin Mantrach Jean-Michel Renders

Most of the approaches in multi-view categorization use early fusion, late fusion or co-training strategies. We propose here a novel classification method that is able to efficiently capture the interactions across the different modes. This method is a multi-modal extension of the Rocchio classification algorithm – very popular in the Information Retrieval community. The extension consists of s...

2002
Carl Sable Kathy McKeown Vasileios Hatzivassiloglou

This paper explores the use of a statistical technique known as density estimation to potentially improve the results of text categorization systems which label documents by computing similarities between documents and categories. In addition to potentially improving a system's overall accuracy, density estimation converts similarity scores to probabilities. These probabilities provide con denc...

Journal: :Journal of Machine Learning Research 2003
Koby Crammer Yoram Singer

We describe a new family of topic-ranking algorithms for multi-labeled documents. The motivation for the algorithms stem from recent advances in online learning algorithms. The algorithms are simple to implement and are also time and memory efficient. We provide a unified analysis of the family of algorithms in the mistake bound model. We then discuss experiments with the proposed family of top...

Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...

2000
Yi Zhang James P. Callan

We built a filtering system YFILTER this year, which we used for experiments on profile updating and thresholds setting. Our focus is using incremental Rocchio for introducing new query terms and term weighting. Although 1, 0.5, 0.25 is a widely used Rocchio ratio for query expansion based on relevance feedback, we found that the optimal setting for information filtering is corpus and profile d...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید