Justsystem at NTCIR-5 Patent Classification

نویسندگان

  • Tetsuya Tashiro
  • Masaki Rikitoku
  • Takashi Nakagawa
چکیده

Justsystem participated in Patent Classification Subtask at the Fifth NTCIR workshop. This paper overviews our machine learning-based patent application classification system. Straightforward application of Naive Bayes classifier was effective in theme categorization subtask that has a non-hierarchical category structure. In F-term categorization subtask, we regarded the complicated F-term categorization system as a tree with depth 2. We constructed the document classifier based on the Support Vector Machine and classify documents on this tree. Platt’s sigmoid fitting for SVM output was used for the document ranking. We confirmed that this method was effective for this subtask.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Notes on the Limits of CLIR Effectiveness: NTCIR-2 Evaluation Experiments at Justsystem

NTCIR-2 evaluation experiments at the Justsystem site are described with a focus on comparative study of CLIR effectiveness with monolingual retrieval effectiveness of the same retrieval engine. Experiments on the effects of phrasal translation, indexing of translated phrasal terms, pre-translation feedback and parallel documents feedback in diverse retrieval settings, are reported. The results...

متن کامل

Overview of Classification Subtask at NTCIR-5 Patent Retrieval Task

This paper describes Classification Subtask at NTCIR-5 Patent Retrieval Task. We perform two subtasks for patent classification using a multi-dimensional classification structure called “F-term (File Forming Term) classification system”. The first one is Theme Categorization Subtask, where each participant classifies a patent into technological fields called themes. The second one is F-term Cat...

متن کامل

Overview of Classification Subtask at NTCIR-6 Patent Retrieval Task

This paper describes the Classification Subtask of the NTCIR-5 Patent Retrieval Task. The purpose of this subtask is to evaluate the methods of classifying patents into multi-dimensional classification structures called F-term (File Forming Term) classification systems. We report on how this subtask was designed, the test collection released, and the results of the evaluation.

متن کامل

Overview of Patent Retrieval Task at NTCIR-5

In the Fifth NTCIR Workshop, we organized the Patent Retrieval Task and performed three subtasks; Document Retrieval, Passage Retrieval, and Classification. This paper describes the Document Retrieval Subtask and Passage Retrieval Subtask, both of which were intended for patent-to-patent invalidity search task. We show the evaluation results of the groups participating in those subtasks.

متن کامل

Justsystem-Clairvoyance CLIR Experiments at NTCIR-4 Workshop

At the NTCIR-4 workshop, Justsystem Corporation and Clairvoyance Corporation collaborated in participating in the Cross-Language Retrieval Task (CLIR). We submitted results to the sub-tracks of SLIR and BLIR. For the SLIR track, we submitted Chinese, English, and Japanese monolingual runs. For the BLIR track, we submitted Japanese-English and Chinese-English runs. The major goal of our particip...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005