A patent keywords extraction method using TextRank model with prior public knowledge

نویسندگان

چکیده

Abstract For large amount of patent texts, how to extract their keywords in an unsupervised way is a very important problem. In existing methods, only the own information texts analyzed. this study, improved TextRank model proposed, which prior public knowledge effectively utilized. Specifically, two following points are first considered: (1) network constructed for each text, (2) based on dictionary data, edges represent interpretation relationship among all words entries. Then, node rank value evaluation formula designed networks introduced. Finally, can be extracted by finding top-k with higher values. our experiments, text clustering task used examine performance proposed method, wherein several comparison experiments executed. Corresponding results demonstrate that, new method markedly obtain better than methods extraction way.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Automatic Keyword Extraction Based on TextRank Using Domain Knowledge

Keyword extraction of scientific articles is beneficial for retrieving scientific articles of a certain topic and grasping the trend of academic development. For the task of keyword extraction for Chinese scientific articles, we adopt the framework of selecting keyword candidates by Document Frequency Accessor Variety(DF-AV) and running TextRank algorithm on a phrase network. To improve domain ...

متن کامل

Interesting Patterns Extraction Using Prior Knowledge

One important challenge in data mining is to extract interesting knowledge and useful information for expert users. Since data mining algorithms extracts a huge quantity of patterns it is therefore necessary to filter out those patterns using various measures. This paper presents IMAK, a part-way interestingness measure between objective and subjective measure, which evaluates patterns consider...

متن کامل

Extraction of Keywords of Novelties From Patent Claims

There are growing needs for patent analysis using Natural Language Processing (NLP)-based approaches. Although NLP-based approaches can extract various information from patents, there are very few approaches proposed to extract those parts what inventors regard as novel or having an inventive step compared to all existing works ever. To extract such parts is difficult even for human annotators ...

متن کامل

Unsupervised Information Extraction with Distributional Prior Knowledge

We address the task of automatic discovery of information extraction template from a given text collection. Our approach clusters candidate slot fillers to identify meaningful template slots. We propose a generative model that incorporates distributional prior knowledge to help distribute candidates in a document into appropriate slots. Empirical results suggest that the proposed prior can brin...

متن کامل

Aspect Extraction with Automated Prior Knowledge Learning

Aspect extraction is an important task in sentiment analysis. Topic modeling is a popular method for the task. However, unsupervised topic models often generate incoherent aspects. To address the issue, several knowledge-based models have been proposed to incorporate prior knowledge provided by the user to guide modeling. In this paper, we take a major step forward and show that in the big data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Complex & Intelligent Systems

سال: 2021

ISSN: ['2198-6053', '2199-4536']

DOI: https://doi.org/10.1007/s40747-021-00343-8