An LSTM-Based Plagiarism Detection via Attention Mechanism and a Population-Based Approach for Pre-training Parameters with Imbalanced Classes

Abstract

Plagiarism is one of the leading problems in academic and industrial environments; plagiarism detection aims to find similar items in a typical document or source code. This paper proposes an architecture based on Long Short-Term Memory (LSTM) and an attention mechanism, called LSTM-AM-ABC, boosted by a population-based approach for parameter initialization. Gradient-based optimization algorithms such as back-propagation (BP) are widely used in the literature for the learning process of the LSTM, the attention mechanism, and the feed-forward neural network, but they suffer from problems such as getting stuck in local optima. To tackle this problem, population-based metaheuristic (PBMH) algorithms can be used. To this end, this paper employs a PBMH algorithm, artificial bee colony (ABC), to moderate the problem. The proposed algorithm finds the initial values for model learning in all of the LSTM, the attention mechanism, and the feed-forward neural network simultaneously. In other words, ABC finds a promising starting point for the BP algorithm. For evaluation, we compare our algorithm with both conventional and population-based methods. The results clearly show that the proposed method can provide competitive performance.
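The idea of using ABC to pick a starting point for gradient-based training can be illustrated with a minimal sketch. This is not the paper's implementation: a toy one-neuron tanh model stands in for the LSTM, attention, and feed-forward weights, and gradient descent with finite-difference gradients stands in for BP. All names (`DATA`, `loss`, `abc_init`, `refine`) and all hyperparameter values are hypothetical and chosen only for illustration.

```python
import math
import random

# Toy data for a one-neuron model y = tanh(w0 * x + w1); purely illustrative.
DATA = [(x / 4.0, math.tanh(1.5 * x / 4.0 - 0.5)) for x in range(-8, 9)]

def loss(w):
    """Mean squared error of the toy model on DATA."""
    return sum((math.tanh(w[0] * x + w[1]) - y) ** 2 for x, y in DATA) / len(DATA)

def abc_init(loss_fn, dim, n_sources=10, limit=5, iters=40, bound=3.0, seed=0):
    """Artificial bee colony search for a promising starting weight vector."""
    rng = random.Random(seed)

    def random_source():
        return [rng.uniform(-bound, bound) for _ in range(dim)]

    sources = [random_source() for _ in range(n_sources)]
    costs = [loss_fn(s) for s in sources]
    trials = [0] * n_sources
    best_i = min(range(n_sources), key=costs.__getitem__)
    best, best_cost = list(sources[best_i]), costs[best_i]

    def try_neighbour(i):
        nonlocal best, best_cost
        # Perturb one coordinate relative to a different, randomly chosen source.
        k = rng.choice([j for j in range(n_sources) if j != i])
        d = rng.randrange(dim)
        cand = list(sources[i])
        cand[d] += rng.uniform(-1.0, 1.0) * (sources[i][d] - sources[k][d])
        c = loss_fn(cand)
        if c < costs[i]:  # greedy selection: keep the candidate only if it is better
            sources[i], costs[i], trials[i] = cand, c, 0
            if c < best_cost:
                best, best_cost = list(cand), c
        else:
            trials[i] += 1

    for _ in range(iters):
        for i in range(n_sources):  # employed bees: one trial per source
            try_neighbour(i)
        fitness = [1.0 / (1.0 + c) for c in costs]  # onlookers favour low-cost sources
        for i in rng.choices(range(n_sources), weights=fitness, k=n_sources):
            try_neighbour(i)
        for i in range(n_sources):  # scouts: abandon sources that stopped improving
            if trials[i] > limit:
                sources[i] = random_source()
                costs[i], trials[i] = loss_fn(sources[i]), 0
                if costs[i] < best_cost:
                    best, best_cost = list(sources[i]), costs[i]
    return best, best_cost

def refine(loss_fn, w, lr=0.5, steps=200, eps=1e-6):
    """Gradient descent with finite-difference gradients (a stand-in for BP)."""
    w, cost = list(w), loss_fn(w)
    for _ in range(steps):
        grad = []
        for d in range(len(w)):
            shifted = list(w)
            shifted[d] += eps
            grad.append((loss_fn(shifted) - cost) / eps)
        cand = [wi - lr * g for wi, g in zip(w, grad)]
        cand_cost = loss_fn(cand)
        if cand_cost < cost:
            w, cost = cand, cand_cost
        else:
            lr *= 0.5  # step did not improve the loss: shrink the learning rate
    return w, cost

w_init, init_cost = abc_init(loss, dim=2)
w_final, final_cost = refine(loss, w_init)
print(f"ABC start: loss={init_cost:.6f}; after refinement: loss={final_cost:.6f}")
```

In a real setting the weight vector would be the flattened parameters of the whole network and `loss` would be the training loss, so each ABC evaluation costs one forward pass over the evaluation data.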

Related Resources

English-Persian Plagiarism Detection based on a Semantic Approach

Plagiarism, which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them”, poses a major challenge to knowledge spread and publication. Plagiarism has been placed in four categories: direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...

A Set-Based Approach to Plagiarism Detection

This paper describes our approach to the Detailed Analysis subtask of the PAN 2012 competition. Our experiments deal only with monolingual plagiarism cases. We use a simple set-based algorithm that employs Dice’s coefficient as a similarity measure. Furthermore, we employ basic strategies from Information Retrieval and Natural Language Processing for stop word removal and language detection. W...

Anomaly-based Web Attack Detection: The Application of Deep Neural Network Seq2Seq With Attention Mechanism

Today, the use of the Internet and Internet sites has become an integral part of people’s lives, and most activities and important data reside on websites. Thus, attempts to intrude into these websites have grown exponentially. Intrusion detection systems (IDS) for web attacks are one approach to protecting users. However, these systems suffer from drawbacks such as low accuracy in ...

A New Type-II Fuzzy Logic Based Controller for Non-linear Dynamical Systems with Application to 3-PSP Parallel Robot

Type-II fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. Type-II fuzzy logic controllers are, however, newer and more promising approaches that have recently been applied to various fields due to their significant contribution, especially when noise (an important instance of uncertainty) emerges. During the design of type-I fuz...

Plagiarism Detection Based on a Novel Trie-based Approach

Plagiarism detection has become one of the major problems in the text mining field. New technologies have made plagiarism easier and more feasible. Therefore, it is vital to develop an automatic system to detect plagiarism in different contents. In this paper, we propose a trie to compare source and suspicious text documents. We use PersianPlagDet text documents as a case study....

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-92238-2_57