An LSTM-Based Plagiarism Detection via Attention Mechanism and a Population-Based Approach for Pre-training Parameters with Imbalanced Classes
Authors
Abstract
Plagiarism is one of the leading problems in academic and industrial environments; plagiarism detection aims to find similar items in a typical document or source code. This paper proposes an architecture based on Long Short-Term Memory (LSTM) and an attention mechanism, called LSTM-AM-ABC, boosted by a population-based approach for parameter initialization. Gradient-based optimization algorithms such as back-propagation (BP) are widely used in the literature for the learning process of the LSTM, the attention mechanism, and the feed-forward neural network, but they suffer from problems such as getting stuck in local optima. To tackle this problem, population-based metaheuristic (PBMH) algorithms can be used. To this end, this paper employs a PBMH algorithm, artificial bee colony (ABC), to moderate the problem. Our proposed algorithm finds the initial values for model learning in the LSTM, the attention mechanism, and the feed-forward neural network simultaneously. In other words, ABC finds a promising starting point for the BP algorithm. For evaluation, we compare our proposed algorithm with both conventional and population-based methods. The results clearly show that the proposed method can provide competitive performance.
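The idea in the abstract, letting ABC pick a promising starting point and then handing over to a gradient-based learner, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: a two-parameter logistic model stands in for the LSTM, attention, and feed-forward weights, and the toy data, the function names (`abc_init`, `gradient_descent`), and all hyperparameter values are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def loss(w, data):
    # Mean binary cross-entropy of the stand-in model p = sigmoid(w[0]*x + w[1]).
    total = 0.0
    for x, y in data:
        p = min(max(sigmoid(w[0] * x + w[1]), 1e-12), 1.0 - 1e-12)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(data)


def grad(w, data):
    # Analytic gradient of the loss above.
    gw = gb = 0.0
    for x, y in data:
        err = sigmoid(w[0] * x + w[1]) - y
        gw += err * x
        gb += err
    return [gw / len(data), gb / len(data)]


def abc_init(objective, dim, bounds, n_sources=10, n_cycles=30, limit=5, rng=None):
    # Basic artificial bee colony search for a good initial parameter vector.
    # Assumes objective(w) >= 0 so that 1 / (1 + cost) is a valid fitness.
    rng = rng or random.Random(0)
    lo, hi = bounds
    sources = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_sources)]
    costs = [objective(s) for s in sources]
    trials = [0] * n_sources
    best_cost = min(costs)
    best = list(sources[costs.index(best_cost)])

    def try_neighbor(i):
        # Perturb one coordinate of source i relative to a random other source.
        k = rng.choice([j for j in range(n_sources) if j != i])
        d = rng.randrange(dim)
        cand = list(sources[i])
        cand[d] += rng.uniform(-1.0, 1.0) * (sources[i][d] - sources[k][d])
        cand[d] = min(max(cand[d], lo), hi)
        c = objective(cand)
        if c < costs[i]:  # greedy selection
            sources[i], costs[i], trials[i] = cand, c, 0
        else:
            trials[i] += 1

    for _ in range(n_cycles):
        for i in range(n_sources):  # employed bees
            try_neighbor(i)
        fitness = [1.0 / (1.0 + c) for c in costs]
        for i in rng.choices(range(n_sources), weights=fitness, k=n_sources):
            try_neighbor(i)  # onlooker bees favor fitter sources
        for i in range(n_sources):  # memorize the best source so far
            if costs[i] < best_cost:
                best_cost, best = costs[i], list(sources[i])
        for i in range(n_sources):  # scouts replace exhausted sources
            if trials[i] > limit:
                sources[i] = [rng.uniform(lo, hi) for _ in range(dim)]
                costs[i] = objective(sources[i])
                trials[i] = 0
    return best, best_cost


def gradient_descent(w, data, lr=0.5, steps=100):
    # Plain gradient descent, standing in for BP on the full network.
    w = list(w)
    for _ in range(steps):
        g = grad(w, data)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w


# Hypothetical toy data: (feature, label) pairs that are not linearly separable.
DATA = [(-2.0, 0), (-1.0, 0), (-0.5, 1), (0.5, 0), (1.0, 1), (2.0, 1)]

w_start, start_cost = abc_init(lambda w: loss(w, DATA), dim=2, bounds=(-5.0, 5.0))
w_trained = gradient_descent(w_start, DATA)
print("loss after ABC init:", start_cost)
print("loss after gradient descent:", loss(w_trained, DATA))
```

In this sketch ABC only chooses where gradient descent begins; the gradient-based refinement then proceeds as usual. Applying the same pattern to a real network would mean flattening all trainable weights into one vector and using the training loss as the ABC objective, which is considerably more expensive per evaluation.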
Similar resources
English-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism, defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them”, poses a major challenge to the spread and publication of knowledge. Plagiarism falls into four categories: direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism, which is sometimes referred to as cross-li...
A Set-Based Approach to Plagiarism Detection
This paper describes our approach to the Detailed Analysis subtask of the PAN 2012 competition. Our experiments deal only with monolingual plagiarism cases. We use a simple set-based algorithm that employs Dice’s coefficient as a similarity measure. Furthermore, we employ basic strategies from Information Retrieval and Natural Language Processing for stop word removal and language detection. W...
Anomaly-based Web Attack Detection: The Application of Deep Neural Network Seq2Seq With Attention Mechanism
Today, the use of the Internet and websites has become an integral part of people’s lives, and most activities and important data reside on websites. Thus, attempts to intrude into these websites have grown exponentially. Intrusion detection systems (IDS) for web attacks are one approach to protecting users. However, these systems suffer from drawbacks such as low accuracy in ...
A New Type-II Fuzzy Logic Based Controller for Non-linear Dynamical Systems with Application to 3-PSP Parallel Robot
Type-II fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. Type-II fuzzy logic controllers are, however, newer and more promising approaches that have recently been applied to various fields due to their significant contribution, especially when noise (as an important instance of uncertainty) emerges. During the design of type-I fuz...
Plagiarism Detection Based on a Novel Trie-based Approach
Nowadays, plagiarism detection has become one of the major problems in the text mining field. New technologies have made plagiarism easier and more feasible. Therefore, it is vital to develop automatic systems to detect plagiarism in different contents. In this paper, we propose using a trie to compare source and suspicious text documents. We use PersianPlagDet text documents as a case study....
Journal
Journal title: Lecture Notes in Computer Science
Year: 2021
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-030-92238-2_57