Detecting Automatically Generated Sentences with Grammatical Structure Similarity

Authors

  • Minh-Tien Nguyen
  • Cyril Labbé
Abstract

Detecting automatically generated papers is a new field of research. However, all current approaches work at the document level and cannot detect a small amount of generated text inside a large body of genuinely written text. This paper presents the Grammatical Structure Similarity (GSS) measure to detect sentences or short fragments produced by known generators. The proposed approach is tested against common machine learning methods, and its ability to detect a modified generator is also tested.


Similar resources

Detecting Grammatical Errors in Text using a Ngram-based Ruleset

Applications like word processors and other writing tools typically include a grammar checker. The purpose of a grammar checker is to identify sentences that are grammatically incorrect based on the syntax of the language. The proposed grammar checker is a rule-based system to identify sentences that are most likely to contain errors. The set of rules is automatically generated from a part of ...


A Sentence Semantic Similarity Calculating Method Based on Segmented Semantic Comparison

In order to calculate sentence semantic similarity more accurately, a sentence semantic similarity calculating method based on segmented semantic comparison was proposed. Sentences would be divided into the trunk and the other segments by some grammar rules, and each segment might be divided into several shorter segments. When calculating the sentence semantic similarity between two sentences, ...


Automatic Analysis of Semantic Coherence in Academic Abstracts Written in Portuguese

SciPo is a system whose ultimate goal is to support novice writers in producing academic texts in Brazilian Portuguese through presentation of critiques and suggestions. Currently, it focuses on the rhetorical structure of texts, being capable of automatically detecting and criticizing the rhetorical structure of Abstract sections. We describe a system that enhances SciPo’s functionality by eva...


Impersonal Russian Sentences with the Subject in the Accusative Case and the Meaning of a Person's Physical Condition in the Terms of Persian Language

In this article, considering impersonal sentences with the subject in the accusative case, which conveys the physical state of a living being, an attempt is made to compare them with the Persian correlates. This type of impersonal sentences can cause different problems for the Persian-speaking students due to their grammatical specificity (e.g. the uses of the subject in the accusative, rather ...


A Random, Semantically Appropriate Sentence Generator for Speaker Verification

We describe two systems for automatically generating English sentences, and evaluate the suitability of their output for speaker verification. The first system, SUSGen, generates grammatical but semantically anomalous sentences of controlled length, vocabulary and phonetic content. The second system, SASGen, extends SUSGen to generate a greater variety of sentences and ones that are, for the mo...




Publication date: 2017