Overview of the International Sexual Predator Identification Competition at PAN-2012

نویسندگان

  • Giacomo Inches
  • Fabio Crestani
چکیده

This contribution presents the evaluation methodology for the identification of potential “sexual predators” in online conversations as part of PAN 2012. We provide details of the realized collection and analyse the submissions of the participants, who had to solve two problems: identify the predators among all the users in the different conversations and identify the part (the lines) of the predator conversations which are the most distinctive of the predator bad behaviour. The methods proposed by the 16 teams participating in the contest made possible the recognition of common pattern for predator identification (e.g. no preprocessing of the conversations, lexical and behavioral analysis, blacklisting of predator terms) as well as possible extension to existing systems (e.g. victimpredator distinction, pre-filtering of not relevant conversations).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sexual predator detection in chats with chained classifiers

This paper describes a novel approach for sexual predator detection in chat conversations based on sequences of classifiers. The proposed approach divides documents into three parts, which, we hypothesize, correspond to the different stages that a predator employs when approaching a child. Local classifiers are trained for each part of the documents and their outputs are combined by a chain str...

متن کامل

Information Retrieval and Classification based Approaches for the Sexual Predator Identification

In this paper we present the evaluation of two different approaches with the aim of tackling the task of Sexual Predator Identification of PAN 2012. The first approach uses a dictionary of sexual terms in order to identify those documents associated in some manner with a sexual predator behavior. In order to do so, we use the sexual terms of the dictionary as a query in an information retrieval...

متن کامل

DYNAMIC COMPLEXITY OF A THREE SPECIES COMPETITIVE FOOD CHAIN MODEL WITH INTER AND INTRA SPECIFIC COMPETITIONS

The present article deals with the inter specific competition and intra-specific competition among predator populations of a prey-dependent three component food chain model consisting of two competitive predator sharing one prey species as their food. The behaviour of the system near the biologically feasible equilibria is thoroughly analyzed. Boundedness and dissipativeness of the system are e...

متن کامل

Quite Simple Approaches for Authorship Attribution, Intrinsic Plagiarism Detection and Sexual Predator Identification Notebook for PAN at CLEF

Tasks such as Authorship Attribution, Intrinsic Plagiarism detection and Sexual Predator Identification are representative of attempts to deceive. In the first two, authors try to convince others that the presented work is theirs, and in the third there is an attempt to convince readers to take actions based on false beliefs or ill-perceived risks. In this paper, we discuss our approaches to th...

متن کامل

Overview of the International Authorship Identification Competition at PAN-2011

This paper gives an overview of the evaluation methodology applied to authorship identification solutions as part of PAN 2011. The two variations of authorship identification that were explored were authorship attribution, determining which of a known set of authors wrote a text, and authorship verification, determining if a specific authors did or did not write a text. We summarize the methods...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012