A Multi-Input Machine Learning Approach to Classifying Sex Trafficking from Online Escort Advertisements

Abstract

Sex trafficking victims are often advertised through online escort sites. These ads can be publicly accessed, but law enforcement lacks the resources to comb through hundreds of ads to identify those that may feature sex-trafficked individuals. The purpose of this study was to implement and test multi-input, deep learning (DL) binary classification models to predict the probability of an online escort ad being associated with sex trafficking (ST) activity and to aid in the detection and investigation of ST. Data from 12,350 scraped and classified ads were split into training and test sets (80% and 20%, respectively). Multi-input models that included recurrent neural networks (RNN) for text classification, convolutional neural networks (CNN, specifically EfficientNetB6 or ENET) for image/emoji classification, and neural networks (NN) were trained and used to classify the 20% test set. The best-performing DL model included text and imagery inputs, resulting in an accuracy of 0.82 and an F1 score of 0.70. More importantly, the best classifier (RNN + ENET) correctly identified 14 sites that had probability estimates of 0.845 or greater (1.0 precision); precision was 96% for the multi-input model (NN + RNN + ENET) when only the ads with the highest positive classification probabilities (>0.90) were considered (n = 202 ads). The models developed could be productionalized and piloted with criminal investigators, as they could potentially increase their efficiency in identifying potential ST victims.
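The abstract reports precision only for ads above a probability cutoff (0.845 and 0.90). The following is a minimal sketch of how such a thresholded precision could be computed from a classifier's output. The function name `precision_at_threshold` and the toy probabilities and labels are hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence, Tuple


def precision_at_threshold(
    probs: Sequence[float], labels: Sequence[int], threshold: float
) -> Tuple[Optional[float], int]:
    """Return (precision, n_flagged) over items with probability >= threshold.

    `labels` holds the true class (1 = positive, 0 = negative).
    Precision is None when no item reaches the threshold.
    """
    # Keep the true labels of the items the model flags at this cutoff.
    flagged = [y for p, y in zip(probs, labels) if p >= threshold]
    if not flagged:
        return None, 0
    # Precision = true positives among flagged / all flagged.
    return sum(flagged) / len(flagged), len(flagged)


# Hypothetical toy data, for illustration only (not results from the study).
toy_probs = [0.95, 0.91, 0.88, 0.60, 0.40, 0.97, 0.30]
toy_labels = [1, 1, 0, 1, 0, 1, 0]

high_precision, high_n = precision_at_threshold(toy_probs, toy_labels, 0.90)
low_precision, low_n = precision_at_threshold(toy_probs, toy_labels, 0.50)
```

Raising the cutoff usually trades coverage for precision: fewer ads are flagged, but a larger share of them are true positives, which matches the way the abstract pairs each precision figure with the number of ads considered.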

Similar resources

Classifying Non-Sentential Utterances in Dialogue: A Machine Learning Approach

Citing this paper Please note that where the full-text provided on King's Research Portal is the Author Accepted Manuscript or Post-Print version this may differ from the final Published version. If citing, it is advised that you check and use the publisher's definitive version for pagination, volume/issue, and date of publication details. And where the final published version is provided on th...

A Machine Learning Approach for Classifying Textual Data in Crowdsourcing

Crowdsourcing represents an innovative approach that allows companies to engage a diverse network of people over the internet and use their collective creativity, expertise, or workforce for completing tasks that have previously been performed by dedicated employees or contractors. However, the process of reviewing and filtering the large amount of solutions, ideas, or feedback submitted by a c...

From Linguistics to Literature: A Linguistic Approach to the Study of Linguistic Deviations in the Turkish Divan of Shahriar

Chapter I provides an overview of structural linguistics and touches upon the Saussurean dichotomies with the final goal of exploring their relevance to the stylistic studies of literature. To provide evidence for the significance of the study, Chapter II deals with the controversial issue of linguistics and literature, and presents opposing views which, at the same time, have been central to t...

A Multi-Agent Learning Approach to Online Distributed Resource Allocation

Resource allocation in computing clusters is traditionally centralized, which limits the cluster scale. Effective resource allocation in a network of computing clusters may enable building larger computing infrastructures. We consider this problem as a novel application for multiagent learning (MAL). We propose a MAL algorithm and apply it for optimizing online resource allocation in cluster ne...

Journal

Journal title: Machine Learning and Knowledge Extraction

Year: 2023

ISSN: 2504-4990

DOI: https://doi.org/10.3390/make5020028