Bayesian Torrent Classification by File Name and Size Only
نویسندگان
چکیده
Torrent traffic, much of which is assumed to be illegal downloads of copyrighted content, accounts for up to 35% of internet downloads. Yet, the process of classification and identification of these downloads is unclear, and original data for such studies is often unavailable. Many torrent items lack supporting description or meta-data, in which case only file name and size are available. We describe a novel Bayesian network based classifier system that predicts medium category, pornographic content and risk of fakes and malware based on torrent name and size, optionally supplemented with external databases of titles and actors. We show that our method outperforms a commercial benchmark system and has the potential to rival human classifiers.
منابع مشابه
Solving Fuzzy Partial Differential Equation by Differential Transformation Method
Normal 0 false false false ...
متن کاملP2P Network Trust Management Survey
Peer-to-peer applications (P2P) are no longer limited to home users, and start being accepted in academic and corporate environments. While file sharing and instant messaging applications are the most traditional examples, they are no longer the only ones benefiting from the potential advantages of P2P networks. For example, network file storage, data transmission, distributed computing, and co...
متن کاملThe Deficit-based Distribution Algorithm in Bit Torrent Swarms for Peer-to-Peer File Sharing Allocation
Peer-to-peer file-sharing claims suffer from a fundamental problem of unfairness. Free-riders cause slower download times for others by contributing little or no upload bandwidth while consuming much download bandwidth. Previous attempts to address this fair bandwidth allocation problem suffer from slow peer discovery, inaccurate predictions of neighboring peers’ bandwidth allocations, underuti...
متن کاملEstimation of genetic parameters of litter size in Moghani sheep using threshold model via Bayesian approach
This study was conducted to estimate the genetic parameters of litter size (LS) in Moghani sheep using threshold model via Bayesian approach. The data originated from the Jafar-Abad Station of Ardabil province, Iran, and included 9698 lactation records of 4977 ewes with lambings from 1995 until 2010. The pedigree file consisted of data on animals born from 1987 to 2010. The significance of fixe...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016