A Hybrid TF-IDF and RNN Model for Multi-label Classification of the Deep and Dark Web
نویسندگان
چکیده
The classification of content on the deep and dark web has been a topic interest for researchers. Researchers focus adopting more efficient effective methods as data available platforms continues to grow. Multi-label is approach simultaneously categorizing into multiple classes. To address this, hybrid combining Term Frequency-Inverse Document Frequency (TF-IDF) Recurrent Neural Network (RNN) proposed. involves preprocessing dataset Hypertext Markup Language (HTML) documents, selecting specific HTML tags generate embeddings using TF-IDF, an RNN model multi-label classification. proposed was evaluated against commonly used (Binary Relevance, Classifier Chains, Label Powerset) precision, recall, F1-score evaluation metrics, demonstrating promising results in accurately classifying from web. This contribution represents noteworthy advancement researchers analysts working this field.
منابع مشابه
the innovation of a statistical model to estimate dependable rainfall (dr) and develop it for determination and classification of drought and wet years of iran
آب حاصل از بارش منبع تأمین نیازهای بی شمار جانداران به ویژه انسان است و هرگونه کاهش در کم و کیف آن مستقیماً حیات موجودات زنده را تحت تأثیر منفی قرار می دهد. نوسان سال به سال بارش از ویژگی های اساسی و بسیار مهم بارش های سالانه ایران محسوب می شود که آثار زیان بار آن در تمام عرصه های اقتصادی، اجتماعی و حتی سیاسی- امنیتی به نحوی منعکس می شود. چون میزان آب ناشی از بارش یکی از مولفه های اصلی برنامه ...
15 صفحه اولthe investigation of the relationship between type a and type b personalities and quality of translation
چکیده ندارد.
A comparative study of TF*IDF, LSI and multi-words for text classification
One of the main themes in text mining is text representation, which is fundamental and indispensable for text-based intellegent information processing. Generally, text representation inludes two tasks: indexing and weighting. This paper has comparatively studied TF IDF, LSI and multi-word for text representation. We used a Chinese and an English document collection to respectively evaluate the ...
متن کاملinvestigating the feasibility of a proposed model for geometric design of deployable arch structures
deployable scissor type structures are composed of the so-called scissor-like elements (sles), which are connected to each other at an intermediate point through a pivotal connection and allow them to be folded into a compact bundle for storage or transport. several sles are connected to each other in order to form units with regular polygonal plan views. the sides and radii of the polygons are...
assessment of deep word knowledge in elementary and advanced iranian efl learners: a comparison of selective and productive wat tasks
testing plays a vital role in any language teaching program. it allows teachers and stakeholders, including program administrators, parents, admissions officers and prospective employers to be assured that the learners are progressing according to an accepted standard (douglas, 2010). the problems currently facing language testers have both practical and theoretical implications but the first i...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2023
ISSN: ['2158-107X', '2156-5570']
DOI: https://doi.org/10.14569/ijacsa.2023.01407106