نتایج جستجو برای: term frequency and inverse document frequency tf idf

تعداد نتایج: 16977020  

Journal: :Device: Jurnal Ilmiah Komputer dan Teknologi 2023

Cyberbullying is the act of sending text, images, or videos using internet, mobile phones, other devices with aim hurting and shaming people. often done through several social media platforms, one which comments on TikTok application. According to a report by We Are Social, has 1.4 billion monthly active users aged 18 above globally. Indonesia currently ranks second in world terms users. As res...

2013
P. Parthasarathi

Document clustering is the act of collecting similar documents into clusters, where similarity is some function on a document. Document clustering method achieves 1) a high accuracy for documents 2) document frequency can be calculated 3) term weight is calculated with the term frequency vector. Document clustering is closely related to the concept of data clustering. Document clustering is a m...

Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

Journal: :Journal of Information Technology 2022

Intisari—Dengan kemajuan teknologi saat ini seluruh informasi tentang semua film sudah tersedia di Internet. Jika dikelola dengan baik maka dapat memberikan manfaat berupa yang berguna untuk membantu individu atau organisasi mengambil keputusan. Penelitian bertujuan menjelaskan analisis sentimen pada dokumen film. Metode digunakan penelitian adalah TF-IDF (Term Frequency-Inverse Document Freque...

2015
Mohsen Kakavand Norwati Mustapha Aida Mustapha MohdTaufik Abdullah Mohd Taufik Abdullah

Anomaly detection systems are extensively used security tools to detect cyber-threats and attack activities in computer systems and networks. In this paper, we present Text Mining-Based Anomaly Detection (TMAD) model. We discuss n-gram text categorization and focus our attention on a main contribution of method TF-IDF (Term frequency, inverse document frequency), which enhance the performance c...

Journal: :Daehanhanuihakoeji 2022

Objectives: In the health care industry, influence of online reviews is growing. As medical services are provided mainly by providers, those have been managed hospitals and clinics. However, direct promotions providers legally forbidden. Due to this reason, consumers, like patients clients, search a lot on Internet get any information about hospitals, treatments, prices, etc. It can be determin...

Journal: :Applied Soft Computing 2023

Spam emails are unsolicited, annoying and sometimes harmful messages which may contain malware, phishing or hoaxes. Unlike most studies that address the design of efficient anti-spam filters, we approach spam email problem from a different novel perspective. Focusing on needs cybersecurity units, follow topic-based for addressing classification into multiple categories. We propose SPEMC-15K-E S...

Journal: :Jurnal sistem dan teknologi informasi 2023

Clickbait merupakan judul berita yang bombastis dan memberikan informasi tidak utuh sehingga membuat pembaca penasaran ingin tahu dengan cara mengklik tautan berita. Penggunaan clickbait terkadang bersifat menjebak karena dari artikel tersebut utuh. Hal menyebabkan kesimpulan didapat isi sesuai. Sehingga perlu dilakukan penelitian untuk mengklasifikasi termasuk atau bukan. Penelitian ini menggu...

2012
Bing-Han Tsai Yu-Zheng Liu Wen-Juan Hou

The paper presents the experiments carried out as part of the participation in the pilot task of Biomedical about Alzheimer for QA4MRE at CLEF 2012. We have submitted total five unique runs in the pilot task. One run uses Term Frequency (TF) of the query words to weight the sentence. Two runs use Term Frequency-Inverted Document Frequency (TF-IDF) of the query words to weight the sentences. The...

2014
Aymen Abu-errub R. Guzmán-Cabrera M. Montes-y-Gómez P. Rosso A. H. Wahbeh T. Zaki D. Mammass A. Ennaji

Text categorization is the process of classifying documents into a predefined set of categories based on its contents of keywords. Text classification is an extended type of text categorization where the text is further categorized into sub-categories. Many algorithms have been proposed and implemented to solve the problem of English text categorization and classification. However, few studies ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید