Subject-based semantic document clustering for digital forensic investigations
نویسندگان
چکیده
Computers are increasingly used as tools to commit crimes such as unauthorized access (hacking), drug trafficking, and child pornography. The proliferation of crimes involving computers has created a demand for special forensic tools that allow investigators to look for evidence on a suspect’s computer by analyzing communications and data on the computer’s storage devices. Motivated by the forensic process at Sûreté du Québec (SQ), the Québec provincial police, we propose a new subject-based semantic document clustering model that allows an investigator to cluster documents stored on a suspect’s computer by grouping them into a set of overlapping clusters, each corresponding to a subject of interest initially defined by the investigator.
منابع مشابه
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملSubject based Clustering for Digital Forensic Investigation with Subject Suggestion
Recently digital forensics has become a prominent activity in crime investigation since computers are increasingly used as tools to commit crimes. During forensic investigation the digital devices such as desktops, notebooks, smart phones etc. found at the crime scene are collected for further investigation. Investigators have to go through humongous amount of data stored on these devices to ga...
متن کاملOptimum Cluster Labeling and Document Clustering for Forensic Analysis
Document clustering or unsupervised document classification is an automated process of grouping documents with similar content. Document clustering is an important task in many Information Retrieval systems. Also document clustering Algorithms can help in discovery of new and useful knowledge or novel class from the documents under analysis. This knowledge or novel class is very important issue...
متن کاملPractical and Legal Challenges of Cloud Investigations
An area presenting new opportunities for both legitimate business, as well as criminal organizations, is Cloud computing. This work gives a strong background in current digital forensic science, as well as a basic understanding of the goal of Law Enforcement when conducting digital forensic investigations. These concepts are then applied to digital forensic investigation of cloud environments i...
متن کاملImplementation of Digital Forensics Investigations Using a Goal-Driven Approach for a Questioned Contract
This paper introduces a new systematic process for describing digital investigations that focuses on forensic goals and anti-forensic obstacles and their operationalisation in terms of human and software actions. The main contribution of the paper is to demonstrate how this process can be used to capture the various forensic and anti-forensic aspects of a real world case study involving documen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Data Knowl. Eng.
دوره 86 شماره
صفحات -
تاریخ انتشار 2013