Web Based Parallel/Distributed Medical Data Mining Using Software Agents
نویسندگان
چکیده
Using Software Agents Hillol Kargupta, Brian Sta ord, and Ilker Hamzaoglu Computational Science Methods Group X Division, Los Alamos National Laboratory P.O. Box 1663, MS F645, Los Alamos, NM, 87545 This paper describes an experimental parallel/distributed data mining system PADMA (PArallel Data Mining Agents) that uses software agents for local data accessing and analysis and a web based interface for interactive data visualization. It also presents the results of applying PADMA for detecting patterns in unstructured texts of postmortem reports and laboratory test data for Hepatitis C patients. Introduction Data mining involves extraction, transformation, and presentation of data in useful form. As we move more and more toward a paper-less society, each of these components of data mining is likely to face the challenges of dealing with large volume of data and the very distributed nature of the data storage and computing environments. Medical databases are often ideal candidates for large scale, possibly distributed data mining applications. In this paper we describe an experimental software agent based system for parallel/distributed data miningPADMA (PArallel Data Mining Agents). PADMA is characterized by agent based distributed data accessing, distributed data analysis, and web based interactive data visualization. This paper also presents results of applying PADMA in medical databases. Section presents a general overview of the PADMA system. The parallel relational database accessing operations of PADMA agents are described in Section . Section describes the data analysis capabilities of the agents. Section describes the web-based Disk Disk DM DM
منابع مشابه
Privacy-preserving agent-based distributed data clustering
A growing number of applications in distributed environment involve very large data sets that are inherently distributed among a large number of autonomous sources over a network. The demand to extend data mining technology to such distributed data sets has motivated the development of several approaches to distributed data mining and knowledge discovery, of which only a few make use of agents....
متن کاملA Distributed Framework for NLP-Based Keyword and Keyphrase Extraction From Web Pages and Documents
The recent growth of the World Wide Web at increasing rate and speed and the number of online available resources populating Internet represent a massive source of knowledge for various research and business interests. Such knowledge is, for the most part, embedded in the textual content of web pages and documents, which is largely represented as unstructured natural language formats. In order ...
متن کاملScalable, Distributed Data Mining - An Agent Architecture
Scalability determines the potential in distributing h&h rlata anrl rnmnrlt,af.inn in cln.+n. mining. “.,“.I . . ..Avw -_..a “.,--I..v-“e-^--2-----o. The PADMA (PArallel Data Mining Agents) architecture will be described, along with experiments on text to address scalability. PADMA agents offer parallel data access, and hierarchical clustering, with results visualized through a JAVA web-interface.
متن کاملMapReduce K-Means based Co-Clustering Approach for Web Page Recommendation System
Co-clustering is one of the data mining techniques used for web usage mining. Co-clustering Web log data is the process of simultaneous categorization of both users and pages. It is used to extract the users’ information based on subset of pages. Nowadays, the cyberspace is filled with huge volume of data distributed across the world. The business knowledge acquaintance from such a voluminous d...
متن کاملThe Adaptability of Conventional Data Mining Algorithms through Intelligent Mobile Agents in Modern Distributed Systems
Intelligent mobile agents are today accepted as powerful tools for data mining in a distributed environment. The use of data mining algorithms further beefs up the intelligence in software agents. Knowledge discovery and data mining algorithms are applied to discover hidden patterns and relations in complex datasets using intelligent agents. The distributed computing provides remote communicati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997