Mining Chat-room Conversations for Social and Semantic Interactions
نویسندگان
چکیده
The ephemeral nature of human communication via networks today poses interesting and challenging problems for information technologists. The Intelink intelligence network, for example, has a need to monitor chat-room conversations to ensure the integrity of sensitive data being transmitted via the network. However, the sheer volume of communication in venues such as email, newsgroups, and chat precludes manual techniques of information management. It has been estimated that over 430 million instant messages, for example, are exchanged each day on the America Online network [3]. Although a not insignificant fraction of such data may be temporarily archived (e.g., newsgroups), no systematic mechanisms exist for accumulating these artifacts of communication in a form that lends itself to the construction of models of semantics [12]. In essence, dynamic techniques of analysis are needed if textual data of this nature is to be effectively mined. This article reports our progress in developing a text mining tool for analysis of chat-room conversations. Central to our efforts is the development of functionality to answer questions such as "What topics are being discussed in a chat-room?", "Who is discussing which topics?" and "Who is interacting with whom?" The objective of our research is to develop technology that can automatically identify such patterns of interaction in both social and semantic terms. In this article we report our preliminary findings in identifying threads of conversation in multi-topic, multi-person chat-rooms. We have achieved promising results in terms of precision and recall by employing pattern recognition techniques based on finite state automata. We also report the design of our approach to building models of social and semantic interactions based on our HDDI text mining infrastructure [13].
منابع مشابه
Error-Driven Boolean-Logic-Rule-Based Learning for Mining Chat-room Conversations
The ephemeral nature of human communication via networks today poses interesting and challenging problems for information technologists. The sheer volume of communication in venues such as email, newsgroups, and chat precludes manual techniques of information management. Currently, no systematic mechanisms exist for accumulating these artifacts of communication in a form that lends itself to th...
متن کاملChat mining: Automatically determination of chat conversations' topic in Turkish text based chat mediums
Mostly, the conversations taking place in chat mediums bear important information concerning the speakers. This information can vary in many fields such as tendencies, habits, attitudes, guilt situations, and intentions of the speakers. Therefore, analysis and processing of these conversations are of much importance. Many social and semantic inferences can be made from these conversations. In d...
متن کاملUtterances Assessment in Chat Conversations
With the continuous evolution of collaborative environments, the needs of automatic analyses and assessment of participants in instant messenger conferences (chat) have become essential. For these aims, on one hand, a series of factors based on natural language processing (including lexical analysis and Latent Semantic Analysis) and data-mining have been taken into consideration. On the other h...
متن کاملText Categorization Approach For Chat Room Monitoring
The Internet has been utilized in several real life aspects such as online searching, and chatting. On the other hand, the Internet has been misused in communication of crime related matters. Monitoring of such communication would aid in crime detection or even crime prevention. This paper presents a text categorization approach for automatic monitoring of chat conversations since the current m...
متن کاملMPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
In this paper, we describe our experience with collecting and creating an annotated corpus of multi-party online conversations in a chat-room environment. This effort is part of a larger project to develop computational models of social phenomena such as agenda control, influence, and leadership in on-line interactions. Such models will help capturing the dialogue dynamics that are essential fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002