All Blogs Are Not Made Equal: Exploring Genre Differences in Sentiment Tagging of Blogs
نویسندگان
چکیده
One of the essential characteristics of blogs is their subjectivity, which makes blogs a particularly interesting domain for research on automatic sentiment determination. In this paper, we explore the properties of two most common subgenres of blogs – personal diaries and “notebooks” – and the effects that these properties have on performance of an automatic sentiment annotation system, which we developed for binary (positive vs. negative) and ternary (positive vs. negative vs. neutral) classification of sentiment at the sentence level. We also investigate the differential effect of inclusion of negations and other valence shifters on the performance of our system on these two subgenres of blogs.
منابع مشابه
Exploring the Use of Linguistic Features in Sentiment Analysis
In this paper we describe some explorations of the potential of genre-revealing features on automatic sentiment analysis. In particular, we use a small subset of the ‘linguistic facets’ employed in recent experiments on automatic genre identification in combination with more traditional sentiment-revealing features on two different single-genre corpora: a corpus of English blogs and a corpus of...
متن کاملBlogvox2: A Modular Domain Independent Sentiment Analysis System
Title of Thesis: Blogvox2: A Modular Domain Independent Sentiment Analysis System. Sandeep Balijepalli, Masters of Science, 2007 Thesis directed by: Dr. Tim Finin, Professor Department of Computer Science and Electrical Engineering Bloggers make a huge impact on society by representing and influencing the people. Blogging by nature is about expressing and listening to opinion. Good sentiment de...
متن کاملScholarly blogging practice as situated genre: an analytical framework based on genre theory
Introduction. Examines how an analytical framework of situated genre analysis can be used to study how research blogs are constructed and used as tools in scholarly communication. Method. A framework was extracted from genre research theories consisting of four concepts: aim, form, content and context. The term situated genre was used to focus on social practices. The context was further elabor...
متن کاملBlogHarvest: Blog Mining and Search Framework
Beyond serving as online diaries, weblogs have evolved into complex social structures. Blogging software allows users to publish opinions on any topic without any constraints on the predefined schema. Analysis of linkage between blogs has indicated that community forming in blogosphere is not a random process but is a result of shared interests binding bloggers together. Learning, analysis and ...
متن کاملCoreference Resolution on Blogs and Commented News
We focus on automatic coreference resolution for blogs and news articles with user comments as part of a project on opinion mining. We aim to study the effect of the genre shift from edited structured newspaper text to unedited, unstructured blog data. We compare our coreference resolution system on three data sets: newspaper articles, mixed newspaper articles and reader comments, and blog data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007