Late Data Fusion for Microblog Search
نویسندگان
چکیده
The character of microblog environments raises challenges for microblog search because relevancy becomes one of the many aspects for ranking documents. We concentrate on merging multiple ranking strategies at postretrieval time for the TREC Microblog task. We compare several state-of-the-art late data fusion methods, and present a new semi-supervised variant that accounts for microblog characteristics. Our experiments show the utility of late data fusion in microblog search, and that our method helps boost retrieval effectiveness.
منابع مشابه
Burst-aware data fusion for microblog search
We consider the problem of searching posts in microblog environments. We frame this microblog post search problem as a late data fusion problem. Previous work on data fusion has mainly focused on aggregating document lists based on retrieval status values or ranks of documents without fully utilizing temporal features of the set of documents being fused. Additionally, previous work on data fusi...
متن کاملThe Impact of Semantic Document Expansion on Cluster-Based Fusion for Microblog Search
Searching microblog posts, with their limited length and creative language usage, is challenging. We frame the microblog search problem as a data fusion problem. We examine the effectiveness of a recent cluster-based fusion method on the task of retrieving microblog posts. We find that in the optimal setting the contribution of the clustering information is very limited, which we hypothesize to...
متن کاملLearning to Rank Microblog Posts for Real-Time Ad-Hoc Search
Microblogging websites have emerged to the center of information production and diffusion, on which people can get useful information from other users’ microblog posts. In the era of Big Data, we are overwhelmed by the large amount of microblog posts. To make good use of these informative data, an effective search tool is required specialized for microblog posts. However, it is not trivial to d...
متن کاملIncorporating Query Expansion and Quality Indicators in Searching Microblog Posts
We propose a retrieval model for searching microblog posts for a given topic of interest. We develop a language modeling approach tailored to microblogging characteristics, where redundancy-based IR methods cannot be used in a straightforward manner. We enhance this model with two groups of quality indicators: textual and microblog specific. Additionally, we propose a dynamic query expansion mo...
متن کاملUsing Sociological Needs to Characterize Profiles and Contents for Microblog Search
In this work we investigate the issue of modeling users’ sociological needs. We introduce a sociological model approach for Microblog Search based sociological needs Categorization and Opinion Mining from textual content to explain why Vodkaster’s 100,000 users express their opinions and how it can be used for Microblog Search about Festival.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013