Evaluation Metrics Trade-off for Recommendation Systems

ثبت نشده
چکیده

Following the famous Netflix Prize competition, accuracy metrics (such as RMSE) became a de facto standard in the community to evaluate and compare performance of recommendation algorithms. However, it has been shown, that accuracy is not the best way to quantify the quality of recommendations, especially in the context of predicting online performance from offline evaluations [1]. Thus, it makes sense to look at other metrics, such as novelty, coverage, or serendipity, and in particular, trade-off between these metrics [2]. In this project, a student will work with large news recommendation datasets, collected online on swissinfo.ch and lepoint.fr. The goal would be to implement a number of recommender algorithms with tunable hyperparameters, and generate trade-offs between different evaluation metrics by varying these hyperparametrs. A student will then analyse these trade-offs by plotting trade-off curves, comparing them for various algorithms, examining the area under the curves, and possibly looking at their impact on CTR and online success rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A multi-objective resource-constrained optimization of time-cost trade-off problems in scheduling project

This paper presents a multi-objective resource-constrained project scheduling problem with positive and negative cash flows. The net present value (NPV) maximization and making span minimization are this study objectives. And since this problem is considered as complex optimization in NP-Hard context, we present a mathematical model for the given problem and solve three evolutionary algorithms;...

متن کامل

Optimizing the Recency-Relevancy Trade-off in Online News Recommendations

Online news media sites are emerging as the primary source of news for a large number of users. The selection of ‘frontpage’ stories on these media sites usually takes into consideration several crowdsourced popularity metrics, such as number of views or shares by the readers. In this work, we focus on automatically recommending front-page stories in such media websites. When recommending news ...

متن کامل

Evaluating the effectiveness of explanations for recommender systems Methodological issues and empirical studies on the impact of personalization

When recommender systems present items, these can be accompanied by explanatory information. Such explanations can serve seven aims: effectiveness, satisfaction, transparency, scrutability, trust, persuasiveness, and efficiency. These aims can be incompatible, so any evaluation needs to state which aim is being investigated and use appropriate metrics. This paper focuses particularly on effecti...

متن کامل

Assessing the Trade-Off between System Building Cost and Output Quality in Data-to-Text Generation

Data-to-text generation systems tend to be knowledge-based and manually built, which limits their reusability and makes them time and cost-intensive to create and maintain. Methods for automating (part of) the system building process exist, but do such methods risk a loss in output quality? In this paper, we investigate the cost/quality trade-off in generation system building. We compare six da...

متن کامل

Dimensions and Metrics for Evaluating Recommendation Systems

recommendation systems support users and developers of various computer and software systems to overcome information overload, perform information discovery tasks and approximate computation, among others. They have recently become popular and have attracted a wide variety of application scenarios from business process modelling to source code manipulation. Due to this wide variety of applicati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015