Revisiting Negative Sampling vs. Non-sampling in Implicit Recommendation

نویسندگان

چکیده

Recommendation systems play an important role in alleviating the information overload issue. Generally, a recommendation model is trained to discern between positive (liked) and negative (disliked) instances for each user. However, under open-world assumption, there are only but no from users’ implicit feedback, which poses imbalanced learning challenge of lacking samples. To address this, two types strategies have been proposed before, sampling strategy non-sampling strategy. The first samples missing data (i.e., unlabeled data), while regards all as negative. Although known be essential algorithm performance, in-depth comparison has not sufficiently explored by far. bridge this gap, we systematically analyze work. Specifically, theoretically revisit objection non-sampling. Then, with careful setup various representative methods, explore performance different scenarios. Our results empirically show that although widely applied recent models, it non-trivial uniform methods comparable methods. Finally, discuss scalability complexity present some open problems future research topics worth being further explored.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Revisiting Skip-Gram Negative Sampling Model with Regularization

We revisit skip-gram negative sampling (SGNS), a popular neural-network based approach to learning distributed word representation. We first point out the ambiguity issue undermining the SGNSmodel, in the sense that the word vectors can be entirely distorted without changing the objective value. To resolve this issue, we rectify the SGNSmodel with quadratic regularization. A theoretical justifi...

متن کامل

Warped Frames: Dispersive vs. Non–dispersive Sampling

Conventional Time-Frequency and Time-Scale Representations are often too rigid to capture fine details of sound or musical signals. Adaptation of ideal time-frequency tilings is often desirable in order to represent the signal in terms of components that are meaningful from a physical or perceptual point of view. Remapping of the time and frequency axes by means of time and frequency warping ca...

متن کامل

Machine Learning and Non-Negative Compressive Sampling

The new emerging theory of compressive sampling demonstrates that by exploiting the structure of a signal, it is possible to sample a signal below the Nyquist rate—using random projections—and achieve perfect reconstruction. In this paper, we consider a special case of compressive sampling where the uncompressed signal is non-negative, and propose a number of sparse recovery algorithms—which ut...

متن کامل

Negative dependence in sampling

The strong Rayleigh property is a new and robust negative dependence property that implies negative association; in fact it implies conditional negative association closed under external fields (CNA+). Suppose thatX1, . . . , XN and Y1, . . . , YN are two families of 0-1 random variables that satisfy the strong Rayleigh property and let Zi := Xi + Yi. We show that {Zi} conditioned on ∑ iXiYi = ...

متن کامل

Bipartite Graph Sampling Methods for Sampling Recommendation Data

Sampling is the common practice involved in academic and industry efforts on recommendation algorithm evaluation and selection. Experimental analysis often uses a subset of the entire useritem interaction data available in the operational recommender system, often derived by including all transactions associated with a subset of uniformly randomly selected users. Our paper formally studies the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Information Systems

سال: 2023

ISSN: ['1558-1152', '1558-2868', '1046-8188', '0734-2047']

DOI: https://doi.org/10.1145/3522672