Estimating Labels from Label Proportions (2008)

Authors

Abstract

Consider the following problem: given several sets of unlabeled observations, each set with known label proportions, predict the labels of another set of observations, also with known label proportions. This problem appears in areas like e-commerce, spam filtering, and improper content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform convergence sense. Experiments show that our method works well in practice.

Discussion

John Langford, 2008/07/09 01:15

The motivating application of giving coupons or not to people can be phrased as an offline binary contextual bandit problem. Here, the “binary” means that a policy has two actions available: give coupon or not. “offline” is because this data has been recorded, so the explore/exploit tradeoff isn't there and the goal is simply getting a good policy.

Enter your comment (wiki syntax is allowed):
BHEXI
 
paper/2008/337.txt · Last modified: 2009/05/24 18:48 (external edit)
 
Driven by DokuWiki