University of Southern California

Title: Competitive Collaborative Learning

Abstract:

Systems in which users must make a sequence of decisions guided by information from potentially dishonest peers are ubiquitous in contemporary networked applications. (Examples include reputation and recommendation systems in e-commerce, and resource location systems in peer-to-peer networks.) We will discuss a multi-user learning problem which abstracts the sequential decision problems faced by users of such systems when selecting products or resources. Our model is characterized by three key features: (a) The quality of the products or resources may vary over time; (b) Some of the users in the system may be malicious, Byzantine agents; (c) The honest users are willing to obey a specified protocol when making and reporting their selections. We formulate the problem as a multi-user version of the multi-armed bandit problem from online learning theory. After reviewing the classical multi-armed bandit problem, we will exhibit an algorithm for the multi-user version which guarantees that the average quality of the honest users' decisions converges, in polylogarithmic time, to the average quality of the best fixed decision. Relations with other existing algorithms for reputation management and collaborative filtering will be discussed. This is joint work with Baruch Awerbuch.

Bio:

Robert D. Kleinberg is Assistant Professor in the Computer Science Department at Cornell University. Supported by an NSF Mathematical Sciences Postdoctoral Research Fellowship, currently he is a post-doc in the Computer Science Department at UC, Berkeley. He received his PhD from the Mathematics Department at MIT. He has also worked for Akamai Technologies in the past.