Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Active Learning for Ranking from Noisy Observations

Abstract Details

2021, Doctor of Philosophy, Ohio State University, Computer Science and Engineering.
Ranking is a fundamental problem that has been extensively studied in the machine learning community due to its broad applicability to different application areas such as recommendation, web searching, social choices, and crowd sourcing. The problems of ranking focus on finding the full ranking or partial rankings (e.g., top-$k$ selection) of a set of items from noisy observations. The items may refer to choices, movies, pages, and advertisements and the observations may refer to the queries about the users' preferences on these items. In many systems, the observations can be done in an active (or adaptive) manner, i.e., for each observation, the learner can adaptively choose items to observe according to past observations. An interesting problem is to understand the number of observations needed (aka sample complexity) for finding the ranking under the active setting. In this dissertation, we present new algorithms and derive new sample complexity upper and lower bounds that improve the state-of-the-art for three significant and fundamental problems in this topic. This dissertation explores the following problems: I. Exploring top-fraction arms in stochastic bandits. The stochastic multi-armed bandit (MAB) model is a classical online learning model for studying sequential decision making under uncertainty. In an MAB model, there are multiple arms (each refers to an action or an item), and each sample (aka pull) of an arm returns an independent reward according to this arm's unknown latent distribution. The first problem is to find $k$ distinct arms that are approximately within the top-$\rho$ fraction with respect to the mean rewards of the arms with error tolerance $\epsilon$ and confidence $1-\delta$. We develop algorithms and analyze the sample complexity lower and upper bounds for four variations (the number of arms is infinite or finite; the threshold of the top-$\rho$ fraction is known or unknown). The upper bounds and lower bounds of all four variations are optimal up to at most logarithmic factors. II. Exact full ranking from noisy comparisons. The problem we investigate is to find the exact full ranking of a set of items from noisy comparisons with confidence $1-\delta$, where a comparison is a process over two (pairwise comparisons) or multiple (multi-wise comparisons) items that outputs a noisy result about the (most) preferred one. We only make three standard or mild assumptions: (i) comparisons are independent across time and items; (ii) there is a strict order of these items; and (iii) the items satisfy the weak stochastic transitivity. We prove a nearly optimal sample complexity lower bound for this problem, and develop an algorithm whose sample complexity upper bound matches the lower bound (up to a constant factor) under mild conditions. III. Top-$k$ items selection from pairwise comparisons. In this problem, we want to study how to find the top-$k$ items from noisy pairwise comparisons under the strong stochastic transitivity and the stochastic triangle inequality. We study two cases: i) finding the probably approximately correct (PAC) top-$k$ items and ii) finding the exact top-$k$ items. For both cases, we develop algorithms and derive new lower bounds. For the PAC top-$k$ selection, our upper and lower bounds are optimal up to a constant factor. For the exact top-$k$ selection, our upper and lower bounds are optimal up to a loglog factor for $k = 1$ and optimal up to a log factor for $k > 1$.
Ness Shroff (Advisor)
Jia Liu (Advisor)
Raef Bassily (Committee Member)
Shaileshh Bojja Venkatakrishnan (Committee Member)
214 p.

Recommended Citations

Citations

  • Ren, W. (2021). Active Learning for Ranking from Noisy Observations [Doctoral dissertation, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu162740833871401

    APA Style (7th edition)

  • Ren, Wenbo. Active Learning for Ranking from Noisy Observations. 2021. Ohio State University, Doctoral dissertation. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=osu162740833871401.

    MLA Style (8th edition)

  • Ren, Wenbo. "Active Learning for Ranking from Noisy Observations." Doctoral dissertation, Ohio State University, 2021. http://rave.ohiolink.edu/etdc/view?acc_num=osu162740833871401

    Chicago Manual of Style (17th edition)