Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits

Rajaraman, Nived; Han, Yanjun; Jiao, Jiantao; Ramchandran, Kannan

Statistics > Machine Learning

arXiv:2302.06025 (stat)

[Submitted on 12 Feb 2023 (v1), last revised 10 Jan 2024 (this version, v3)]

Title:Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits

Authors:Nived Rajaraman, Yanjun Han, Jiantao Jiao, Kannan Ramchandran

View PDF HTML (experimental)

Abstract:We consider the sequential decision-making problem where the mean outcome is a non-linear function of the chosen action. Compared with the linear model, two curious phenomena arise in non-linear models: first, in addition to the "learning phase" with a standard parametric rate for estimation or regret, there is an "burn-in period" with a fixed cost determined by the non-linear function; second, achieving the smallest burn-in cost requires new exploration algorithms. For a special family of non-linear functions named ridge functions in the literature, we derive upper and lower bounds on the optimal burn-in cost, and in addition, on the entire learning trajectory during the burn-in period via differential equations. In particular, a two-stage algorithm that first finds a good initial action and then treats the problem as locally linear is statistically optimal. In contrast, several classical algorithms, such as UCB and algorithms relying on regression oracles, are provably suboptimal.

Comments:	Revised Section 3 and added an upper bound agnostic to the link function $f$
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:2302.06025 [stat.ML]
	(or arXiv:2302.06025v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2302.06025

Submission history

From: Yanjun Han [view email]
[v1] Sun, 12 Feb 2023 23:20:41 UTC (72 KB)
[v2] Tue, 14 Mar 2023 17:23:52 UTC (74 KB)
[v3] Wed, 10 Jan 2024 02:49:55 UTC (82 KB)

Statistics > Machine Learning

Title:Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators