Adaptive transfer learning

Bibliographic Details
Published in:	The Annals of Statistics, Vol. 49, no. 6, p. 3618
Main Authors:	Reeve, Henry W. J.; Cannings, Timothy I.; Samworth, Richard J.
Format:	Journal Article
Language:	English
Published:	Hayward: Institute of Mathematical Statistics, 01-12-2021
Subjects:	Algorithms; Bayesian analysis; Decision trees; Design specifications; Learning; Minimax technique; Optimization; Smoothness

Description
Summary:	In transfer learning, we wish to make inference about a target population when we have access to data both from the distribution itself, and from a different but related source distribution. We introduce a flexible framework for transfer learning in the context of binary classification, allowing for covariate-dependent relationships between the source and target distributions that are not required to preserve the Bayes decision boundary. Our main contributions are to derive the minimax optimal rates of convergence (up to poly-logarithmic factors) in this problem, and show that the optimal rate can be achieved by an algorithm that adapts to key aspects of the unknown transfer relationship, as well as the smoothness and tail parameters of our distributional classes. This optimal rate turns out to have several regimes, depending on the interplay between the relative sample sizes and the strength of the transfer relationship, and our algorithm achieves optimality by careful, decision tree-based calibration of local nearest-neighbour procedures.
ISSN:	0090-5364, 2168-8966
DOI:	10.1214/21-AOS2102