Papers
Topics
Authors
Recent
Search
2000 character limit reached

Unbounded Density Ratio Estimation and Its Application to Covariate Shift Adaptation

Published 31 Mar 2026 in stat.ML and cs.LG | (2603.29725v1)

Abstract: This paper focuses on the problem of unbounded density ratio estimation -- an understudied yet critical challenge in statistical learning -- and its application to covariate shift adaptation. Much of the existing literature assumes that the density ratio is either uniformly bounded or unbounded but known exactly. These conditions are often violated in practice, creating a gap between theoretical guarantees and real-world applicability. In contrast, this work directly addresses unbounded density ratios and integrates them into importance weighting for effective covariate shift adaptation. We propose a three-step estimation method that leverages unlabeled data from both the source and target distributions: (1) estimating a relative density ratio; (2) applying a truncation operation to control its unboundedness; and (3) transforming the truncated estimate back into the standard density ratio. The estimated density ratio is then employed as importance weights for regression under covariate shift. We establish rigorous, non-asymptotic convergence guarantees for both the proposed density ratio estimator and the resulting regression function estimator, demonstrating optimal or near-optimal convergence rates. Our findings offer new theoretical insights into density ratio estimation and learning under covariate shift, extending classical learning theory to more practical and challenging scenarios.

Summary

  • The paper introduces a three-step pipeline—relative ratio estimation, truncation, and recovery—to reliably estimate unbounded density ratios in covariate shift scenarios.
  • It proposes a bounded surrogate via relative density ratio estimation that stabilizes importance weighting and mitigates high variance in heavy-tailed contexts.
  • Theoretical analysis provides non-asymptotic, high-probability convergence rates for kernel regression, clarifying the sample complexity needed for effective adaptation.

Unbounded Density Ratio Estimation for Covariate Shift: Theory and Algorithms

Introduction

The paper "Unbounded Density Ratio Estimation and Its Application to Covariate Shift Adaptation" (2603.29725) tackles a critical problem in transfer learning: handling unbounded density ratios in covariate shift scenarios. Covariate shift occurs when the input distributions for training (source) and test (target) domains differ, but the output conditional on input remains the same. The standard approach to covariate shift, importance weighting, requires estimation of the density ratio θ(x)=dQ/dP\theta(x) = dQ/dP, where PP and QQ denote the source and target input distributions, respectively. Most existing theoretical results and algorithms rely on the (often violated) assumption that θ\theta is bounded, undermining their applicability to high-dimensional and heavy-tailed settings encountered in practice.

This paper designs a robust procedure for unbounded density ratio estimation and provides a complete finite-sample, non-asymptotic convergence analysis of both the density ratio estimation and its application to weighted regression under covariate shift.

Problem Formulation and Motivating Example

Suppose we observe labeled training data (xi,yi)(x_i, y_i) i.i.d. from the source distribution PX×YP_{X\times Y} and wish to estimate the regression function for the target distribution QX×YQ_{X\times Y}, under covariate shift: PY∣X=QY∣XP_{Y|X} = Q_{Y|X}, PX≠QXP_X \neq Q_X. The target risk is

E(x,y)∼Q[(y−f(x))2].\mathbb{E}_{(x, y) \sim Q}[(y - f(x))^2].

In the presence of covariate shift, the minimization of the empirical risk estimator under PP0 becomes biased. The standard correction employs importance weighting:

PP1

Crucially, when PP2 is unbounded (e.g., with log-concave or heavy-tailed targets), both the theoretical risk and its estimation become challenging.

Theoretical Challenges in Unbounded Density Ratio Estimation

Most prior work either (i) assumes PP3 is uniformly bounded or (ii) restricts to truncated or known forms, both unrealistic for modern applications. The key technical issue is that when PP4 is unbounded, importance weights exhibit high variance, and the standard machinery for learning in reproducing kernel Hilbert spaces (RKHS) fails since bounded functional classes cannot express unbounded densities directly.

To overcome this, the paper introduces a surrogate: the relative density ratio PP5, which is always bounded. This is defined with respect to a mixture distribution PP6 by

PP7

PP8 is always in PP9, and QQ0 can be recovered via a nonlinear transformation of QQ1. Figure 1

Figure 1

Figure 1: The standard density ratio QQ2 (unbounded) and the relative density ratio QQ3 (bounded) for illustrative pairs of QQ4, QQ5.

Proposed Three-Step Estimation Procedure

The paper proposes a robust three-step estimator for QQ6:

  1. Relative Ratio Estimation: Use empirical kernel mean matching to estimate the bounded relative ratio QQ7 in RKHS.
  2. Truncation: Apply lower and upper truncation to the estimate, ensuring QQ8 remains in a feasible interval.
  3. Recovery Step: Transform the truncated estimate back to the original scale to obtain an estimator QQ9 for the unbounded density ratio.

This approach enables stable kernel methods to estimate θ\theta0 even when the true ratio is unbounded.

Statistical Guarantees and Algorithmic Framework

Assumptions

  • The density ratio θ\theta1 possesses finite θ\theta2-th moment with respect to θ\theta3 (θ\theta4).
  • The regression function and relative ratio both satisfy standard source conditions (smoothness/qualification) with respect to the chosen kernel and mixture/target measure.

Main Results

  • Relative Ratio Estimation: The estimator θ\theta5 achieves optimal non-asymptotic minimax rates in θ\theta6. The key rate for θ\theta7 unlabeled samples is θ\theta8 where θ\theta9 quantifies regularity.
  • Density Ratio Recovery: The error in (xi,yi)(x_i, y_i)0 translates to the error in (xi,yi)(x_i, y_i)1 (in an (xi,yi)(x_i, y_i)2 sense), with an additional dependence on the truncation parameter (xi,yi)(x_i, y_i)3 and moment (xi,yi)(x_i, y_i)4 of (xi,yi)(x_i, y_i)5.
  • Regression under Covariate Shift: When (xi,yi)(x_i, y_i)6 are (xi,yi)(x_i, y_i)7 labeled samples and (xi,yi)(x_i, y_i)8 unlabeled samples ((xi,yi)(x_i, y_i)9), the kernel ridge regression (KRR) solution using estimated importance weights PX×YP_{X\times Y}0 enjoys non-asymptotic high-probability convergence rates in both the RKHS and PX×YP_{X\times Y}1 norms. Near-optimal rates are possible as long as PX×YP_{X\times Y}2 grows polynomially in PX×YP_{X\times Y}3.

These theorems precisely quantify how the estimation of an unbounded importance function cascades into the excess risk for target distribution regression.

Methodological and Theoretical Insights

  • Implicit Debiasing: By operating in the "relative" space, the method regularizes the ill-posedness, but then must control for the bias introduced by truncation and the nonlinear recovery step.
  • Sample Complexity: Achieving minimax-optimal convergence rates for the target regression function is contingent upon the unlabeled sample size scaling polynomially with the labeled sample size, highlighting the intrinsic difficulty of density ratio estimation relative to regression under covariate shift.
  • Spectral Algorithm Generalization: The analysis encompasses KRR as a special case but applies broadly to general spectral regularization algorithms, allowing flexibility in practical implementation.

Comparison to Prior Work

Earlier works on density ratio estimation are typically restricted to bounded ratios or implicitly regularize by enforcing boundedness in RKHS estimators. Classifier-based and neural methods exist for unbounded ratios, but theoretical support is often limited to parametric settings or expectation-based (not concentration) guarantees. Recent algorithms which use truncation schemes either lack precise characterization of the bias or rely on increasingly complex function classes (e.g., local Hölder).

In contrast, this work's relative ratio framework and full non-asymptotic analysis show that unbounded ratios can be robustly and efficiently estimated via this translation–truncation–recovery pipeline, yielding high-probability risk bounds.

Practical and Theoretical Implications

  • Practicality: The method enables kernel-based covariate shift regression even in the presence of heavy-tailed or sharply peaked PX×YP_{X\times Y}4 ratios—scenarios prevalent in NLP, genomics, climate modeling, and other domains with significant label shift.
  • Theory: The optimality statements extend classical learning theory to the unbounded ratio regime and inform the design of robust adaptation strategies in underdetermined limits.

Limitations and Directions for Future Research

  • Misspecification: The theoretical analysis requires that the regression function lies within the RKHS. Future work should consider the case PX×YP_{X\times Y}5 (model misspecification).
  • Sharper Capacity Measures: By leveraging notions such as effective dimension or embedding index, it may be possible to obtain sharper minimax rates.
  • Debiasing: Addressing the accumulative bias from regularization, truncation, and the nonlinear transformation could further improve performance and suggest new estimation strategies.

Conclusion

This paper provides a principled, statistically sound methodology for unbounded density ratio estimation, enabling effective covariate shift adaptation with rigorous non-asymptotic guarantees. The approach—via relative ratio estimation and careful transfer of error bounds—overcomes longstanding obstacles in applying importance weighting under realistic, heavy-tailed regimes. The quantified relationship between sample complexity and convergence elucidates the fundamental gap between importance weight estimation and regression, providing actionable guidance for data collection and methodological choices in transfer learning.


Reference

  • "Unbounded Density Ratio Estimation and Its Application to Covariate Shift Adaptation" (2603.29725)

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.