Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Covariate-Elaborated Robust Partial Information Transfer with Conditional Spike-and-Slab Prior (2404.03764v2)

Published 30 Mar 2024 in cs.LG, stat.ME, and stat.ML

Abstract: The popularity of transfer learning stems from the fact that it can borrow information from useful auxiliary datasets. Existing statistical transfer learning methods usually adopt a global similarity measure between the source data and the target data, which may lead to inefficiency when only partial information is shared. In this paper, we propose a novel Bayesian transfer learning method named ``CONCERT'' to allow robust partial information transfer for high-dimensional data analysis. A conditional spike-and-slab prior is introduced in the joint distribution of target and source parameters for information transfer. By incorporating covariate-specific priors, we can characterize partial similarities and integrate source information collaboratively to improve the performance on the target. In contrast to existing work, the CONCERT is a one-step procedure, which achieves variable selection and information transfer simultaneously. We establish variable selection consistency, as well as estimation and prediction error bounds for CONCERT. Our theory demonstrates the covariate-specific benefit of transfer learning. To ensure that our algorithm is scalable, we adopt the variational Bayes framework to facilitate implementation. Extensive experiments and two real data applications showcase the validity and advantage of CONCERT over existing cutting-edge transfer learning methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (57)
  1. A penalized complexity prior for deep Bayesian transfer learning with application to materials informatics. The Annals of Applied Statistics, 17(4):3241–3256, 2023.
  2. Efficient variational inference for sparse deep learning with theoretical guarantee. In Advances in Neural Information Processing Systems, volume 33, pp.  466–476. Curran Associates, Inc., 2020.
  3. Spike-and-slab group lassos for grouped regression and sparse generalized additive models. Journal of the American Statistical Association, 117(537):184–197, 2022.
  4. Bishop, C. Pattern recognition and machine learning. Springer New York, 2006.
  5. Scalable spike-and-slab. In International Conference on Machine Learning. PMLR, 2022.
  6. Variational inference: A review for statisticians. Journal of the American Statistical Association, 112(518):859–877, 2017.
  7. Bayesian change point detection with spike-and-slab priors. Journal of Computational and Graphical Statistics, 32(4):1488–1500, 2023.
  8. Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies. Bayesian Analysis, 7(1):73 – 108, 2012.
  9. Bayesian neural multi-source transfer learning. Neurocomputing, 378:54–64, 2020.
  10. Transfer learning in information criteria-based feature selection. The Journal of Machine Learning Research, 23(1):5955–6059, 2022.
  11. Mutual transfer learning for massive data. In International Conference on Machine Learning, pp. 1800–1809. PMLR, 2020.
  12. Chérief-Abdellatif, B.-E. Convergence rates of variational inference in sparse deep learning. In Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pp.  1831–1842. PMLR, 2020.
  13. Fast and scalable spike and slab variable selection in high-dimensional Gaussian processes. In Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, pp.  7976–8002. PMLR, 2022.
  14. Adaptive and robust multi-task learning. The Annals of Statistics, 51(5):2015–2039, 2023.
  15. Variable selection via Gibbs sampling. Journal of the American Statistical Association, 88(423):881–889, 1993.
  16. Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data. Advances in Neural Information Processing Systems, 31, 2018.
  17. Ensembled transfer learning based multichannel attention networks for human activity recognition in still images. IEEE Access, 10:47051–47062, 2022.
  18. Spike and slab variable selection: Frequentist and Bayesian strategies. The Annals of Statistics, 33(2):730 – 773, 2005.
  19. Online Bayesian transfer learning for sequential data modeling. In International Conference on Learning Representations, 2016.
  20. Bayesian transfer learning between uniformly modelled Bayesian filters. In Informatics in Control, Automation and Robotics: 16th International Conference, ICINCO 2019 Prague, Czech Republic, July 29-31, 2019, Revised Selected Papers, pp.  151–168. Springer, 2021.
  21. Optimal Bayesian transfer learning. IEEE Transactions on Signal Processing, 66(14):3724–3739, 2018a.
  22. Optimal Bayesian transfer regression. IEEE Signal Processing Letters, 25(11):1655–1659, 2018b.
  23. Partial disentanglement for domain adaptation. In International conference on machine learning, pp. 11455–11472. PMLR, 2022.
  24. Bayesian group selection in logistic regression with application to MRI data analysis. Biometrics, 77(2):391–400, 2021.
  25. Transfer learning for high-dimensional linear regression: Prediction, estimation and minimax optimality. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 84(1):149–173, 2022.
  26. Transfer learning in large-scale Gaussian graphical models with false discovery rate control. Journal of the American Statistical Association, 118(543):2171–2183, 2023a.
  27. Estimation and inference for high-dimensional generalized linear models with knowledge transfer. Journal of the American Statistical Association, pp.  1–12, 2023b.
  28. On transfer learning in functional linear regression. arXiv preprint arXiv:2206.04277, 2022.
  29. Identifying latent causal content for multi-source domain adaptation. arXiv preprint arXiv:2208.14161, 2022.
  30. Bayesian lesion estimation with a structured spike-and-slab prior. Journal of the American Statistical Association, pp.  1–15, 2023.
  31. Bayesian variable selection with shrinking and diffusing priors. The Annals of Statistics, 42(2):789–817, 2014. ISSN 00905364.
  32. Skinny Gibbs: A consistent and scalable Gibbs sampler for model selection. Journal of the American Statistical Association, 114(527):1205–1217, 2019.
  33. Bayesian bootstrap spike-and-slab lasso. Journal of the American Statistical Association, 118(543):2013–2028, 2023.
  34. A variational Bayes approach to variable selection. Electronic Journal of Statistics, 11(2):3549 – 3594, 2017.
  35. A survey on transfer learning. IEEE Transactions on knowledge and data engineering, 22(10):1345–1359, 2009.
  36. Robust Bayesian transfer learning between Kalman filters. In 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP), pp.  1–6. IEEE, 2019.
  37. Transferring model structure in Bayesian transfer learning for Gaussian process regression. Knowledge-Based Systems, 251:108875, 2022.
  38. Posterior concentration for sparse deep learning. In Advances in Neural Information Processing Systems, volume 31, 2018.
  39. Bayesian inference for logistic models using Pólya–Gamma latent variables. Journal of the American Statistical Association, 108(504):1339–1349, 2013. doi: 10.1080/01621459.2013.829001.
  40. Variational Bayes for high-dimensional linear regression with sparse priors. Journal of the American Statistical Association, 117(539):1270–1281, 2022.
  41. Spike and slab variational Bayes for high dimensional logistic regression. Advances in Neural Information Processing Systems, 33:14423–14434, 2020.
  42. Invariant models for causal transfer learning. Journal of Machine Learning Research, 19(36):1–34, 2018.
  43. The spike-and-slab lasso. Journal of the American Statistical Association, 113(521):431–444, 2018.
  44. Domain adaptation with invariant representation learning: What transformations to learn? Advances in Neural Information Processing Systems, 34:24791–24803, 2021.
  45. A tree-based model averaging approach for personalized treatment effect estimation from heterogeneous data sources. In International Conference on Machine Learning, pp. 21013–21036. PMLR, 2022.
  46. Transfer learning under high-dimensional generalized linear models. Journal of the American Statistical Association, 118(544):2684–2697, 2023.
  47. Tibshirani, R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 58(1):267–288, 1996.
  48. On the theory of transfer learning: The importance of task diversity. Advances in neural information processing systems, 33:7852–7862, 2020.
  49. Transfer learning approaches to improve drug sensitivity prediction in multiple myeloma patients. IEEE Access, 5:7381–7393, 2017.
  50. Polymorphism of CD1 and SH2D2A genes in inflammatory neuropathies. Journal of the Peripheral Nervous System, 16(s1):48–51, 2011.
  51. Transfer learning for dynamic feature extraction using variational Bayesian inference. IEEE Transactions on Knowledge and Data Engineering, 34(11):5524–5535, 2021.
  52. A difference standardization method for mutual transfer learning. In Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pp.  24683–24697. PMLR, 2022.
  53. On the computational complexity of high-dimensional Bayesian variable selection. The Annals of Statistics, 44(6):2497 – 2532, 2016. doi: 10.1214/15-AOS1417. URL https://doi.org/10.1214/15-AOS1417.
  54. Transfer learning for medical images analyses: A survey. Neurocomputing, 489:230–254, 2022.
  55. Bayesian image-on-scalar regression with a spatial global-local spike-and-slab prior. Bayesian Analysis, pp.  1 – 26, 2022.
  56. A survey on negative transfer. IEEE/CAA Journal of Automatica Sinica, 10(2):305–329, 2023.
  57. Transfer learning for high-dimensional quantile regression via convolution smoothing. arXiv preprint arXiv:2212.00428, 2022.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com