TaskMet: Task-Driven Metric Learning for Model Learning (2312.05250v2)
Abstract: Deep learning models are often deployed in downstream tasks that the training procedure may not be aware of. For example, models solely trained to achieve accurate predictions may struggle to perform well on downstream tasks because seemingly small prediction errors may incur drastic task errors. The standard end-to-end learning approach is to make the task loss differentiable or to introduce a differentiable surrogate that the model can be trained on. In these settings, the task loss needs to be carefully balanced with the prediction loss because they may have conflicting objectives. We propose take the task loss signal one level deeper than the parameters of the model and use it to learn the parameters of the loss function the model is trained on, which can be done by learning a metric in the prediction space. This approach does not alter the optimal prediction model itself, but rather changes the model learning to emphasize the information important for the downstream task. This enables us to achieve the best of both worlds: a prediction model trained in the original prediction space while also being valuable for the desired downstream task. We validate our approach through experiments conducted in two main settings: 1) decision-focused model learning scenarios involving portfolio optimization and budget allocation, and 2) reinforcement learning in noisy environments with distracting states. The source code to reproduce our experiments is available at https://github.com/facebookresearch/taskmet
- Differentiable convex optimization layers. Advances in neural information processing systems, 32, 2019.
- Differentiable mpc for end-to-end planning and control. Advances in neural information processing systems, 31, 2018.
- Anonymous. Predict-then-optimize via learning to optimize from features. In Submitted to The Twelfth International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=jvOvJ3XSjK. under review.
- Goal-driven dynamics learning via bayesian optimization. In 2017 IEEE 56th Annual Conference on Decision and Control (CDC), pages 5168–5173. IEEE, 2017.
- Yoshua Bengio. Using a financial training criterion rather than a prediction criterion. International journal of neural systems, 8(04):433–443, 1997.
- Differentiable gaussian process motion planning. In 2020 IEEE international conference on robotics and automation (ICRA), pages 10598–10604. IEEE, 2020.
- Efficient and modular implicit differentiation. Advances in neural information processing systems, 35:5230–5242, 2022.
- Decision-aware learning for optimizing health supply chains. arXiv preprint arXiv:2211.08507, 2022.
- Ulisse Dini. Analisi infinitesimale. Lithografia Gorani, 1878.
- Implicit functions and solution mappings, volume 543. Springer, 2009.
- Task-based end-to-end model learning in stochastic optimization. Advances in neural information processing systems, 30, 2017.
- Generalization bounds in the predict-then-optimize framework. Advances in neural information processing systems, 32, 2019.
- Smart “predict, then optimize”. Management Science, 68(1):9–26, 2022.
- Amir-massoud Farahmand. Iterative value-aware model learning. Advances in Neural Information Processing Systems, 31, 2018.
- Value-aware loss function for model-based reinforcement learning. In Artificial Intelligence and Statistics, pages 1486–1494. PMLR, 2017.
- Surco: Learning linear surrogates for combinatorial nonlinear optimization problems. In International Conference on Machine Learning, pages 10034–10052. PMLR, 2023.
- Dynamically weighted balanced loss: class imbalanced learning and confidence calibration of deep neural networks. IEEE Transactions on Neural Networks and Learning Systems, 33(7):2940–2951, 2021.
- Popcorn: Partially observed prediction constrained reinforcement learning. arXiv preprint arXiv:2001.04032, 2020.
- Rishabh Gupta and Qi Zhang. Data-driven decision-focused surrogate modeling. arXiv preprint arXiv:2308.12161, 2023.
- Dream to control: Learning behaviors by latent imagination. arXiv preprint arXiv:1912.01603, 2019a.
- Learning latent dynamics for planning from pixels. In International conference on machine learning, pages 2555–2565. PMLR, 2019b.
- Temporal difference learning for model predictive control. arXiv preprint arXiv:2203.04955, 2022.
- Discriminant adaptive nearest neighbor classification and regression. Advances in neural information processing systems, 8, 1995.
- A geometric take on metric learning. Advances in Neural Information Processing Systems, 25, 2012.
- Few-shot object detection via feature reweighting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 8420–8429, 2019.
- Deep metric learning: A survey. Symmetry, 11(9):1066, 2019.
- Brian Kulis et al. Metric learning: A survey. Foundations and Trends® in Machine Learning, 5(4):287–364, 2013.
- Objective mismatch in model-based reinforcement learning. arXiv preprint arXiv:2002.04523, 2020.
- Active learning in the predict-then-optimize framework: A margin-based approach. arXiv preprint arXiv:2305.06584, 2023.
- Optimizing millions of hyperparameters by implicit differentiation. In International Conference on Artificial Intelligence and Statistics, pages 1540–1552. PMLR, 2020.
- Smart predict-and-optimize for hard combinatorial optimization problems. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 1603–1610, 2020.
- Mean-variance analysis in portfolio choice and capital markets, volume 66. John Wiley & Sons, 2000.
- Richard O Michaud. The markowitz optimization enigma: Is ‘optimized’optimal? Financial analysts journal, 45(1):31–42, 1989.
- Control-oriented model-based reinforcement learning with implicit differentiation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 7886–7894, 2022.
- Resolving class imbalance in object detection with weighted cross entropy losses. arXiv preprint arXiv:2006.01413, 2020.
- Tuning computer vision models with task rewards. arXiv preprint arXiv:2302.08242, 2023.
- A survey of contextual optimization methods for decision making under uncertainty, 2023.
- Decision-focused learning without decision-making: Learning locally optimized decision losses. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors, Advances in Neural Information Processing Systems, 2022. URL https://openreview.net/forum?id=eN2lQxjWL05.
- Leaving the nest: Going beyond local loss functions for predict-then-optimize. arXiv preprint arXiv:2305.16830, 2023.
- Decision-oriented learning with differentiable submodular maximization for vehicle routing problem. arXiv preprint arXiv:2303.01543, 2023.
- Value gradient weighted model-based reinforcement learning. arXiv preprint arXiv:2204.01464, 2022.
- End-to-end learning with multiple modalities for system-optimised renewables nowcasting. arXiv preprint arXiv:2304.07151, 2023.
- Decision-focused learning in restless multi-armed bandits with application to maternal and child care domain. arXiv preprint arXiv:2202.00916, 2022.
- Metric learning for kernel regression. In Artificial intelligence and statistics, pages 612–619. PMLR, 2007.
- Melding the data-decisions pipeline: Decision-focused learning for combinatorial optimization. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 1658–1665, 2019.
- Fixes that fail: Self-defeating improvements in machine-learning systems. Advances in Neural Information Processing Systems, 34:11745–11756, 2021.
- Differentiable top-k with optimal transport. Advances in Neural Information Processing Systems, 33:20520–20531, 2020.
- Distance metric learning: A comprehensive survey. Michigan State Universiy, 2(2):4, 2006.
- Landscape surrogate: Learning decision losses for mathematical optimization under partial information. arXiv preprint arXiv:2307.08964, 2023.