ABSTRACT
Real-world applications often involve domain-specific and task-based performance objectives that are not captured by the standard machine learning losses, but are critical for decision making. A key challenge for direct integration of more meaningful domain and task-based evaluation criteria into an end-to-end gradient-based training process is the fact that often such performance objectives are not necessarily differentiable and may even require additional decision-making optimization processing. We propose the Task-Oriented Prediction Network (TOPNet), an end-to-end learning scheme that automatically integrates task-based evaluation criteria into the learning process via a learnable surrogate loss function, which directly guides the model towards the task-based goal. A major benefit of the proposed TOPNet learning scheme lies in its capability of automatically integrating nondifferentiable evaluation criteria, which makes it particularly suitable for diversified and customized task-based evaluation criteria in real-world tasks. We validate the performance of TOPNet on two real-world financial prediction tasks, revenue surprise forecasting and credit risk modeling. The experimental results demonstrate that TOPNet significantly outperforms both traditional modeling with standard losses and modeling with handcrafted heuristic differentiable surrogate losses.
- Yoshua Bengio. Using a financial training criterion rather than a prediction criterion. International Journal of Neural Systems, 8(04):433-443, 1997.Google Scholar
- Priya Donti, Brandon Amos, and J Zico Kolter. Task-based end-to-end model learning in stochastic optimization. In Advances in Neural Information Processing Systems, pages 5484-5494, 2017.Google Scholar
- M. Doumpos, C. Lemonakis, D. Niklis, and C. Zopounidis. Introduction to credit risk modeling and assessment. In In: Analytical Techniques in the Assessment of Credit Risk. EURO Advanced Tutorials on Operational Research, pages 1-21. Springer, Cham, 2019.Google Scholar
- Adam N Elmachtoub and Paul Grigas. Smart" predict, then optimize". arXiv preprint arXiv:1710.08005, 2017.Google Scholar
- Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.Google Scholar
- Narasimhan Jegadeesh and Joshua Livnat. Revenue surprises and stock returns. Journal of Accounting and Economics, 41, 2006.Google Scholar
- Yi-hao Kao, Benjamin V Roy, and Xiang Yan. Directed regression. In Advances in Neural Information Processing Systems, pages 889-897, 2009.Google Scholar
- Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.Google Scholar
- Andrew Perrault, Bryan Wilder, Eric Ewing, Aditya Mate, Bistra Dilkina, and Milind Tambe. Decision-focused learning of adversary behavior in security games. arXiv preprint arXiv:1903.00958, 2019.Google Scholar
- Marcos Lopez de Prado. Advances in Financial Machine Learning. Wiley, 2018.Google Scholar
- Matthew Riemer, Aditya Vempaty, Flavio P. Calmon, Fenno F. Heath, III, Richard Hull, and Elham Khabiri. Correcting forecasts with multifactor neural attention. In Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48, ICML'16, pages 3010-3019. JMLR.org, 2016.Google Scholar
- Bryan Wilder, Bistra Dilkina, and Milind Tambe. Melding the data-decisions pipeline: Decision-focused learning for combinatorial optimization. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 1658-1665, 2019.Google Scholar
- Bryan Wilder, Eric Ewing, Bistra Dilkina, and Milind Tambe. End to end learning and optimization on graphs. arXiv preprint arXiv:1905.13732, 2019.Google Scholar
- Hongxia Yang, Yada Zhu, and Jingrui He. Local algorithm for user action prediction towards display ads. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '17, pages 2091-2099, New York, NY, USA, 2017. ACM.Google Scholar
Index Terms
(auto-classified)Task-based learning via task-oriented prediction network with applications in finance
Comments