Pessimistic Off-Policy Optimization for Learning to Rank
Matej Čief, Branislav Kveton and Michal Kompan
In Proceedings of the 27th European Conference on Artificial Intelligence, ECAI 2024, Santiago de Compostela, Spain, pp. 1896-1903. IOS Press.
2024