- Probabilistic inference in reinforcement learning done right
J. Tarbouriech, T. Lattimore, and B. O'Donoghue
Advances in Neural Information Processing Systems (NeurIPS), 2023
- Optimistic meta-gradients
S. Flennerhag, T. Zahavy, B. O'Donoghue, H. van Hasselt, A. György, and S. Singh
Advances in Neural Information Processing Systems (NeurIPS), 2023
- Efficient exploration via epistemic-risk-seeking policy optimization
B. O'Donoghue
Proceedings of the International Conference on Machine Learning (ICML), 2023
- ReLOAD: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained MDPs
T. Moskovitz, B. O'Donoghue, V. Veeriah, S. Flennerhag, S. Singh, and T. Zahavy
Proceedings of the International Conference on Machine Learning (ICML), 2023
- The neural testbed: Evaluating joint predictions
I. Osband, Z. Wen, S. Asghari, V. Dwaracherla, B. Hao, M. Ibrahimi, D. Lawson, X. Lu, B. O'Donoghue, and B. Van Roy
Advances in Neural Information Processing Systems (NeurIPS), 2022
- Variational Bayesian optimistic samplingSpotlight
B. O'Donoghue and T. Lattimore
Advances in Neural Information Processing Systems (NeurIPS), 2021
- Reward is enough for convex MDPsSpotlight
T. Zahavy, B. O'Donoghue, G. Desjardins, and S. Singh
Advances in Neural Information Processing Systems (NeurIPS), 2021
- Variational Bayesian reinforcement learning with regret bounds
B. O'Donoghue
Advances in Neural Information Processing Systems (NeurIPS), 2021
- Practical large-scale linear programming using primal-dual hybrid gradient
D. Applegate, M. Díaz, O. Hinder, H. Lu, M. Lubin, B. O'Donoghue, and W. Schudy
Advances in Neural Information Processing Systems (NeurIPS), 2021
- Matrix games with bandit feedback
B. O'Donoghue, T. Lattimore, and I. Osband
Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (UAI), 2021
- Discovering a set of policies for the worst case rewardSpotlight
T. Zahavy, A. Barreto, D. Mankowitz, S. Hou, B. O'Donoghue, I. Kemaev, and S. Singh
Proceedings of the International Conference on Learning Representations (ICLR), 2021
- Sample-efficient reinforcement learning with REINFORCE
J. Zhang, J. Kim, B. O'Donoghue, and S. Boyd
Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 10887–10895, 2021
- Making sense of reinforcement learning and probabilistic inferenceSpotlight
B. O'Donoghue, I. Osband, and C. Ionescu
Proceedings of the International Conference on Learning Representations (ICLR), 2020
- Hamiltonian descent for composite objectives
B. O'Donoghue and C. J. Maddison
Advances in Neural Information Processing Systems (NeurIPS), 2019
- Visualizations of decision regions in the presence of adversarial examples
G. Swirszcz, B. O'Donoghue, and P. Kohli
Debugging Machine Learning Models Workshop, ICLR, 2019
- Verification of non-linear specifications for neural networks
C. Qin, K. (Dj) Dvijotham, B. O'Donoghue, R. Bunel, R. Stanforth, S. Gowal, J. Uesato, G. Swirszcz, and P. Kohli
Proceedings of the International Conference on Learning Representations (ICLR), 2019
- Adversarial risk and the dangers of evaluating against weak attacks
J. Uesato, B. O'Donoghue, A. van den Oord, and P. Kohli
Proceedings of the International Conference on Machine Learning (ICML), pp. 5025–5034, 2018
- The uncertainty Bellman equation and explorationOral
B. O'Donoghue, I. Osband, R. Munos, and V. Mnih
Proceedings of the International Conference on Machine Learning (ICML), pp. 3836–3845, 2018
- Combining policy gradient and Q-learning
B. O'Donoghue, R. Munos, K. Kavukcuoglu, and V. Mnih
Proceedings of the International Conference on Learning Representations (ICLR), 2017
- Iterated approximate value functions
B. O'Donoghue, Y. Wang, and S. Boyd
Proceedings of the European Control Conference, pp. 3882–3888, Zurich, July 2013
- Min-max approximate dynamic programming
B. O'Donoghue, Y. Wang, and S. Boyd
Proceedings of the IEEE Multi-Conference on Systems and Control, pp. 424–431, September 2011