My research is on sequential decision-making problems under uncertainty and potentially limited feedback. In particular, I work on multi-armed bandit, reinforcement learning, and online learning problems. The problems I work on are often motivated by issues arising in applications such as education and healthcare.



Workshop Papers

  • C. Pike-Burke and S. Grünewälder, Recovering Bandits, in European Workshop on Reinforcement Learning (EWRL), 2018.

  • C. Pike-Burke and S. Grünewälder, Optimistic Planning for Question Selection, in NeurIPS workshop on Machine Learning for Education, 2016.


My PhD focused on sequential decision problems that arise when selecting questions to give to students in education software. In particular, I studied several variants of the multi-armed bandit problem motivated by issues arising in this setting. My thesis was written in collaboration with Sparx and can be accessed here.

* indicates alphabetical ordering of authors