I am happy to supervise students on a number of topics related to models of perception, decision making and learning in humans. Below are two examples of possible projects.

(Dr. Ulrik Beierholm, email: u.beierholm@bham.ac.uk)

 

 

 

Computational advantage of multiple controllers in reinforcement learning

Reinforcement learning is a powerful way of modelling the learning of an organism acting in an environment with potential rewards and punishments. While it is well known that model based versions of RL are most effective, they very fast become computationally intractable as the number of options increase. In contrast a model-free system is computationally simple, but only converges slowly to the correct values. It has been proposed that a system that utilizes both models in parallel and which arbitrates between them based on their own uncertainty can work as a computationally plausible yet efficient solution.

This study will use numerical simulations to test for any computational advantage in using such multiple controller. The placement will be largely based on developing, programming and numerically testing a model of multiple controllers for RL

 

 

 

 

 

Non-parametric statistical models in human learning and perception

 

Our understanding of how humans learn about our environments, and how this relates to our perception of objects and structures in our environment, has been expended in the last 15 years through the use of Bayesian statistical models. However so far this work has almost exclusively relied on parametric models (such as Gaussian) which although simple in structure, are limited in requiring a pre-specified complexity (e.g. In mixture of Gaussian models). Meanwhile modern statistics and machine learning has developed a number of tools based on non-parametric statistics (such as Dirichlet processes) which allow the data itself to specify the complexity of the model; little data leads to simple models while a large dataset will naturally lead to a more complex model.

 

This project proposes to utilise such non-parametric models to further our understanding of both learning and perception in human cognition. The work will involve a combination of analytical model development, numerical simulation and possibly human psychophysical experiments.