Computational Phenotyping in Psychiatry: A Worked Example

eNeuro. 2016 Aug 2;3(4):ENEURO.0049-16.2016. doi: 10.1523/ENEURO.0049-16.2016. eCollection 2016 Jul-Aug.


Philipp Schwartenbeck et al.

Abstract

Computational psychiatry is a rapidly emerging field that uses model-based quantities to infer the behavioral and neuronal abnormalities that underlie psychopathology. If successful, this approach promises key insights into (pathological) brain function as well as a more mechanistic and quantitative approach to psychiatric nosology, structuring therapeutic interventions and predicting response and relapse. The basic procedure in computational psychiatry is to build a computational model that formalizes a behavioral or neuronal process. Measured behavioral (or neuronal) responses are then used to infer the model parameters of a single subject or a group of subjects. Here, we provide an illustrative overview of this process, starting from the modeling of choice behavior in a specific task, simulating data, and then inverting that model to estimate group effects. Finally, we illustrate cross-validation to assess whether between-subject variables (e.g., diagnosis) can be recovered successfully. Our worked example uses a simple two-step maze task and a model of choice behavior based on (active) inference and Markov decision processes. The procedural steps and routines we illustrate are not restricted to a specific field of research or particular computational model but can, in principle, be applied in many domains of computational psychiatry.

Keywords: Markov decision process; active inference; computational psychiatry; generative model; model inversion.


Figures

Figure 1.
A schematic overview of the analysis stream underlying the treatment of computational psychiatry in this article. The basic procedure involves specifying a model of behavior cast in terms of a Markov decision process (MDP). Under the assumption that choices are made in an approximately Bayes optimal fashion using active (Bayesian) inference, this model is sufficient to predict behavior. If we supplement the model specification (MDP) with empirical choice behavior (Data), we can estimate the prior beliefs responsible for those choices. If this is repeated for a series of subjects, the ensuing priors can then be analyzed in a random-effects (parametric empirical Bayes, PEB) model to make inferences about group effects or to perform cross-validation. Furthermore, physiological and behavioral predictions can be used as explanatory variables for fMRI or other neuroimaging time series (bottom left). The routines in the boxes refer to MATLAB routines that are available in the academic software SPM. These routines are sufficient both to simulate behavioral responses and to analyze empirical or observed choice behavior, at both the within-subject and between-subject levels. The final routine also enables cross-validation and predictions about a new subject’s prior beliefs using a leave-one-out scheme, which may be useful for establishing the predictive validity of any models that are considered.
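
The full analysis stream can be summarized as four calls to the SPM routines named above. The following is a minimal sketch, assuming a one-trial MDP structure mdp (specified as in Figs. 2 and 3), a number of subjects N, a group indicator vector group, and a cell array GCM of inverted subject-level models; these variable names are illustrative rather than part of SPM:

    % 1) Simulate behavior under active inference for a specified MDP
    MDP = spm_MDP_VB(mdp);

    % 2) Invert a model of observed choices to estimate prior beliefs
    DCM.MDP   = mdp;              % generative model
    DCM.field = {'beta'};         % parameter(s) to estimate
    DCM.U     = {MDP.o};          % observed outcomes, trial by trial
    DCM.Y     = {MDP.u};          % observed actions, trial by trial
    DCM       = spm_dcm_mdp(DCM);

    % 3) Random-effects (PEB) analysis over subject-level estimates
    M.X = [ones(N,1) group];      % design matrix: group mean and group effect
    PEB = spm_dcm_peb(GCM, M, {'beta'});

    % 4) Leave-one-out cross-validation of the group effect
    [qE, qC, Q] = spm_dcm_loo(GCM, M, {'beta'});
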
Figure 2.
A, Task: we used a simple two-step maze task for our simulations, in which a subject starts in the middle of a T-shaped maze and has to decide whether to sample the left or the right arm, knowing that one of the two arms contains a reward but that she can sample only one of them (i.e., the arms are absorbing states). Alternatively, the subject can sample a cue at the bottom of the maze, which tells her which arm to sample. B, State space: here, the subject has four different control states or actions available: she can move to the middle location, the left arm, the right arm, or the cue location. Based on these control states, we can specify the hidden states, which are all the states a subject can visit in a task and which are often only partially observable. In this task, the hidden states comprise the location × the context (reward left or right), resulting in 4 × 2 = 8 different hidden states. Finally, we have to specify the possible outcomes or observations that an agent can make: here, the subject can find herself in the middle location, in the left or right arm with or without obtaining a reward, or at the cue location.
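
For concreteness, this state space can be enumerated as follows. The ordering (context nested within location) is an assumption, chosen to match the "third column" example in Figure 3, and all variable names are illustrative:

    % Control states (actions): move to middle, left arm, right arm, or cue
    Nu = 4;
    % Hidden states: 4 locations x 2 contexts (reward left or reward right)
    Nlocation = 4; Ncontext = 2;
    Nx = Nlocation * Ncontext;        % = 8 hidden states
    % Outcomes: middle; left/right arm, rewarded or unrewarded; cue showing
    % "left" or "right"
    No = 7;
    % Hidden-state index for a given location and context, assuming context
    % varies fastest (middle-L, middle-R, left-L, left-R, ...)
    hidden_state = @(loc, ctx) ctx + (loc - 1)*Ncontext;
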
Figure 3.
Generative model. A, The A-matrix maps from hidden states to observable outcomes (resulting in a 7 × 8 matrix). There is a deterministic mapping when the subject is either in the middle position (simply observing that she is in the middle) or at the cue location (simply observing that she is at the cue location, where the cue indicates either left or right). However, when the subject is in the left or right arm, there is a probabilistic mapping to a rewarded and an unrewarded outcome. For example, if the subject is in the left arm and the reward is located in the left arm (third column), there is a high probability p of a reward and a low probability q = 1 − p of no reward. B, The B-matrix encodes the transition probabilities (i.e., the mapping from the current hidden state to the next hidden state, contingent on the action taken by the agent). Thus, we need as many B-matrices as there are actions available (four in this example). Illustrated here is the B-matrix for a move to the left arm. We see that the action never changes the context but does (deterministically) change the location, always bringing the subject to the left arm, except when starting from an absorbing state (the right arm). C, Finally, we have to specify the preferences over outcomes in a C-vector. Here, the subject strongly prefers ending up in a rewarded state and strongly dislikes ending up in a left or right arm with no reward, whereas she is somewhat indifferent about the “intermediate” states. Note that these preferences are (prior) beliefs or expectations; for example, the agent believes that a rewarding state is exp(4) ≈ 55 times more likely than an “intermediate” state [exp(0) = 1].
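
A minimal construction of these arrays, using the state ordering above (context within location, an assumption consistent with the "third column" example); the value p = 0.98 and the sign of the aversive preference are illustrative choices, not taken from the article:

    p = 0.98; q = 1 - p;            % reward probability in the correct arm

    % A: 7 x 8 likelihood mapping from hidden states to outcomes
    % columns: [mid-L mid-R left-L left-R right-L right-R cue-L cue-R]
    A = [1 1 0 0 0 0 0 0;           % outcome 1: middle location
         0 0 p q 0 0 0 0;           % outcome 2: left arm, rewarded
         0 0 q p 0 0 0 0;           % outcome 3: left arm, no reward
         0 0 0 0 q p 0 0;           % outcome 4: right arm, rewarded
         0 0 0 0 p q 0 0;           % outcome 5: right arm, no reward
         0 0 0 0 0 0 1 0;           % outcome 6: cue indicates "left"
         0 0 0 0 0 0 0 1];          % outcome 7: cue indicates "right"

    % B: one 8 x 8 transition matrix per action; shown here for "move to
    % the left arm". Location changes but context never does, hence the
    % Kronecker product with a 2 x 2 identity over contexts.
    L = [0 0 0 0;                   % to middle: never reached by this action
         1 1 0 1;                   % to left arm (absorbing once entered)
         0 0 1 0;                   % right arm is absorbing
         0 0 0 0];                  % to cue: never reached by this action
    B{2} = kron(L, eye(2));

    % C: log preferences over the 7 outcomes; a reward is exp(4) times more
    % plausible a priori than an "intermediate" outcome
    c = 4;
    C = [0; c; -c; c; -c; 0; 0];
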
Figure 4.
Data simulation using the routine spm_MDP_VB. A, A simulated example trial, where the top left panel shows the hidden states, the top right panel shows the inferred actions, the middle panels show the inference on policies (i.e., the possible sequences of actions), the bottom left panel shows the preferences over outcomes (C-vector), and the bottom right panel shows the expected precision, which could be encoded by dopamine (Friston et al., n.d.). In this trial, the subject starts in the middle position, where the reward is (most likely) in the right arm. She then chooses to sample the cue and, finally, moves to the right arm, as indicated by the cue (darker colors reflect higher posterior probabilities). B, Overview of a simulated experiment comprising 128 trials. The first panel shows the inferred policies (black regions) and initial states (shown as colored circles: red circles, reward located in the right arm; blue circles, reward located in the left arm) at every given trial. The second panel shows estimated reaction times (cyan dots), outcomes (colored circles), and the value of those outcomes (black bars). Note that the value of outcomes is expressed in terms of an agent’s (expected) utility, which is defined as the logarithm of the agent’s prior expectations; thus, the utility of an outcome is at most 0 [= log(1)]. Reaction times reflect the choice conflict on any given trial and are simulated using the time it takes MATLAB to simulate inference and subsequent choice on that trial (using the tic/toc functions). The third and fourth panels show simulated event-related potentials for hidden state estimation and expected precision, respectively. The specifics of these simulations are discussed in detail elsewhere (Friston et al., 2016). Finally, the fifth and sixth panels illustrate learning and habit formation; our simulations did not include any learning or habitual responses.
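
A sketch of this simulated experiment, assuming a complete one-trial structure mdp (with A, B, and C as above, plus allowable policies in mdp.V and initial-state priors in mdp.D). The plotting calls use the SPM routines spm_MDP_VB_trial and spm_MDP_VB_game, which produce displays of the kind shown in A and B, respectively:

    % Replicate the one-trial MDP over 128 trials, randomizing the context
    n   = 128;
    MDP = repmat(mdp, 1, n);
    for t = 1:n
        MDP(t).s = 1 + (rand > 0.5);  % start in the middle; context L or R
    end

    % Simulate active inference (and choices) for every trial
    MDP = spm_MDP_VB(MDP);

    % Plot a single trial (as in A) and the whole session (as in B)
    spm_MDP_VB_trial(MDP(1));
    spm_MDP_VB_game(MDP);
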
Figure 5.
Model inversion, as implemented by the routine spm_dcm_mdp, is based on simulated behavior. In this routine, the (negative) variational free energy, a lower bound on the log-model evidence, is maximized; here, it converges after the 13th iteration (top right). The trajectory of two estimated parameters through parameter space is shown (top left), as well as their final conditional estimates (bottom left) and their posterior deviation from the prior values (bottom right). The black bars on the bottom right show the true values, while the gray bars show the conditional estimates, illustrating a characteristic shrinkage toward the prior mean.
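
In code, inversion takes the generative model together with the observed outcomes and actions; the sketch below assumes the simulated trials MDP from Figure 4, and the choice of alpha and beta as the two free parameters is an assumption (both are fields recognized by spm_MDP_VB). The routine returns posterior means, covariances, and free energy in DCM.Ep, DCM.Cp, and DCM.F:

    % Assemble the inversion structure: model, data, and free parameters
    DCM.MDP   = mdp;               % generative model with prior parameter values
    DCM.field = {'alpha','beta'};  % parameters to estimate
    DCM.U     = {MDP.o};           % outcomes observed on each trial
    DCM.Y     = {MDP.u};           % actions emitted on each trial

    % Variational Laplace: maximizes negative free energy (DCM.F)
    DCM = spm_dcm_mdp(DCM);

    % Conditional (posterior) estimates, shrunk toward the prior mean
    disp(DCM.Ep)
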
Figure 6.
A, Conditional estimate and confidence interval for the hyperprior on precision (β) as a function of the number of trials in a simulated experiment. B, True and estimated subject-specific parameters, following model inversion for 16 subjects with a group effect in the hyperprior (β). The two groups can be seen as two clusters along the diagonal.
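
A sketch of the corresponding group simulation, wrapping the trial-replication code from Figure 4 in a per-subject loop. The baseline beta = 1 and the log-scale offset of one-quarter for the second group are assumptions consistent with the group effect described in the text:

    Ns    = 16;
    group = kron([0; 1], ones(Ns/2, 1));   % 0 = first group, 1 = second
    GCM   = cell(Ns, 1);                   % inverted models, one per subject
    SDP   = cell(Ns, 1);                   % simulated sessions, one per subject
    for s = 1:Ns
        sub      = mdp;
        sub.beta = exp(group(s)/4);        % group effect in the hyperprior

        % Simulate a session of 128 trials for this subject
        S = repmat(sub, 1, 128);
        for t = 1:numel(S), S(t).s = 1 + (rand > 0.5); end
        SDP{s} = spm_MDP_VB(S);

        % Invert this subject's choices under the common generative model
        clear DCM
        DCM.MDP   = mdp;
        DCM.field = {'beta'};
        DCM.U     = {SDP{s}.o};
        DCM.Y     = {SDP{s}.u};
        GCM{s}    = spm_dcm_mdp(DCM);
    end
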
Figure 7.
Hierarchical empirical Bayesian inference on group effects using the function spm_dcm_peb. A, Results of Bayesian model comparison (reduction) to infer whether the full model (with both a group mean and a group difference) or a reduced (nested) model (bottom left) provides a better explanation for the data. These results indicate high posterior evidence for a model with a group difference, with slightly less evidence for the full model, which also includes a group mean effect (i.e., a deviation from the group prior mean; top panels). The middle panels show the maximum a posteriori estimates of the mean and group effects for the full and reduced models. B, Estimated (gray bars) group mean (left) and difference (right) in β. These estimates are about one-quarter (top right), which corresponds to the group effect that was introduced in the simulations (black bars). The small bars correspond to 90% Bayesian confidence intervals. The reduced parameter estimates correspond to the Bayesian model average over all possible models (full and reduced) following Bayesian model reduction.
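
A sketch of this between-subject analysis, assuming the GCM and group variables built above; spm_dcm_peb_bmc is the SPM routine that performs the Bayesian model reduction, comparison, and averaging described in the caption:

    % Second-level (parametric empirical Bayes) model over subjects
    M.X = [ones(Ns,1) group];   % column 1: group mean; column 2: difference
    PEB = spm_dcm_peb(GCM, M, {'beta'});

    % Bayesian model comparison over reduced (nested) models, followed by
    % Bayesian model averaging of the second-level parameters
    [BMA, BMR] = spm_dcm_peb_bmc(PEB);
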
Figure 8.
Cross-validation based on a leave-one-out scheme. Using the function spm_dcm_loo, we find that group membership is recovered accurately from each subject’s estimate of the hyperprior on precision. This is evidenced by a high correlation between inferred and true group membership in the top right panel, which plots the out-of-sample estimate of the group indicator variable (+1 for the first group and −1 for the second group); these out-of-sample effect sizes were large (by design) in this example. The bottom panel provides the posterior probability that each subject belongs to the first group.
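
In code, assuming the same GCM and design matrix M as above; spm_dcm_loo iteratively leaves each subject out, estimates the group effect from the remaining subjects, and predicts the left-out subject’s group membership:

    % Leave-one-out cross-validation of the group effect in beta
    [qE, qC, Q] = spm_dcm_loo(GCM, M, {'beta'});
    % qE, qC: out-of-sample estimates (and covariance) of the group variable
    % Q: posterior probability that each subject belongs to the first group
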
Figure 9.
Simulated group difference between control subjects and patients (with a group difference in precision of one-quarter) in the average reward received. Note that this difference in an observable variable was successfully traced back to a difference in the hyperprior on precision (a latent variable) by our inference scheme, which is important because such inverse problems are usually ill posed and hard to solve.
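
The observable group difference can be read directly off the simulated outcomes. A sketch, assuming the per-subject sessions SDP{s} retained in the Figure 6 loop and the outcome ordering of Figure 3 (outcomes 2 and 4 are the rewarded states):

    % Proportion of rewarded trials per subject, then averaged per group
    rewarded = @(M) mean(arrayfun(@(trial) any(trial.o == 2 | trial.o == 4), M));
    avgR     = cellfun(rewarded, SDP);
    fprintf('mean reward: %.2f (first group) vs %.2f (second group)\n', ...
            mean(avgR(group == 0)), mean(avgR(group == 1)));
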
