Sequential decisions: A computational comparison of observational and reinforcement accounts

Sepahvand, Nazanin Mohammadi; Stottinger, Elisabeth; Danckert, James; Anderson, Britt

Sequential decisions: A computational comparison of observational and reinforcement accounts

Files

file - 2026-06-05T133314.343.pdf (389.74 KB)

Date

2014-04-18

Authors

Sepahvand, Nazanin Mohammadi

Stottinger, Elisabeth

Danckert, James

Anderson, Britt

Publisher

Public Library of Science

Abstract

Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms.

Description

© 2014 Mohammadi Sepahvand et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

human learning, learning, brain damage, entropy, decision making, human performance, forecasting, cognitive impairment

URI

https://doi.org/10.1371/journal.pone.0094308
https://hdl.handle.net/10012/23560

Collections

Waterloo Research

Full item page

Sequential decisions: A computational comparison of observational and reinforcement accounts

Files

Date

Authors

Advisor

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

LC Subject Headings

Citation

URI

Collections