Sequential decisions: A computational comparison of observational and reinforcement accounts

Sepahvand, Nazanin Mohammadi; Stottinger, Elisabeth; Danckert, James; Anderson, Britt

Sequential decisions: A computational comparison of observational and reinforcement accounts

dc.contributor.author	Sepahvand, Nazanin Mohammadi
dc.contributor.author	Stottinger, Elisabeth
dc.contributor.author	Danckert, James
dc.contributor.author	Anderson, Britt
dc.date.accessioned	2026-06-08T12:34:37Z
dc.date.available	2026-06-08T12:34:37Z
dc.date.issued	2014-04-18
dc.description	© 2014 Mohammadi Sepahvand et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
dc.description.abstract	Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms.
dc.description.sponsorship	Natural Sciences and Engineering Research Council of Canada (NSERC), Discovery Grant #261628-07 \|\| Canada Research Chair grants \|\| Heart and Stroke Foundation of Ontario, #NA 6999 \|\| Canadian Institutes of Health Research, #219972.
dc.identifier.uri	https://doi.org/10.1371/journal.pone.0094308
dc.identifier.uri	https://hdl.handle.net/10012/23560
dc.language.iso	en
dc.publisher	Public Library of Science
dc.relation.ispartofseries	PLoS ONE; 9(4); e94308
dc.rights	Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	human learning
dc.subject	learning
dc.subject	brain damage
dc.subject	entropy
dc.subject	decision making
dc.subject	human performance
dc.subject	forecasting
dc.subject	cognitive impairment
dc.title	Sequential decisions: A computational comparison of observational and reinforcement accounts
dc.type	Article
dcterms.bibliographicCitation	Mohammadi Sepahvand N, Stöttinger E, Danckert J, Anderson B (2014) Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts. PLoS ONE 9(4): e94308. https://doi.org/10.1371/journal.pone.0094308
uws.contributor.affiliation1	Faculty of Arts
uws.contributor.affiliation2	Psychology
uws.peerReviewStatus	Reviewed
uws.scholarLevel	Faculty
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: file - 2026-06-05T133314.343.pdf
Size:: 389.74 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 4.47 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Waterloo Research