Sequential decisions: A computational comparison of observational and reinforcement accounts

dc.contributor.authorSepahvand, Nazanin Mohammadi
dc.contributor.authorStottinger, Elisabeth
dc.contributor.authorDanckert, James
dc.contributor.authorAnderson, Britt
dc.date.accessioned2026-06-08T12:34:37Z
dc.date.available2026-06-08T12:34:37Z
dc.date.issued2014-04-18
dc.description© 2014 Mohammadi Sepahvand et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
dc.description.abstractRight brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms.
dc.description.sponsorshipNatural Sciences and Engineering Research Council of Canada (NSERC), Discovery Grant #261628-07 || Canada Research Chair grants || Heart and Stroke Foundation of Ontario, #NA 6999 || Canadian Institutes of Health Research, #219972.
dc.identifier.urihttps://doi.org/10.1371/journal.pone.0094308
dc.identifier.urihttps://hdl.handle.net/10012/23560
dc.language.isoen
dc.publisherPublic Library of Science
dc.relation.ispartofseriesPLoS ONE; 9(4); e94308
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjecthuman learning
dc.subjectlearning
dc.subjectbrain damage
dc.subjectentropy
dc.subjectdecision making
dc.subjecthuman performance
dc.subjectforecasting
dc.subjectcognitive impairment
dc.titleSequential decisions: A computational comparison of observational and reinforcement accounts
dc.typeArticle
dcterms.bibliographicCitationMohammadi Sepahvand N, Stöttinger E, Danckert J, Anderson B (2014) Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts. PLoS ONE 9(4): e94308. https://doi.org/10.1371/journal.pone.0094308
uws.contributor.affiliation1Faculty of Arts
uws.contributor.affiliation2Psychology
uws.peerReviewStatusReviewed
uws.scholarLevelFaculty
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
file - 2026-06-05T133314.343.pdf
Size:
389.74 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
4.47 KB
Format:
Item-specific license agreed upon to submission
Description: