CT-3.2

Neural network modeling reveals diverse human exploration behaviors via state space analysis

Hua-Dong Xiong, University of Arizona, United States; Li Ji-An, University of California, San Diego, United States; Marcelo Mattar, New York University, United States; Robert Wilson, University of Arizona, United States

Session:
Contributed Talks 3 Lecture

Track:
Cognitive science

Location:
South Schools / East Schools

Presentation Time:
Sun, 27 Aug, 10:45 - 11:00 United Kingdom Time

Abstract:
The exploration-exploitation trade-off, balancing the acquisition of new information with the utilization of known resources, is a fundamental dilemma faced by all adaptive intelligence. Despite our understanding of models based on normative principles, the diverse explore-exploit behaviors of natural intelligence remain elusive. Here, using neural network behavioral modeling and state space analysis, we examined the diverse human exploration behaviors under a novel two-armed bandit task called Changing Bandit, designed to simulate real-world environmental volatility where exploration becomes essential. Examining behavior in the belief state space of this task, we characterized the disparities across artificial agents with decision boundaries. To extend this analysis to human data, a circumstance where choices are too sparse in the belief state space, we trained a recurrent neural network (RNN) model to predict humans’ choices given past observations. This RNN model outperforms all existing cognitive models. Probing the RNN’s decision boundaries, we found substantial individual differences that evade classical cognitive models. Additionally, our RNN revealed a tendency of “high-stay, low-shift” used by humans in response to higher environmental volatilities. Our work offers a promising approach for investigating diverse decision-making strategies in humans and animals.

Manuscript:
License:
Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
DOI:
10.32470/CCN.2023.1437-0
Publication:
2023 Conference on Cognitive Computational Neuroscience
Presentation
Discussion
Resources
No resources available.
Session CT-3
CT-3.1: A computational shortcut to coordination: common knowledge and neural alignment
Cong Wang, Lusha Zhu, Peking University, China
CT-3.2: Neural network modeling reveals diverse human exploration behaviors via state space analysis
Hua-Dong Xiong, University of Arizona, United States; Li Ji-An, University of California, San Diego, United States; Marcelo Mattar, New York University, United States; Robert Wilson, University of Arizona, United States
CT-3.3: Reward morphs non-spatial cognitive maps in humans
Nir Moneta, Max Planck Institute for Human Development Berlin, Germany; Charley M. Wu, University of Tübingen, Germany; Christian F. Doeller, Max Planck Institute for Human Cognitive and Brain Sciences, Germany; Nicolas W. Schuck, Universität Hamburg, Germany
CT-3.4: The Component Processes of Complex Planning Follow Distinct Developmental Trajectories
Ili Ma, Leiden University, Netherlands; Camille V. Phaneuf, Harvard University, United States; Bas van Opheusden, Princeton University, United States; Wei Ji Ma, Catherine A. Hartley, New York University, United States
CT-3.5: Modulating Reward and Punishment Learning Rates in Low Mood Using Transcranial Direct Current Stimulation
Verena Sarrazin, Margot Overman, Luca Mezossy-Dona, Michael Browning, Jacinta O'Shea, University of Oxford, United Kingdom
CT-3.6: Illusion of control differentially affects outcome predictions in pathological and recreational gamblers
Frederike H. Petzschner, Brown University, United States; Saee Paliwal, Benevolent AI, United Kingdom; Gina Paolini, University of Zurich and ETH Zurich, Switzerland; Stephanie Olaiya, Chloe Zimmerman, Brown University, United States; Nicole Zahnd, Helen Schmidt, Katharina Wellstein, University of Zurich and ETH Zurich, Switzerland; Ines Bodmer, Franz Eidenbenz, Karinna Schärli, Till Siegrist, Zentrum für Spielsucht und andere Verhaltenssüchte, Switzerland; Klaas Enno Stephan, University of Zurich and ETH Zurich, Switzerland