Vast amounts of personalized information are now available to individuals. A vital research challenge is to establish how people decide what information they wish to obtain. Here, over five studies examining information-seeking in different domains we show that information-seeking is associated with three diverse motives. Specifically, we find that participants assess whether information is useful in directing action, how it will make them feel, and whether it relates to concepts they think of often. We demonstrate that participants integrate these assessments into a...
Subjective optimality in finite sequential decision-making
Many decisions in life are sequential and constrained by a time window. Although mathematically derived optimal solutions exist, it has been reported that humans often deviate from making optimal choices. Here, we used a secretary problem, a classic example of finite sequential decision-making, and investigated the mechanisms underlying individuals’ suboptimal choices. Across three independent experiments, we found that a dynamic programming model comprising subjective value function explains individuals’ deviations from optimality and predicts the choice behaviors...
How the value of the environment controls persistence in visual search
Classic foraging theory predicts that humans and animals aim to gain maximum reward per unit time. However, in standard instrumental conditioning tasks individuals adopt an apparently suboptimal strategy: they respond slowly when the expected value is low. This reward-related bias is often explained as reduced motivation in response to low rewards. Here we present evidence this behavior is associated with a complementary increased motivation to search the environment for alternatives. We trained monkeys to search for reward-related visual targets in environments with...
A confirmation bias in perceptual decision-making due to hierarchical approximate inference
Making good decisions requires updating beliefs according to new evidence. This is a dynamical process that is prone to biases: in some cases, beliefs become entrenched and resistant to new evidence (leading to primacy effects), while in other cases, beliefs fade over time and rely primarily on later evidence (leading to recency effects). How and why either type of bias dominates in a given context is an important open question. Here, we study this question in classic perceptual decision-making tasks, where, puzzlingly, previous empirical studies differ in the kinds of...
Striatal BOLD and midfrontal theta power express motivation for action
Action selection is biased by the valence of anticipated outcomes. To assess mechanisms by which these motivational biases are expressed and controlled, we measured simultaneous EEG-fMRI during a motivational Go/NoGo learning task (N = 36), leveraging the temporal resolution of EEG and subcortical access of fMRI. VmPFC BOLD encoded cue valence, importantly predicting trial-by-trial valence-driven response speed differences and EEG theta power around cue onset. In contrast, striatal BOLD encoded selection of active Go responses and correlated with theta power...
Subjective probability is modulated by emotions
Information about risks and probabilities is ubiquitous in our environment, forming the basis for decisions in an uncertain world. Emotions are known to modulate subjective probability assessments when probabilistic information is emotionally valenced. Yet little is known about the role of emotions in subjective probability assessment of affectively neutral events. We investigated this in one correlational study (Study 1, N = 162) and one experimental study (Study 2, N = 119). As predicted, we found that emotional dominance modulated the degree of conservatism...
Affect-congruent attention drives changes in reward expectations
Positive and negative affective states are respectively associated with optimistic and pessimistic expectations regarding future reward. One mechanism that might underlie these affect-related expectation biases is attention to positive- versus negative-valence stimulus features (e.g., attending to the positive reviews of a restaurant versus its expensive price). Here we tested the effects of experimentally induced positive and negative affect on feature-based attention in 120 participants completing a compound-generalization task with eye-tracking. We found that...
Latent motives guide structure learning during adaptive social choice
Predicting the behaviour of others is an essential part of social cognition. Despite its ubiquity, social prediction poses a poorly understood generalization problem: we cannot assume that others will repeat past behaviour in new settings or that their future actions are entirely unrelated to the past. We demonstrate that humans solve this challenge using a structure learning mechanism that uncovers other peoples latent, unobservable motives, such as greed and risk aversion. In four studies, participants (N = 501) predicted other players decisions across four...
When helping is risky: The behavioral and neurobiological trade-off of social and risk preferences
Helping other people can entail risks for the helper. For example, when treating infectious patients, medical volunteers risk their own health. In such situations, decisions to help should depend on the individual’s valuation of others’ well-being (social preferences) and the degree of personal risk the individual finds acceptable (risk preferences). We investigated how these distinct preferences are psychologically and neurobiologically integrated when helping is risky. We used incentivized decision-making tasks (Study 1; N = 292 adults) and manipulated dopamine...
Rumination Derails Reinforcement Learning with Possible Implications for Ineffective Behavior
How does rumination affect reinforcement learning-the ubiquitous process by which we adjust behavior after error in order to behave more effectively in the future? In a within-subject design (n=49), we tested whether experimentally manipulated rumination disrupts reinforcement learning in a multidimensional learning task previously shown to rely on selective attention. Rumination impaired performance, yet unexpectedly this impairment could not be attributed to decreased attentional breadth (quantified using a decay parameter in a computational model). Instead,...
Causal inference regulates audiovisual spatial recalibration via its influence on audiovisual perception
To obtain a coherent perception of the world, our senses need to be in alignment. When we encounter misaligned cues from two sensory modalities, the brain must infer which cue is faulty and recalibrate the corresponding sense. We examined whether and how the brain uses cue reliability to identify the miscalibrated sense by measuring the audiovisual ventriloquism aftereffect for stimuli of varying visual reliability. To adjust for modality-specific biases, visual stimulus locations were chosen based on perceived alignment with auditory stimulus locations for each...
Crowd control: Reducing individual estimation bias by sharing biased social information
Cognitive biases are widespread in humans and animals alike, and can sometimes be reinforced by social interactions. One prime bias in judgment and decision-making is the human tendency to underestimate large quantities. Previous research on social influence in estimation tasks has generally focused on the impact of single estimates on individual and collective accuracy, showing that randomly sharing estimates does not reduce the underestimation bias. Here, we test a method of social information sharing that exploits the known relationship between the true value and the...
Neurocomputational mechanism of controllability inference under a multi-agent setting
Controllability perception significantly influences motivated behavior and emotion and requires an estimation of one’s influence on an environment. Previous studies have shown that an agent can infer controllability by observing contingency between one’s own action and outcome if there are no other outcome-relevant agents in an environment. However, if there are multiple agents who can influence the outcome, estimation of one’s genuine controllability requires exclusion of other agents’ possible influence. Here, we first investigated a computational and neural mechanism...
Generating options and choosing between them depend on distinct forms of value representation
Humans have a remarkable capacity for flexible decision-making, deliberating among actions by modeling their likely outcomes. This capacity allows us to adapt to the specific features of diverse circumstances. In real-world decision-making, however, people face an important challenge: There are often an enormous number of possibilities to choose among, far too many for exhaustive consideration. There is a crucial, understudied prechoice step in which, among myriad possibilities, a few good candidates come quickly to mind. How do people accomplish this? We show across...
Atypically larger variability of resource allocation accounts for visual working memory deficits in schizophrenia
Working memory (WM) deficits have been widely documented in schizophrenia (SZ), and almost all existing studies attributed the deficits to decreased capacity as compared to healthy control (HC) subjects. Recent developments in WM research suggest that other components, such as precision, also mediate behavioral performance. It remains unclear how different WM components jointly contribute to deficits in schizophrenia. We measured the performance of 60 SZ (31 females) and 61 HC (29 females) in a classical delay-estimation visual working memory (VWM) task and evaluated...
Trusting and learning from others: immediate and long-term effects of learning from observation and advice
Social learning underpins our speciess extraordinary success. Learning through observation has been investigated in several species, but learning from advice-where information is intentionally broadcast-is less understood. We used a pre-registered, online experiment (n = 1492) combined with computational modelling to examine learning through observation and advice. Participants were more likely to immediately follow advice than to copy an observed choice, but this was dependent upon trust in the adviser: highly paranoid participants were less likely to follow...
Emotion prediction errors guide socially adaptive behaviour
People make decisions based on deviations from expected outcomes, known as prediction errors. Past work has focused on reward prediction errors, largely ignoring violations of expected emotional experiences-emotion prediction errors. We leverage a method to measure real-time fluctuations in emotion as people decide to punish or forgive others. Across four studies (N = 1,016), we reveal that emotion and reward prediction errors have distinguishable contributions to choice, such that emotion prediction errors exert the strongest impact during decision-making. We...
The selection balance: contrasting value, proximity and priming in a multitarget foraging task
A critical question in visual foraging concerns the mechanisms driving the next target selection. Observers first identify a set of candidate targets, and then select the best option among these candidates. Recent evidence suggests that target selection relies on internal biases towards proximity (nearest target from the last selection), priming (target from the same category as the last selection) and value (target associated with high value). Here, we tested the role of eye movements in target selection, and notably whether disabling eye movements during target...
Valence bias in metacontrol of decision making in adolescents and young adults
The development of metacontrol of decision making and its susceptibility to framing effects were investigated in a sample of 201 adolescents and adults in Germany (12-25 years, 111 female, ethnicity not recorded). In a task that dissociates model-free and model-based decision making, outcome magnitude and outcome valence were manipulated. Both adolescents and adults showed metacontrol and metacontrol tended to increase across adolescence. Furthermore, model-based decision making was more pronounced for loss compared to gain frames but there was no evidence that this...
Humans monitor learning progress in curiosity-driven exploration
Curiosity-driven learning is foundational to human cognition. By enabling humans to autonomously decide when and what to learn, curiosity has been argued to be crucial for self-organizing temporally extended learning curricula. However, the mechanisms driving people to set intrinsic goals, when they are free to explore multiple learning activities, are still poorly understood. Computational theories propose different heuristics, including competence measures (e.g., percent correct) and learning progress, that could be used as intrinsic utility functions to efficiently...
Paranoia, self-deception and overconfidence
Self-deception, paranoia, and overconfidence involve misbeliefs about the self, others, and world. They are often considered mistaken. Here we explore whether they might be adaptive, and further, whether they might be explicable in Bayesian terms. We administered a difficult perceptual judgment task with and without social influence (suggestions from a cooperating or competing partner). Crucially, the social influence was uninformative. We found that participants heeded the suggestions most under the most uncertain conditions and that they did so with high confidence,...
Mnemonic prediction errors promote detailed memories
When our experience violates our predictions, it is adaptive to update our knowledge to promote a more accurate representation of the world and facilitate future predictions. Theoretical models propose that these mnemonic prediction errors should be encoded into a distinct memory trace to prevent interference with previous, conflicting memories. We investigated this proposal by repeatedly exposing participants to pairs of sequentially presented objects (A → B), thus evoking expectations. Then, we violated participants expectations by replacing the second object in the...
The importance of urgency in decision making based on dynamic information
A standard view in the literature is that decisions are the result of a process that accumulates evidence in favor of each alternative until such accumulation reaches a threshold and a decision is made. However, this view has been recently questioned by an alternative proposal that suggests that, instead of accumulated, evidence is combined with an urgency signal. Both theories have been mathematically formalized and supported by a variety of decision-making tasks with constant information. However, recently, tasks with changing information have shown to be more...
Choice history effects in mice and humans improve reward harvesting efficiency
Choice history effects describe how future choices depend on the history of past choices. In experimental tasks this is typically framed as a bias because it often diminishes the experienced reward rates. However, in natural habitats, choices made in the past constrain choices that can be made in the future. For foraging animals, the probability of earning a reward in a given patch depends on the degree to which the animals have exploited the patch in the past. One problem with many experimental tasks that show choice history effects is that such tasks artificially...
Post-interval EEG activity is related to task-goals in temporal discrimination
Studies investigating the neural mechanisms of time perception often measure brain activity while participants perform a temporal task. However, several of these studies are based exclusively on tasks in which time is relevant, making it hard to dissociate activity related to decisions about time from other task-related patterns. In the present study, human participants performed a temporal or color discrimination task of visual stimuli. Participants were informed which magnitude they would have to judge before or after presenting the two stimuli (S1 and S2) in...
Rostral Anterior Cingulate Activations inversely relate to Reward Payoff Maximation & predict Depressed Mood
Choice selection strategies and decision making are typically investigated using multiple-choice gambling paradigms that require participants to maximize reward payoff. However, research shows that performance in such paradigms suffers from individual biases towards the frequency of gains to choose smaller local gains over larger longer term gain, also referred to as melioration. Here, we developed a simple two-choice reward task, implemented in 186 healthy human adult subjects across the adult lifespan to understand the behavioral, computational, and neural bases of...
Joint contributions of metacognition and self-beliefs to uncertainty-guided checking behavior
Checking behavior is a natural and adaptive strategy for resolving uncertainty in everyday situations. Here, we aimed at investigating the psychological drivers of checking and its regulation by uncertainty, in non-clinical participants and controlled experimental settings. We found that the sensitivity of participants’ explicit confidence judgments to actual performance (explicit metacognition) predicted the extent to which their checking strategy was regulated by uncertainty. Yet, a more implicit measure of metacognition (derived from asking participants to opt...
Trait Somatic Anxiety is Associated With Reduced Directed Exploration and Underestimation of Uncertainty
Anxiety has been related to decreased physical exploration, but past findings on the interaction between anxiety and exploration during decision making were inconclusive. Here we examined how latent factors of trait anxiety relate to different exploration strategies when facing volatility-induced uncertainty. Across two studies (total N = 985), we demonstrated that people used a hybrid of directed, random and undirected exploration strategies, which were respectively sensitive to relative uncertainty, total uncertainty and value difference. Trait somatic anxiety,...
Willingness to wait covaries with endogenous variation in cortisol
Stress is a normal part of our everyday lives. It alerts us to changes in our environment working as an early warning system. However, when stress is prolonged, it can become harmful. The deleterious effects of stress on brain function are well established: chronic stress significantly impairs cognitive function reducing our ability to solve problems and to regulate behavior and, therefore, may lead to more challenges that can further exacerbate stress. An important class of decisions that may be made under stress include those between rewards delivered immediately vs....
Reliability assessment of temporal discounting measures in virtual reality environments
In recent years the emergence of high-performance virtual reality (VR) technology has opened up new possibilities for the examination of context effects in psychological studies. The opportunity to create ecologically valid stimulation in a highly controlled lab environment is especially relevant for studies of psychiatric disorders, where it can be problematic to confront participants with certain stimuli in real life. However, before VR can be confidently applied widely it is important to establish that commonly used behavioral tasks generate reliable data within a VR...
Increased temporal discounting and reduced model-based control in problem gambling are not substantially modulated by exposure to virtual gambling environments
High-performance virtual reality (VR) technology has opened new possibilities for the examination of the reactivity towards addiction-related cues (cue-reactivity) in addiction. In this preregistered study (https://osf.io/4mrta), we investigated the subjective, physiological, and behavioral effects of gambling-related VR environment exposure in participants reporting frequent or pathological gambling (n=31) as well as non-gambling controls (n=29). On two separate days, participants explored two rich and navigable VR-environments (neutral: café vs....
Quantifying the contribution of individual variation in timing to delay-discounting
Delay-discounting studies in neuroscience, psychology, and economics have been mostly focused on concepts of self-control, reward evaluation, and discounting. Another important relationship to consider is the link between intertemporal choice and time perception. We presented 50 college students with timing tasks on the range of seconds to minutes and intertemporal-choice tasks on both the time-scale of seconds and of days. We hypothesized that individual differences in time perception would influence decisions about short experienced delays but not long delays. While...
Behavioral and electrocortical effects of transcranial alternating current stimulation during advice-guided decision-making
In decision-making with uncertain outcomes people may rely on external cues, such as expert advice, even if this information has no predictive value. While the fronto-parietal event-related potential (ERP) components feedback-related negativity (FRN) and P3 are associated with both reward/punishment feedback processing, the relationship between ERP modulation and expert advice during decision making remains unclear. In this double-blind sham-controlled within-subject study transcranial alternating current stimulation (tACS) at an intensity of 1 mA was applied to...
Pupil-linked arousal biases evidence accumulation toward desirable percepts during perceptual decision-making
People’s perceptual reports are biased toward percepts they are motivated to see. The arousal system coordinates the body’s response to motivationally significant events and is well positioned to regulate motivational effects on perceptual judgments. However, it remains unclear whether arousal would enhance or reduce motivational biases. Here, we measured pupil dilation as a measure of arousal while participants (N = 38) performed a visual categorization task. We used monetary bonuses to motivate participants to perceive one category over another. Even though the...
Ergodicity-breaking reveals time optimal decision making in humans
Ergodicity describes an equivalence between the expectation value and the time average of observables. Applied to human behaviour, ergodic theories of decision-making reveal how individuals should tolerate risk in different environments. To optimize wealth over time, agents should adapt their utility function according to the dynamical setting they face. Linear utility is optimal for additive dynamics, whereas logarithmic utility is optimal for multiplicative dynamics. Whether humans approximate time optimal behavior across different dynamics is unknown. Here we compare...
Acute Psychosocial Stress Increases Cognitive-Effort Avoidance
Adverse effects following acute stress are traditionally thought to reflect functional impairments of central executive-dependent cognitive-control processes. However, recent evidence demonstrates that cognitive-control application is perceived as effortful and aversive, indicating that stress-related decrements in cognitive performance could denote decreased motivation to expend effort instead. To investigate this hypothesis, we tested 40 young, healthy individuals (20 female, 20 male) under both stress and control conditions in a 2-day study that had a within-subjects...
Transcranial Direct Current Stimulation above the Medial Prefrontal Cortex Facilitates Decision-Making following Periods of Low Outcome Controllability
Recent studies suggest that choice behavior in reinforcement learning tasks is shaped by the level of outcome controllability. In particular, Pavlovian bias (PB) seems to be enhanced under low levels of control, manifesting in approach tendencies toward rewards and response inhibition when facing potential losses. The medial prefrontal cortex (mPFC) has been implicated both in evaluating outcome controllability and in the recruitment of cognitive control (CC) to suppress maladaptive PB during reinforcement learning. The current study tested whether high-definition...
In search of the executive cognitive processes proposed by process-Overlap Theory
Process-Overlap Theory (POT) suggests that measures of cognitive abilities sample from sets of independent cognitive processes. These cognitive processes can be separated into domain-general executive processes, sampled by the majority of cognitive ability measures, and domain-specific processes, sampled only by measures within a certain domain. According to POT, fluid intelligence measures are related because different tests sample similar domain-general executive cognitive processes to some extent. Re-analyzing data from a study by De Simoni and von Bastian (2018), we...
Increased and biased deliberation in social anxiety
A goal of computational psychiatry is to ground symptoms in basic mechanisms. Theory suggests that avoidance in anxiety disorders may reflect dysregulated mental simulation, a process for evaluating candidate actions. If so, these covert processes should have observable consequences: choices reflecting increased and biased deliberation. In two online general population samples, we examined how self-report symptoms of social anxiety disorder predict choices in a socially framed reinforcement learning task, the patent race, in which the pattern of choices reflects the...
Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task
Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and obtained choice data on this task from 24 participants. To arbitrate between different decision-making strategies that participants may use on this task, we developed a set of probabilistic agent-based behavioral models, including exploitative and explorative Bayesian agents,...
Instrumental learning in social interactions: Trait learning from faces and voices
Recent research suggests that reinforcement learning may underlie trait formation in social interactions with faces. The current study investigated whether the same learning mechanisms could be engaged for trait learning from voices. On each trial of a training phase, participants (N = 192) chose from pairs of human or slot machine targets that varied in the (1) reward value and (2) generosity of their payouts. Targets were either auditory (voices or tones; Experiment 1) or visual (faces or icons; Experiment 2) and were presented sequentially before payout...
An uncertainty-based model of the effects of fixation on choice
When people view a consumable item for a longer amount of time, they choose it more frequently; this also seems to be the direction of causality. The leading model of this effect is a drift-diffusion model with a fixation-based attentional bias. Here, we propose an explicitly Bayesian account for the same data. This account is based on the notion that the brain builds a posterior belief over the value of an item in the same way it would over a sensory variable. As the agent gathers evidence about the item from sensory observations and from retrieved memories, the...
Prestimulus inhibition of eye movements reflects temporal expectation rather than time estimation
Eye movements are inhibited prior to the occurrence of temporally predictable events. This ‘oculomotor inhibition effect’ has been demonstrated with various tasks and modalities. Specifically, it was shown that when intervals between cue and target are fixed, saccade rate prior to the target is lower than when they are varied. However, it is still an open question whether this effect is linked to temporal expectation to the predictable target, or to the duration estimation of the interval preceding it. Here, we examined this question in 20 participants while they...
Memory and decision making interact to shape the value of unchosen options
The goal of deliberation is to separate between options so that we can commit to one and leave the other behind. However, deliberation can, paradoxically, also form an association in memory between the chosen and unchosen options. Here, we consider this possibility and examine its consequences for how outcomes affect not only the value of the options we chose, but also, by association, the value of options we did not choose. In five experiments (total n = 612), including a preregistered experiment (n = 235), we found that the value assigned to unchosen options...
Model-based planning deficits in compulsivity are linked to faulty neural representations of task structure
Compulsive individuals have deficits in model-based planning, but the mechanisms that drive this have not been established. We examined two candidates-that compulsivity is linked to (1) an impaired model of the task environment and/or (2) an inability to engage cognitive control when making choices. To test this, 192 participants performed a two-step reinforcement learning task with concurrent EEG recordings, and we related the neural and behavioral data to their scores on a self-reported transdiagnostic dimension of compulsivity. To examine subjects’ internal...
Ageing is associated with disrupted reinforcement learning whilst learning to help others is preserved
Reinforcement learning is a fundamental mechanism displayed by many species. However, adaptive behaviour depends not only on learning about actions and outcomes that affect ourselves, but also those that affect others. Using computational reinforcement learning models, we tested whether young (age 18-36) and older (age 60-80, total n = 152) adults learn to gain rewards for themselves, another person (prosocial), or neither individual (control). Detailed model comparison showed that a model with separate learning rates for each recipient best explained behaviour....
Paranoia and belief updating during the COVID-19 crisis
The COVID-19 pandemic has made the world seem less predictable. Such crises can lead people to feel that others are a threat. Here, we show that the initial phase of the pandemic in 2020 increased individuals paranoia and made their belief updating more erratic. A proactive lockdown made peoples belief updating less capricious. However, state-mandated mask-wearing increased paranoia and induced more erratic behaviour. This was most evident in states where adherence to mask-wearing rules was poor but where rule following is typically more common. Computational analyses...
An association between prediction errors and risk-seeking: Theory and behavioral evidence
Reward prediction errors (RPEs) and risk preferences have two things in common: both can shape decision making behavior, and both are commonly associated with dopamine. RPEs drive value learning and are thought to be represented in the phasic release of striatal dopamine. Risk preferences bias choices towards or away from uncertainty; they can be manipulated with drugs that target the dopaminergic system. Based on the common neural substrate, we hypothesize that RPEs and risk preferences are linked on the level of behavior as well. Here, we develop this hypothesis...
Value signals guide abstraction during learning
The human brain excels at constructing and using abstractions, such as rules, or concepts. Here, in two fMRI experiments, we demonstrate a mechanism of abstraction built upon the valuation of sensory features. Human volunteers learned novel association rules based on simple visual features. Reinforcement-learning algorithms revealed that, with learning, high-value abstract representations increasingly guided participant behaviour, resulting in better choices and higher subjective confidence. We also found that the brain area computing value signals - the ventromedial...
Uncertainty increases curiosity, but decreases happiness
You probably know what kind of things you are curious about, but can you also explain what it feels like to be curious? Previous studies have demonstrated that we are particularly curious when uncertainty is high and when information provides us with a substantial update of what we know. It is unclear, however, whether this drive to seek information (curiosity) is appetitive or aversive. Curiosity might correspond to an appetitive drive elicited by the state of uncertainty, because we like that state, or rather it might correspond to an aversive drive to reduce the...