Abstract

Working memory capacity is a robust predictor of higher-order cognitive abilities, and individuals with low working memory capacity have long been characterized as more susceptible to attentional capture by irrelevant distractors. However, recent evidence challenges this conceptualization. This review examines the influential work of Fukuda and Vogel (2011), who proposed and tested the slow disengagement hypothesis—the idea that high- and low-capacity individuals are equally susceptible to initial attentional capture but differ in the speed with which they can disengage from distractors. Drawing on behavioral and electrophysiological evidence, including the N2pc component as a marker of spatial attention, this review synthesizes findings demonstrating that low-capacity individuals exhibit prolonged dwell time on distracting information. The theoretical implications for understanding attentional control, working memory function, and the broader literature on individual differences in cognition are discussed.

Introduction

Working memory (WM) refers to the cognitive system responsible for the temporary maintenance and manipulation of information required for complex tasks such as comprehension, learning, and reasoning. Individual differences in WM capacity are substantial and highly predictive; people with higher WM capacity consistently outperform those with lower capacity on measures of fluid intelligence, reading comprehension, and attentional control (Engle & Kane, 2004; Kane, Bleckley, Conway, & Engle, 2001). For decades, researchers have sought to identify the specific mechanisms that give rise to these individual differences.

A dominant explanation has been that low-capacity individuals possess poorer attentional control, making them more susceptible to having their attention captured by salient but irrelevant stimuli in the environment (Fukuda & Vogel, 2011). According to this view, high-capacity individuals are better able to resist the initial pull of distractors and maintain focus on task-relevant information. However, an alternative account emerged from the work of Fukuda and Vogel (2011), who proposed that the critical difference might lie not in the initial moment of capture, but in what happens afterward. Their slow disengagement hypothesis posits that all individuals are equally susceptible to initial attentional capture, but high-capacity individuals are able to disengage from distractors and re-focus on task-relevant targets more quickly.

This review examines the evidence for the slow disengagement hypothesis, its methodological foundations, and its implications for understanding the relationship between working memory capacity and attentional control.

The Measurement of Working Memory Capacity and Attention Control

Complex Span Measures and Individual Differences

The study of individual differences in working memory capacity has been shaped significantly by the development of complex span measures. Daneman and Carpenter (1980) introduced the reading span task, which required participants to read sentences while simultaneously remembering the final word of each sentence. This task, and subsequent complex span measures, were designed to capture the simultaneous processing and storage demands that characterize working memory in real-world cognition (Engle et al., 1999).

Research using these measures has established that working memory capacity is strongly related to fluid intelligence and higher-order cognitive abilities. Kane et al. (2004), using a latent-variable approach, demonstrated that verbal and visuospatial WM capacity measures reflect a primarily domain-general construct, with executive attention processes driving the broad predictive utility of WM span measures. This domain-general view places attention control at the center of individual differences in working memory capacity (Burgoyne & Engle, 2020).

Neural Measures: The Contralateral Delay Activity

A significant methodological advance came with the development of neural measures of visual working memory capacity. Vogel and Machizawa (2004) identified the contralateral delay activity (CDA), an event-related potential (ERP) component that tracks the number of items an individual can maintain in visual working memory. The CDA amplitude increases with the number of items stored and plateaus around three to four items, reflecting the typical adult working memory capacity (Vogel & Machizawa, 2004). Critically, the increase in CDA amplitude between two-item and four-item arrays correlates with individual subjects’ behavioral WM performance, providing a reliable neural index of capacity (Strzelczyk et al., 2023).

This neural measure enabled researchers to classify individuals as high or low in WM capacity with greater precision and to investigate the mechanisms underlying capacity differences. Vogel, McCollough, and Machizawa (2005) used the CDA to demonstrate that low-capacity individuals are less efficient at filtering irrelevant information during encoding into working memory. When instructed to remember only items of a specified color, low-capacity participants showed neural evidence of also storing distractors. This finding established that distractors gain access to working memory more readily in low-capacity individuals, but the mechanism underlying this effect remained unclear.

The Attentional Capture Debate

Stimulus-Driven Versus Contingent Capture

The question of how and when attention is captured by stimuli has been central to cognitive psychology for decades. Two major perspectives have dominated this debate. The stimulus-driven capture view, championed by Theeuwes (1992, 2010), argues that attention is automatically drawn to the most salient stimulus in the visual field, regardless of the observer’s goals. Evidence for this view comes primarily from the additional singleton paradigm, in which a unique color distractor captures attention even when it is irrelevant to the task (Theeuwes, 1992).

In contrast, the contingent capture view, proposed by Folk, Remington, and Johnston (1992), maintains that capture only occurs when a stimulus matches the observer’s current attentional control settings. Evidence for this view comes from spatial cuing paradigms, where distractors that share critical features with the target capture attention, while equally salient distractors that do not match the attentional set fail to capture attention (Folk, Remington, & Johnston, 1992; Folk, Remington, & Johnston, 1994).

The Rapid Disengagement Construct

The stimulus-driven account has invoked the concept of rapid disengagement to explain why some studies fail to find capture by salient distractors. According to this argument, salient distractors initially capture attention in all cases, but attention can disengage so rapidly that the capture is not detected behaviorally (Theeuwes, 2010). However, Folk and Remington (2010) have critically evaluated this rapid disengagement account, arguing that it is at best unsupported and at worst unfalsifiable. They contend that the construct of rapid disengagement allows stimulus-driven theorists to explain away null findings without providing independent evidence for the proposed rapid disengagement mechanism.

This debate sets the stage for Fukuda and Vogel’s (2011) contribution. Rather than treating disengagement as a theoretical construct invoked post-hoc, they designed experiments to measure disengagement directly, both behaviorally and neurally.

The Slow Disengagement Hypothesis: Fukuda and Vogel (2011)

Theoretical Rationale

Fukuda and Vogel (2011) reasoned that two possibilities could account for why low-capacity individuals encode more distractors into working memory (Vogel et al., 2005). First, low-capacity individuals might be more likely to have their attention captured by distractors in the first place. Second, low-capacity individuals might be captured equally often but take longer to disengage from distractors, increasing the probability that distractors are encoded into working memory.

The slow disengagement hypothesis proposed that high- and low-capacity individuals would show equivalent initial capture by a salient distractor, but would differ in the speed of subsequent disengagement (Fukuda & Vogel, 2011). This hypothesis predicted that at the earliest moments following display onset, both groups would show comparable evidence of attention being directed to the distractor, while at later moments, high-capacity individuals would show faster re-orienting to the target.

Experimental Approach

To test this hypothesis, Fukuda and Vogel (2011) employed two complementary approaches across two experiments. In Experiment 1, they used a psychophysical behavioral task with varying stimulus onset asynchronies (SOAs) between the array and a pattern mask. By plotting accuracy as a function of SOA, they could construct a timeline of attentional processing. At short SOAs, accuracy reflects processing before attention has had time to deploy fully; at longer SOAs, accuracy reflects the efficiency of target processing after attention has been deployed.

In Experiment 2, they used electrophysiological measurement of the N2pc component, an ERP marker of the focus of spatial attention (Luck & Hillyard, 1994). By positioning the target and salient distractor on opposite sides of the display, they could track the trajectory of attention over time. If attention was initially captured by the distractor, an N2pc would first appear contralateral to the distractor’s location; as attention shifted to the target, an N2pc would subsequently appear contralateral to the target’s location. The timing of these signals provided a direct measure of how long attention dwelled on the distractor.

Key Findings

The results from both experiments provided convergent evidence supporting the slow disengagement hypothesis. At the shortest SOAs in Experiment 1, high- and low-capacity participants showed equivalent accuracy on distractor-present trials, indicating that initial capture did not differ as a function of working memory capacity. Similarly, in Experiment 2, the initial N2pc signal was observed for the distractor in both groups, with equivalent amplitude and latency (Fukuda & Vogel, 2011).

However, clear differences emerged in recovery time. As SOA increased in Experiment 1, high-capacity participants showed rapid recovery, with accuracy approaching baseline levels at shorter SOAs, while low-capacity participants showed prolonged impairment. In Experiment 2, the N2pc data revealed that high-capacity participants shifted attention from the distractor to the target rapidly, often within 50-100ms, while low-capacity participants’ attention remained lateralized to the distractor for a significantly longer period (Fukuda & Vogel, 2011).

These findings demonstrated that all individuals are susceptible to initial attentional capture by salient distractors, but individuals with high working memory capacity are able to disengage from distractors and re-focus on task-relevant information more quickly. The deficit associated with low capacity is therefore not greater distractibility, but slower recovery from distraction.

Theoretical Implications

Reframing Attentional Control

The slow disengagement hypothesis refines the concept of attentional control by distinguishing between two processes: the prevention of capture and the termination of capture. The results suggest that attentional control operates not only as a proactive filter that attempts to keep distractions out, but also as a reactive mechanism that disengages attention after a distraction has occurred (Fukuda & Vogel, 2011). Individual differences in working memory capacity appear to be more strongly related to the efficiency of this reactive disengagement process than to the efficacy of proactive filtering.

This distinction aligns with broader theoretical frameworks linking working memory capacity to executive attention. Kane et al. (2001) proposed that WM capacity reflects the ability to maintain goal-relevant information in the face of interference, a process that requires both maintaining task goals and resolving competition from irrelevant information. The slow disengagement identified by Fukuda and Vogel (2011) can be understood as a specific form of competition resolution—the ability to disengage from a distractor that has already captured attention.

Mechanism for Working Memory Clutter

The findings provide a mechanistic explanation for the observation that low-capacity individuals encode distractors into working memory (Vogel et al., 2005). Because low-capacity individuals dwell on distractors for a longer period, these distractors have a greater probability of being consolidated into working memory. The slow recovery from capture thus serves as a gateway through which irrelevant information gains access to limited working memory capacity.

This perspective challenges the simple concept of memory as storage space. As Vogel noted, “Until now, it’s been assumed that people with high capacity visual working memory had greater storage but actually, it’s about the bouncer—a neural mechanism that controls what information gets into awareness” (University of Oregon, 2005). The problem for low-capacity individuals is not that they attempt to store more items, but that they store the wrong items due to prolonged attention to distractors.

Resolution of the Attentional Capture Debate

The study contributes to the resolution of the debate between stimulus-driven and contingent capture accounts. The finding that all individuals show initial capture supports the stimulus-driven view that salient stimuli automatically attract attention (Theeuwes, 1992, 2010). However, the finding that recovery time varies with working memory capacity suggests that top-down control plays a critical role in the aftermath of capture, influencing how quickly attention can be re-oriented to goal-relevant information.

This temporal dissociation—equivalent initial capture followed by differential disengagement—helps reconcile the two perspectives. Initial capture may indeed be stimulus-driven and automatic, as Theeuwes (2010) argues, but contingent capture effects observed in spatial cuing paradigms may reflect the efficiency of disengagement rather than the probability of initial capture. As Folk and Remington (2010) noted, the disengagement construct is critical to understanding attentional dynamics, but it must be measured directly rather than invoked post-hoc—precisely what Fukuda and Vogel (2011) accomplished.

Methodological Contributions

The N2pc as a Tool for Tracking Attention

The use of the N2pc component in Experiment 2 represents a significant methodological contribution. The N2pc provides a real-time, spatially specific marker of attention allocation that does not depend on behavioral responses, eliminating potential confounds related to decision-making or motor processes (Luck & Hillyard, 1994). By positioning the target and distractor on opposite sides of the display, Fukuda and Vogel (2011) were able to track the trajectory of attention moment-by-moment, providing direct neural evidence for differential disengagement times.

This approach demonstrates the value of using ERP measures to investigate the temporal dynamics of cognitive processes. As research on the N2pc and CDA continues to advance, including multi-site replication efforts (Strzelczyk et al., 2023), these neural measures will remain essential tools for understanding individual differences in attention and working memory.

The Importance of Temporal Dynamics

The study also highlights the importance of examining the time course of cognitive processes rather than treating performance as a static snapshot. By manipulating SOA and tracking attention over time, Fukuda and Vogel (2011) were able to dissociate initial capture from subsequent recovery—a distinction that was invisible when only average performance was examined. This approach has broader implications for cognitive research, suggesting that many apparent individual differences may reflect differences in processing speed or dynamics rather than differences in initial processing.

Limitations and Future Directions

Generalizability Across Stimulus Types and Populations

The Fukuda and Vogel (2011) study used visual search tasks with simple geometric shapes and color singletons as distractors. An important question for future research is whether the slow disengagement hypothesis generalizes to other types of stimuli and task contexts. Some evidence suggests that attentional disengagement from emotionally salient stimuli, such as fearful faces, may show different patterns and may be influenced by factors such as long-term yoga experience, which has been associated with facilitated disengagement (Semantic Scholar, n.d.). Understanding the boundary conditions of the slow disengagement effect will be important for developing a comprehensive theory.

The Role of Proactive Control

While Fukuda and Vogel (2011) demonstrated that low-capacity individuals show slower disengagement, this does not preclude the possibility that they also show deficits in proactive control mechanisms. Research on state depression, for example, has suggested that impairments in attentional control may involve both proactive control and distractor processing (Semantic Scholar, n.d.). Future research should examine the relative contributions of proactive and reactive control mechanisms to individual differences in working memory capacity.

Implications for Clinical Populations

The slow disengagement hypothesis has potential implications for understanding attentional deficits in clinical populations. Individuals with attention deficit hyperactivity disorder (ADHD) and schizophrenia show impairments in working memory and attentional control that may reflect slowed disengagement from distractors (University of Oregon, 2005). Developing interventions that target disengagement efficiency, rather than simply attempting to prevent distraction, could lead to more effective treatments.

Conclusion

Fukuda and Vogel (2011) made a significant contribution to our understanding of individual differences in attention and working memory by demonstrating that low-capacity individuals are not more susceptible to initial attentional capture, but rather take longer to disengage from distractors once captured. This slow disengagement hypothesis reframes attentional control as encompassing both proactive filtering and reactive disengagement, with individual differences in working memory capacity more strongly related to the efficiency of disengagement.

The study’s methodological rigor, combining behavioral measures with ERP recordings of the N2pc component, provides a model for investigating the temporal dynamics of cognitive processes. By dissociating initial capture from subsequent recovery, Fukuda and Vogel (2011) revealed a mechanism that was invisible in previous research focused on average performance.

The implications of this work extend beyond the laboratory. Understanding that low-capacity individuals dwell on distractors longer, rather than being distracted more easily, suggests new approaches to improving attentional control in educational, occupational, and clinical settings. Interventions might focus on training rapid disengagement rather than attempting to block distraction altogether—a distinction that could lead to more effective strategies for enhancing cognitive performance.

References

Burgoyne, A. P., & Engle, R. W. (2020). Attention control: A cornerstone of higher-order cognition. Current Directions in Psychological Science, 29(6), 624–630.

Daneman, M., & Carpenter, P. A. (1980). Individual differences in working memory and reading. Journal of Memory and Language, 19, 450–466.

Engle, R. W., & Kane, M. J. (2004). Executive attention, working memory capacity, and a two-factor theory of cognitive control. In B. H. Ross (Ed.), The psychology of learning and motivation: Advances in research and theory (Vol. 44, pp. 145–199). Elsevier Academic Press.

Engle, R. W., Tuholski, S. W., Laughlin, J. E., & Conway, A. R. (1999). Working memory, short-term memory, and general fluid intelligence: A latent-variable approach. Journal of Experimental Psychology: General, 128, 309–331.

Folk, C. L., & Remington, R. W. (2010). A critical evaluation of the disengagement hypothesis. Acta Psychologica, 135(2), 103–105.

Folk, C. L., Remington, R. W., & Johnston, J. C. (1992). Involuntary covert orienting is contingent on attentional control settings. Journal of Experimental Psychology: Human Perception and Performance, 18(4), 1030–1044.

Folk, C. L., Remington, R. W., & Johnston, J. C. (1994). The structure of attentional control: Contingent attentional capture by apparent motion, abrupt onset, and color. Journal of Experimental Psychology: Human Perception and Performance, 20(2), 317–329.

Fukuda, K., & Vogel, E. K. (2011). Individual differences in recovery time from attentional capture. Psychological Science, 22(3), 361–368.

Kane, M. J., Bleckley, M. K., Conway, A. R. A., & Engle, R. W. (2001). A controlled-attention view of working-memory capacity. Journal of Experimental Psychology: General, 130(2), 169–183.

Kane, M. J., Hambrick, D. Z., Tuholski, S. W., Wilhelm, O., Payne, T. W., & Engle, R. W. (2004). The generality of working memory capacity: A latent-variable approach to verbal and visuospatial memory span and reasoning. Journal of Experimental Psychology: General, 133(2), 189–217.

Luck, S. J., & Hillyard, S. A. (1994). Spatial filtering during visual search: Evidence from human electrophysiology. Journal of Experimental Psychology: Human Perception and Performance, 20(5), 1000–1014.

Strzelczyk, D., Clayson, P. E., Sigurdardottir, H. M., Mushtaq, F., Pavlov, Y. G., Devillez, H., Lukashevich, A., Rocha, H. A., Chung, Y. H., Ortego, K. M., Störmer, V. S., García Alanis, J. C., Löffler, C., Schubert, A. L., Biel, A. L., Tretow, A., Xu, W., Hamalainen, J., Lu, Z., … Langer, N. (2023). Contralateral delay activity as a marker of visual working memory capacity: A multi-site registered replication. Cortex. Advance online publication.

Theeuwes, J. (1992). Perceptual selectivity for color and form. Perception & Psychophysics, 51(6), 599–606.

Theeuwes, J. (2010). Top–down and bottom–up control of visual selection. Acta Psychologica, 135(2), 77–99.

University of Oregon. (2005, November 23). Discovery disproves simple concept of memory as ‘storage space’. EurekAlert! https://www.eurekalert.org/news-releases/475673

Vogel, E. K., & Machizawa, M. G. (2004). Neural activity predicts individual differences in visual working memory capacity. Nature, 428(6984), 748–751.

Vogel, E. K., McCollough, A. W., & Machizawa, M. G. (2005). Neural measures reveal individual differences in controlling access to working memory. Nature, 438(7067), 500–503.