Abstract
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- References
Context Checklists are commonly proposed tools to reduce error. However, when applied by experts, checklists have the potential to increase cognitive load and result in ‘expertise reversal’. One potential solution is to use checklists in the verification stage, rather than in the initial interpretation stage of diagnostic decisions. This may avoid expertise reversal by preserving the experts’ initial approach. Whether checklist use during the verification stage of diagnostic decision making improves experts’ diagnostic decisions is unknown.
Methods Fifteen experts interpreted 18 electrocardiograms (ECGs) in four different conditions: undirected interpretation; verification without a checklist; verification with a checklist, and interpretation combined with verification with a checklist. Outcomes included the number of errors, cognitive load, interpretation time and interpretation length. Outcomes were compared in two analyses: (i) a comparison of verification conditions with and without a checklist, and (ii) a comparison of all four conditions. Standardised scores for each outcome were used to calculate the efficiency of a checklist and to weigh its relative benefit against its relative cost in terms of cognitive load imposed, interpretation time and interpretation length.
Results In both analyses, checklist use was found to reduce error (more errors were corrected in verification conditions with checklists [0.29 ± 0.77 versus 0.03 ± 0.61 errors per ECG], and fewer net errors occurred in all conditions with checklists [0.39 ± 1.14 versus 1.04 ± 1.49 errors per ECG]; p < 0.01 for both). Checklists were not associated with increased cognitive load (verifications with and without checklists: 3.7 ± 1.9 and 3.3 ± 2.0, respectively; conditions with and without checklists: 4.0 ± 1.8 versus 3.9 ± 2.0, respectively [p = not significant for both]). Checklists resulted in greater interpretation times and lengths (p < 0.01 for all). However, checklists were efficient in terms of the cognitive load invested, interpretation time and interpretation length (p < 0.01 for all).
Conclusions Among ECG interpretation experts, checklist use during the verification stage of diagnostic decisions did not increase cognitive load or cause expertise reversal, but did reduce diagnostic error.
Introduction
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- References
To make a medical diagnosis, a large number of interacting variables must be integrated into a summative decision. Given that working memory has a finite capacity, integrating such a large number of variables can quickly exhaust cognitive resources.1 Therefore, the cognitive load involved in making medical diagnoses is often high.
Experts are able to lower the cognitive load involved in making diagnostic decisions.2 Dual processing theory offers unique insights into how this is accomplished. Dual processing refers to two parallel systems of making decisions: intuitive, subconscious thinking (system 1), and analytic, conscious thinking (system 2).3–5 Experts use more system 1 processing.6,7 Rather than holding all relevant variables in working memory, experts recognise patterns they have seen before using subconscious system 1 processing. This involves less cognitive load than an attempt to analyse all interacting variables using system 2. In addition, experts have more efficient system 2 processes. They favour domain-specific strategies (e.g. using a schema) over the higher cognitive load domain-general strategies (e.g. testing one hypothesis at a time) used by novices.8
Other insight into how experts reduce cognitive load can be found in the expertise literature. Experts apply knowledge templates constructed from previous experiences, termed ‘illness scripts’.9 These scripts lower cognitive load by reducing the large number of variables involved in diagnostic decisions to a few key variables relevant to a specific circumstance. How system processing relates to illness scripts has not been formally studied. However, it is likely that both system 1 and system 2 processing are involved in the use of an illness script. Selecting a script is likely to represent a system 1-driven process.10 By contrast, the application of a script probably involves system 2 processing to check key variables using domain-specific strategies.
Despite this ability to lower cognitive load, experts still make errors. In the cognitive psychology literature base, these errors are often viewed as a consequence of over-reliance on system 1 processing.7,11,12 However, forced use of system 2 processing has also been associated with error.13 In the literature on decision making, errors are often traced to systematic biases in how variables are considered. For example, we tend to suppress incongruences14 and ignore missing information.15 In addition, individuals asked to collect information about a hypothesis favour information that confirms their beliefs.16
Checklists are a potentially ideal tool with which to combat diagnostic error.17 A checklist composed of key variables might be used as a decision aid as it can mimic expert illness scripts. However, whereas illness scripts can be idiosyncratic and individual, a checklist ensures all key variables are assessed.9 In addition, checklists encourage system 2 processing and can force independent re-examination of all relevant information.14 Prior evidence suggests that this improves summative decision making in different contexts such as pilot responses to in-flight emergencies, personnel decisions on hiring the right person, and the modification of complex building plans during construction.14,17
However, it is unclear whether a checklist approach can be applied to medical experts. Asking an expert to use a checklist risks increasing the cognitive load of the decision-making process. It might force the expert to abandon his or her own expert processes, ironically resulting in ‘expertise reversal’ or worsened performance.18,19 However, whether or not expertise reversal occurs may depend on when a checklist is used in the decision-making process. The decision-making process can be divided into two stages: interpretation, and verification. Checklist use in the interpretation stage is likely to result in increased cognitive load and expertise reversal.13,18,19 Whether such expertise reversal also occurs when a checklist is used during the verification stage, after the expert has had a chance to use his or her own expert processes, is unknown. Furthermore, merely suggesting that a checklist should be used in the verification stage might derail an expert’s approach to the initial interpretation stage (even if the checklist is not meant to be applied in the interpretation stage).
If checklists do improve performance, it is unclear whether they can do this efficiently. An ideal diagnostic decision process should result in a correct and error-free decision, impose the least amount of cognitive load and use the least amount of time. It should also result in a decision that can be communicated with the least amount of written description possible (i.e. interpretation length). Checklists are likely to add to the cognitive load, time spent and written length of the interpretation. If checklists do improve expert performance, does this improvement outweigh any increases in cognitive load, time spent and interpretation length?
Calculating efficiency, such as by weighing a performance benefit against an increase in cognitive load, is one method of assessing the relative trade-offs between two measured variables. Within the cognitive load literature, the calculation of efficiency was originally described to compare learning tools.20 However, it can also be applied to the study of errors to compare trade-offs in error reduction against expected increases in cognitive load (cognitive efficiency), time spent (time efficiency) and interpretation length (length efficiency).
This study sought to understand whether the use of a checklist during the verification stage would improve or harm expert diagnostic decisions. We hypothesised that checklist use would reduce the number of errors and not result in increased cognitive load or expertise reversal if the checklist was used in a separate verification stage. By contrast, we hypothesised that making experts aware of the need to verify using a checklist before the interpretation stage would result in increased cognitive load, expertise reversal and a greater number of errors. If the use of a checklist did reduce the number of errors, we planned to determine whether this improvement was efficient in terms of cognitive load, time and interpretation length.
Discussion
- Top of page
- Abstract
- Introduction
- Methods
- Results
- Discussion
- References
To our knowledge, this is the first study of the use of cognitive checklists in the interpretation of ECGs. Among experts, checklists afforded a clear benefit. Experts corrected one error for every 3.4 ECGs when checklists were applied. In fact, experts were aware of this benefit. They told us that they routinely used a checklist-like approach and expected to find an error in one in five ECGs.
Verification without a checklist was not associated with benefit, suggesting the effect requires more than just prolonging the verification stage of decision making. Potentiating system 2 processing is likely to be important. A previous study of ECG interpretation by intermediate-level trainees found that errors were detected only when system 2 processing was used, not when system 1 processing was encouraged.22 These findings could be extrapolated to experts: that is, experts are unlikely to detect an error unless they are verifying their interpretation using system 2 processing. In addition, the content of the checklist is likely to be important. The checklist might act as an ‘alternative’ illness script that experts can use in the verification stage. If an expert has not detected an error using his or her own illness script, it is unlikely that reapplying the same illness script in a verification phase will be helpful. However, a checklist offers an alternative approach because the variables are different, or the order of the variables is different or the list of variables is more comprehensive than that in the expert’s own illness script. Future development and study of checklists should be conducted with these hypothesised mechanisms in mind.
Surprisingly, checklist use did not increase cognitive load. The checklist we chose contained familiar variables. As a result, it was probably very easy for experts to adopt it. Furthermore, experts qualitatively valued the checklist approach, which suggests that we had sampled a group of willing participants who were familiar and maybe even expert with a checklist approach. As a result, checklist use in this context was not associated with expertise reversal. Whether this benefit translates to other diagnostic tests or to a context involving experts who place less value on systematic checking cannot be inferred from these results.
There were some disadvantages to the use of the checklist. Use of the checklist did increase verification time. On average, use of the checklist resulted in a 12% increase in verification time of approximately 10 seconds. Interestingly, this was less than estimated by the experts. However, our model does not account for the time taken by experts for systematic checking without prompting, and therefore is likely to represent an underestimate. Although the relative value of errors versus time is unlikely to be easily mathematically summarised, our use of time efficiency suggests that at the very least the time invested resulted in a disproportionate increase in error detection.
The use of a checklist also resulted in longer interpretations. However, although the increase in interpretation length was measurable, it was relatively trivial, at two to 25 characters depending on the condition. Longer interpretations do have consequences in the health care system as they put a burden on consumers of the information. However, this small increment in interpretation length is unlikely to be clinically meaningful.
Limitations
First and foremost, expertise is content- and context-specific. Whether these findings apply in other contexts is unclear. Secondly, the benefit of a checklist is likely to depend on its content, its familiarity to experts and the way in which it is applied. Until the underlying principles of checklist efficacy are more firmly established, checklists should be trialled before use in each context. Thirdly, this study attempted to measure the trade-off between the advantages and disadvantages of using a checklist by adapting measures of efficiency from the cognitive load literature. However, this does not take into account the relative value of each of these measures and should not be applied too literally. How much time should be spent on detecting a life-threatening error on an ECG? We are not suggesting that such a question can be answered by calculating efficiencies. Rather, we have included these measures in order to recognise potential disadvantages and include a relative gauge of effect sizes.
In summary, these results suggest there is substantial benefit to be derived by encouraging the use of checklists among experts in ECG interpretation. Interestingly, this study suggests there is still value to be gained by encouraging greater checklist use among experts who routinely use a systematic or checklist-based approach. Practically, this benefit should be shared not only with practising doctors, but also with the doctors in training who will become our future experts. Finally, participating experts told us that they used checklists in only two thirds of cases. The factors that determine whether an expert will use a checklist in any given case are unknown. Understanding the barriers against the usage of a checklist and content-specific triggers for its avoidance will be important in bringing this benefit to practice.