Improving safety of the continual reassessment method via a modified allocation rule
Abstract
This article proposes a novel criterion for the allocation of patients in phase I dose‐escalation clinical trials, aiming to find the maximum tolerated dose (MTD). Conventionally, using a model‐based approach, the next patient is allocated to the dose with the toxicity estimate closest (in terms of the absolute or squared distance) to the maximum acceptable toxicity. This approach, however, ignores the uncertainty in point estimates and ethical concerns of assigning a lot of patients to overly toxic doses. In fact, balancing the trade‐off between how accurately the MTD can be estimated and how many patients would experience adverse events is one of the primary challenges in phase I studies. Motivated by recent discussions in the theory of estimation in restricted parameter spaces, we propose a criterion that allows to balance these explicitly. The criterion requires a specification of one additional parameter only that has a simple and intuitive interpretation. We incorporate the proposed criterion into the one‐parameter Bayesian continual reassessment method and show, using simulations, that it can result in similar accuracy on average as the original design, but with fewer toxic responses on average. A comparison with other model‐based dose‐escalation designs, such as escalation with overdose control and its modifications, demonstrates that the proposed design can result in either the same mean accuracy as alternatives but fewer toxic responses or in a higher mean accuracy but the same number of toxic responses. Therefore, the proposed design can provide a better trade‐off between the accuracy and the number of patients experiencing adverse events, making the design a more ethical alternative over some of the existing methods for phase I trials.
1 INTRODUCTION
The larger variance of p2 favors the decision to allocate the next patient to d2. On the other hand, allocating a patient to a dose with an estimated toxicity probability of 0.4 might be considered unethical as it exposes a patient to unacceptably high toxicity. Clearly, the squared distance criterion 1 fails to account for the uncertainty in the estimates and allows patients to be allocated to overly toxic doses. One of the main challenges in phase I dose‐escalation trials is, however, to balance the trade‐off between accurately estimating the MTD and minimizing the number of adverse events in the study.3
Another design aimed at tackling the accuracy‐toxicity trade‐off is the Bayesian logistic regression model (BLRM).8 To address the first concern, it is proposed to use the whole distribution of the DLT probability, while the safety aspect is addressed using a penalty to overly toxic doses. The allocation is determined by a loss function computed for each dose. Although this approach has been proven to be useful in practice, it also requires specification of several parameters that need to be calibrated and on which the operating characteristics of the design hinges.
Interestingly, both the BLRM and modifications to the EWOC are building upon a two‐parameter logistic model, while the problem of addressing the accuracy‐toxicity trade‐off for a one‐parameter continual reassessment method (CRM) design1 through the allocation criterion has received little attention in the dose‐finding literature. Commonly, ad hoc practical solutions are to either (i) assign patients to the dose closest but below the estimated MTD dose or (ii) to assign patients to a lower target probability (eg, 0.25 when seeking 0.33). It is clear that both of these approaches will result in underestimation of the MTD. Moreover, such approaches are inflexible in terms of the accuracy‐toxicity trade‐off that can be achieved. Consequently, a more statistically solid approach is required.
In this work, we propose a new criterion for the allocation of patients in dose‐escalation trials, which is based on recent developments in the estimation of the restricted parameter space9 and provides a statistical basis for the inclusion of safety concerns into the decision making. The proposed criterion controls the trade‐off between the uncertainty in estimates and the conservatism of an investigator (in terms of the mean number of toxic responses) and requires only one additional parameter, which has a simple and intuitive interpretation, to be specified. As it is generally agreed that model‐based phase I designs lead to improved operating characteristics than rule‐based alternatives,10 we incorporate the proposed criterion into the Bayesian CRM1 that uses a one‐parameter power model and compare its operating characteristics with the traditional one‐parameter power model CRM design, the EWOC design and its recent modifications, and the BLRM design. We show the proposed design for particular levels of the trade‐off parameter can achieve either (i) the same average accuracy as many of currently employed methods but with fewer patients experiencing adverse events or (ii) obtain higher average accuracy than EWOC but the same number of adverse events. We show that the proposed design offers a more ethical alternative to these methods as it offers a better accuracy‐toxicity trade‐off.
The remainder of the article proceeds as follows. The new criterion and its properties are studied in Section 2. The application of the novel criterion in the context of an actual clinical trial is considered in Section 3. A simulation comparison with the traditional CRM is given in Section 4. A comparison with a range of alternative approaches is given in Section 5. Section 6 concludes with a discussion.
2 METHODS
2.1 Criterion
It was found that using this divergence as a loss function in several classic Bayesian problems with parameters defined on the unit interval can provide benefits in terms of the accuracy of the estimation.9 In this work, we propose to use a generalization of this divergence to govern dose selection in a dose‐finding study. Note that the measure 5 takes its minimum value, δ(·)=0, at p=γ. Due to the denominator, if p=0 or p=1, then δ(·)=∞, meaning that patients would be never allocated to doses corresponding to 0 or 1 DLT probabilities. Indeed, the property of assigning of infinite values to the extreme values “drives away” the selection from the bounds to the neighborhood of the interval of interest γ. 13, 14 Importantly, the criterion 5 also has an information‐theoretic justification as it maximizes the asymptotic information gain in the trial with a special interest in the neighborhood of the maximum acceptable toxicity.15
The target toxicity γ is always less than 0.5 in phase I clinical trials. Consequently, if one would consider two‐point estimates that stand on the same squared distance (γ−θ)2 from the γ (for θ<γ), the criterion 5 favors a higher probability estimate due to the variance term in the denominator, which is maximized at the point p=0.5. Indeed, the same rate of terms p and (1−p) in the denominator implies that overly toxic and overly safe doses are equally penalized. As this, however, contradicts the ethical concerns of dose‐escalation trials, we propose a generalization of the measure, allowing for asymmetric penalization of doses below and above the target toxicity.
2.2 Asymmetry parameter

The CIBP criterion (for both a=1 and a=0.5) goes to infinity faster than the squared distance as the probability, p, approaches the lower bound. At the same time, for a=1, overly toxic doses are penalized less than by alternatives because corresponding values of the toxicity probability are located far from another boundary value 1. The asymmetric CIBP criterion with a=0.5 solves this issue and penalizes overly toxic doses more severely than both the squared distance and the symmetric CIBP. Note that all criteria behave similarly in the neighborhood of the target γ. Overall, one can see that the properties of the proposed criterion allow resolving the ethical concern by setting an appropriate value of the parameter a. Further guideline on the choice of a is given in the following section.
2.3 Choice of the asymmetry parameter

As θ→0 (the uncertainty issue is prioritized), a tends to 2γ, which corresponds to the squared distance allocation rule as shown above. Increasing values of θ correspond to a wider interval in which an investigator prefers a lower toxicity estimate. Consequently, this corresponds to a more conservative allocation and to smaller values of a. Note that a corresponding to θ≈γ guarantees that for two estimates standing on the same squared distance from the target γ, the dose corresponding to the lower toxicity estimate would be always selected. For example, for the target value γ=0.25 and the half‐width θ=0.245, the corresponding value of a is close to 0.3 (marked by the dotted horizontal line in Figure 2).
In the next section, we recall the Bayesian CRM1 and incorporate the proposed allocation criterion in the design.
2.4 Bayesian CRM with new allocation criterion
- Specify the prior distribution f0, skeleton, the asymmetry parameter a and assign the first cohort of patients to the lowest dose.
- After the responses for j patients were observed, update the prior distribution, fj, and assign the next cohort of patients to the dose dk minimizing
(12)among all d1,…,dm, where the expectation is found with respect to the posterior probability fj(β).
- Repeated until the maximum number of patients, n, has been treated. As the uncertainty and the conservatism is important in the allocation only, the squared distance 11 is used for the final MTD selection.
Finally, it is worth to mention that many implementations of the CRM plug the mean value of β in the model instead of using the mean value, . Although no noticeable difference is found in these approaches if a one‐parameter model is used,17 it might affect the results significantly if more complex functions are considered.18 Therefore, the posterior mean of the new criterion 12 is used. For consistency across all designs, we use the mean probability estimate while performing the original CRM design.
3 APPLICATION TO AN ACTUAL CLINICAL TRIAL
3.1 Setting
- Daily dosing of Everolimus 5 mg plus PT (d1).
- Daily dosing of Everolimus 10 mg plus PT (d2).
- Weekly dosing of Everolimus 30 mg plus PT (d3).
The goal is to find the regimen corresponding to the target toxicity γ=0.3. Note that the amount of the complimentary drugs is fixed during the trial and a clinician is confident in the monotonic relationship of toxicity probabilities for d1,…,d3. Thus, the trial can be analyzed using the tools for the single‐agent trials. The aggregated data available by the end of the trial consists of six, 17, and 10 patients being assigned to doses d1,d2,d3, respectively, out of which three, six, and seven experienced DLTs. We revisit the results of this trial using the novel allocation criterion.
We apply the CRM design using the one‐parameter power model 13 by using the robust operational prior distribution 5, 19 and the skeleton (0.20,0.30,0.40) with an adequate spacing20 and implying that the prior MTD is d2. We restrict the design so that the dose skipping is not allowed and enforce starting from the lowest dose. Patients are enrolled in cohorts of three. Note that the parameters of the design are the same for both the original CRM and the CRM using the novel allocation rule. The only difference is the criterion for the selection of doses. The original CRM uses the squared distance 11, whereas the CIBP design uses the criterion 12. Following the interpretation of the asymmetry parameter, we fix a=0.3 to favor less toxic selections in a wide interval of toxicity probabilities. The designs are implemented using the interactive functions of the bcrm‐package.21 We use the aggregated data to generate the responses in one realization of the trial. Clearly, DLTs indicated can appear in any sequence. Therefore, we generate a random sample (without replacement) for each dose to have a specific order of DLTs. We fix this order for both trials. The only exception is that the realization for the first cohort is chosen by us. We consider the influence of this choice later.
3.2 Illustration
The first three patients are assigned to d1 by construction. We begin by assuming that all three patients have not experienced DLTs. The sequential dose selections for the CRM and CIBP designs, in this case, are given in Figure 3. The values of the criteria after each cohort are given in Table 1.

After no DLTs were observed for the first cohort, the criteria used by CRM and CIBP are both maximized for dose d3. However, due to the no‐skipping dose restriction, the designs allocate the second cohort of patients to d2 for which one patient experiences a DLT. Given this toxic outcome, CRM recommends staying at d2 for the third cohort. By contrast, CIBP recommends returning to the previous dose level due to the conservatism of the criterion. Then, after all patients in cohort 3 (using CIBP) experienced the DLT, the trial would be terminated by a clinician due to safety. At the same time, the trial using the original CRM design proceeds. After no DLTs were observed for cohort 3, d3 is recommended for cohort 4 in which all patients have DLT. This leads to deescalation to d2 and after 2 cohorts for which 3 patients (out of 6) had DLT and further de‐escalation to d1. All 3 patients in cohort 7 experienced DLTs and a clinician terminates a trial due to toxicity. Overall, while the CRM assigned 21 patients and 10 of them experienced DLTs to come to the same conclusion as CIBP, the novel criterion allows reducing the sample size to 9 patients with 4 toxicity outcomes only.
The illustration above demonstrates the allocation if no toxicity outcomes are observed in cohort 1, but other possibilities should be considered as well based on aggregated data. Clearly, the other possibilities are one, two, and three DLTs in the first cohort. Considering these scenarios, it was found that both designs lead to the same allocation of patients and never escalate from dose d1. It follows that the novel allocation rule leads to the same MTD selection in all possible sequence of outcomes, but results in a fewer or similar number of toxic responses. This motivates a further investigation of the novel criterion in a comprehensive simulation study.
4 COMPARISON WITH THE ORIGINAL CRM
4.1 Setting
In this section, we compare the traditional one‐parameter CRM to the proposed design in a simulation study. Since the proposed design (Section 2.4) differs from the traditional one‐parameter CRM in the allocation criterion only, any differences in the performances are due to the proposed allocation criterion. We provide a broader comparison with other methods in Section 5.

We consider different values of a={0.3,0.4,0.5} corresponding to the approximate half‐width of the intervals θ≈{0.25,0.20,0.00}. The highest value of a is chosen to be 2γ, as it is deduced in Section 2.3 that this value corresponds to nearly the same allocation as the CRM and values of a below 2γ are expected to result in a more conservative allocation than the CRM—the effect that the proposed design aims to achieve. We denote the CRM with the new escalation criterion using parameter a by CIBP(a). The characteristics of all the models compared are evaluated in R26 using the bcrm‐package.21 To accommodate the new criterion, the corresponding modifications to the package were made.
4.2 Operating characteristics
Proportions of each dose selections, accuracy index in each scenario, and the average number of patients experienced a DLT for CRM and CIBP designs are given in Table 2. We use 40 000 simulations to declare any difference above 1% as a significant one.
Comparing the performance of CIBP for different values of the asymmetry parameter, one can see that more conservative allocation and selection proportions correspond to CIBP(0.3). The greatest difference can be seen in scenarios 1 and 6. The increase in a from 0.3 to 0.5 leads to an increase in the PCS by 5% in the toxic scenario 1 and a decrease in the PCS by 3.5% in the flat scenario 6. This is also reflected in the accuracy indices: CIBP(0.3) corresponds to the highest accuracy index under the toxic scenario 1, whereas the CIBP(0.5) results in the lowest accuracy. In the flat scenario 6, CIBP(0.3) results in the lowest accuracy index with the CIBP(0.5) having the highest accuracy. The differences in the rest of the scenarios are smaller, but still significant. Overall, greater values of a favor higher doses to be selected, leading to a higher accuracy index in safe scenarios and a higher average number of toxic responses.
Regarding the comparison of CIBP and CRM, one can find that CIBP(0.4) has a similar PCS and accuracy index, but also a smaller proportion of toxic responses in all considered scenarios. The CIBP(0.5) performs similar (scenarios 2‐5) or better (scenario 6) than CRM at the cost of 1% decrease in the PCS in scenario 1. The most noticeable difference can be observed by comparing CRM with CIBP(0.3). In terms of the PCS and accuracy, CIBP(0.3) outperforms the CRM in the most toxic scenarios 1 and 2, shows comparable performance in scenarios 3 to 5, and results in a lower accuracy index (0.79 against 81 for the CRM) in scenario 6. Interestingly, the accuracy index shades more light on the difference in performance; although the PCS in scenario 6 differs by less than 1%, the CRM tends to allocate more patients to d5 (difference of 2%). This results in a lower accuracy index for CIBP (0.3) At the same time, CIBP(0.3) outperforms the CRM in terms of the average number of toxic responses in all considered scenario. Although the margin of the difference might be seen to be negligibly small, this improvement results in nearly one fewer patient experiencing a DLT in all scenarios except scenario 1, where the difference is 0.5. Concerning the overall average performances (given in Supporting Information together with the graphical representation of the results), the CIBP with parameters a=0.3,0.4,0.5 results in the (geometric) mean of accuracy indices of 0.74, whereas the CRM results in 0.73. At the same time, CIBP designs results in 6.24, 6.68, and 7.12 average DLT response across the scenarios against 7.03 by the CRM. Therefore, CIBP(0.3) and CIBP(0.4) can be considered as a more ethical alternative to the CRM, as it exposes fewer patients to more toxic doses while leading to a slightly greater mean accuracy.
Another valuable feature of the novel allocation criterion is the additional flexibility that allows controlling the number of toxic responses directly. A clinician can choose the parameter a based on their conservatism and the range of scenarios of interest. For instance, a clinician might be ready to sacrifice the PCS in the flat scenario 6 for the sake of not selecting overly toxic dose in scenario 1. The new criterion enables such an option. At the same time, the design preserves its simplicity and does not result in any extra computational costs.
5 COMPARISON WITH ALTERNATIVE APPROACHES
5.1 Setting
We consider the setting by Wheeler et al7 for discrete dose levels. There are n=40 patients and m=6 doses in the trial. The goal is to find the MTD corresponding to the γ=0.33. The original scenarios are given in Figure 5.

For the CRM and CIBP designs, the prior distribution of β is specified as in the previous section. The only difference is the skeleton, which is now set using the same information as by Wheeler et al7: the prior MTD is d3. Assuming that ethical issues are of the greater interest in this trial, we consider several values of a<2γ=0.66 that would result in a more conservative allocation of patients than the CRM, namely, a={0.65,0.5,0.25,0.10}.
- EWOC design—the original EWOC design using fixed α=0.25.
- TR design6 (where TR stands for the authors of the proposed approach, Tighiouart and Rogatko) using α2=…=α9=0.25, for all future patients.
- Toxicity‐dependent feasibility bound design (TDFB)7 using
where is the number of patients with no DLTS, αmin=0.25 and . For both modifications of the EWOC design above, we use the parameters as in Reference 7.
- BLRM design8 which uses the loss function for the decision. Following the original proposal, we use the same bivariate normal prior distribution for parameters as in the original work and adapt the toxicity intervals for the loss function for γ=0.33
- This modification of the CRM uses the squared distance criterion but allows for the allocation to the doses with the estimated toxicity strictly below the target toxicity γ. We will denote this model as “CRM (M)” where M stands for the modified.
As in the previous section, we are primarily interested in the accuracy‐toxicity trade‐off for each method. Therefore, we study the performance of the designs in terms of (i) the PCS, (ii) the accuracy index, and (iii) the average number of DLTs experienced by the patients in one trial. Again, we will focus on the average performances across the scenarios.
5.2 Operating characteristics
The accuracy indicies and the PCS for the CRM, CRM (M), CIBP using a={0.65,0.5,0.25,0.10}, TDFB, EWOC, TR, and BLRM design in the individual scenarios and corresponding means are given in Figures 6 and 7, respectively. As mentioned above, the accuracy indices and PCS themselves are of a limited interest when considered alone without taking the safety into account. To reflect on the accuracy‐toxicity trade‐off, the mean number of DLTs in all considered designs is shown in Figure 8.



Comparing CIBP for different values of a, one can see that both mean accuracy index and the PCS decrease as parameter a increases. Due to a more conservative allocation for smaller values of a, fewer patients are assigned to doses in the neighborhood of the MTD. The decrease in the accuracy index and PCS is, however, rather small—from 0.88 and 0.72 using a=0.65 to 0.83 and 0.68 using a=0.1, respectively. The most noticeable drop across scenarios can be found in scenario 5—0.20. The variance of the accuracy indexes increases with decreasing a—the more conservative designs lead to a better performance in toxic scenarios 1 to 3 for the cost of a less accurate performance in flat scenarios 8 to 10. Regarding the safety of the CIBP designs, lower values of a result in fewer DLTs on average across all scenarios. As a result, the mean number of DLTs is decreased by approximately two toxic responses comparing CIBP(0.65) and CIBP(0.10).
Comparing different approaches, we will start by presenting the results in the groups of the methods that result in similar characteristics in terms of one of the accuracy measures (accuracy index or PCS) and highlight the difference in other measures. This will allow to focus on the accuracy‐toxicity trade‐off of the methods. First, CRM and CIBP(0.65) correspond to the highest mean accuracy index and mean PCS and the same mean number of DLTs across the scenarios. Therefore, these two designs perform comparably in terms of the accuracy‐toxicity trade‐off. At the same time, CIBP being a more flexible design allows to tackle the balance between accuracy and safety. Specifically, by decreasing the asymmetry parameter to a=0.5, one can decrease the mean number of DTLs by nearly 0.35 for the price of 1.5% decrease in the accuracy index and 0.6% decrease in the mean PCS.
The TR, BLRM, and CIBP(0.5) result in a comparable but slightly lower mean accuracy index than the CRM. The TR design results in nearly 5% lower mean PCS than BLRM and CIBP(0.5). At the same time, CIBP(0.5) results in a lower mean number of DLTs across scenarios than both of these methods: 12.4 against nearly 13 for the TR and BLRM. Consequently, CIBP(0.5) can improve the safety of the TR and BLRM while not compromising (or even improving) the accuracy of selection.
The CRM(M), TDFB, and CIBP(0.25) perform similarly in terms of the mean accuracy index, but CIBP(0.25) results in a slightly higher (by 3%) PCS than the corresponding comparators. In terms of the safety of these methods, CIBP(0.25) and CRM(M) results in the lowest mean number of DLTs with the difference of nearly 1 more toxic response compared with the TDFB. This means that CIBP(0.25) has nearly the same safety properties as CRM(M) but can improve accuracy while improving both of these factors over TDFB.
Finally, we compare two of the remaining designs, EWOC and CIBP(0.1). Both of these designs result in the lowest (and similar) average number of DLTs across the scenarios. At the same time, the EWOC results in the least mean accuracy index due to the large MTD underestimation in scenarios 5 to 7. The mean accuracy index and the PCS associated with the most conservative CIBP(0.10) are greater than those associated with EWOC by 0.15 and 0.16, respectively. Consequently, the proposed design with parameter a=0.1 retains the safety properties of the EWOC but results in a more accurate MTD selection.
Overall, it is found that the original CRM results in the highest accuracy index comparable with a number of other approaches. At the same time, it results in a high average number of toxic responses, but its original version does not allow to incorporate the conservatism of an investigator in the allocation rule. By restricting the allocation rule of the original CRM to select doses that are strictly below γ can improve the safety of the method (for the price in the accuracy), but a better accuracy‐toxicity balance could be achieved using the CIBP design. Generally, the proposed CIBP criterion allows tuning the trade‐off between accuracy and safety explicitly. At the same time, in contrast to TDFB and TR, the proposed design does not change the parameter of conservatism as more patients are trialed. It requires only one extra parameter to be specified. However, one can find a value of parameter a that would lead to a better trade‐off between the accuracy of the MTD selection and the safety, specifically, by improving one aspect while not compromising another.
6 DISCUSSION
A novel dose‐escalation criterion for the allocation of patients is introduced in this work. The criterion requires only one additional parameter, the parameter of asymmetry, which has clear intuitive interpretation and can be easily tuned according to the purposes of the investigator to achieve a desirable trade‐off between accuracy and the number of patients experiencing DLTs. It is found that incorporated into the one‐parameter power Bayesian CRM design, the new criterion, for particular values of the asymmetry parameter, results in nearly the same performance as the original CRM with the squared distance criterion 11 in the terms of the accuracy and safety, but for lower values of the parameter can achieve better accuracy‐toxicity trade‐offs than a number of dose‐finding designs currently used in practice. Specifically, compared with the EWOC, its different variations and two‐parameter BLRM design, it is found that the CIBP design for various values of the parameter leads to either (i) nearly the same average accuracy index and/or PCS but fewer average number of DLTs (comparing CIBP(0.5) with TDFB, TR, and BLRM), or (ii) higher average accuracy and/or PCS but the same number of DLTs (comparing CIBP(0.1) with EWOC). Concerning the modified CRM criterion using the truncated squared distance, the proposed criterion is found to result in the same accuracy index, higher average PCS, and slightly fewer number of DLTs. This makes the proposed design a more ethically viable alternative over some of the existing methods considered in this work. Finally, the asymmetry parameter allows balancing of the characteristics within a unified framework, so the desirable accuracy‐toxicity trade‐off can be achieved within one dose‐escalation method rather than using different methods with each requiring independent calibration.
One of the crucial parts of the proposed criterion is the asymmetry parameter. In the actual trial, this parameter can be chosen together with clinicians using the parameter's interpretation (Section 2.3) and considering the results of the simulation study (in clinically feasible scenarios). Moreover, a recently published tool to calibrate the parameters of the CRM, namely, dose transition pathways (DTP)27 can be used. This tool can also accommodate the inclusion of the proposed parameter: the clinicians are provided with the decision trees obtained by DTP for various values of the parameter a and the parameter is calibrated until the escalation/deescalation rules are consistent with clinicians' knowledge.
In small‐scale settings, the absolute distance will penalize more than the squared distance. This implies that for the same values of a one can expect that the criterion 14 would favor the accuracy of the design rather than safety. Nevertheless, it was found that the proposed criterion and the criterion 14 result in the same recommendation in most situations. Simulations in the setting of Section 5 are given in the Supporting Information.
It is important to reiterate that we focused on the application of the novel criterion to the one‐parameter power model throughout this work, as it has been shown5, 17, 19, 28 to be able to identify the MTD with a high probability. Note, however, that the proposed criterion is generic and can be applied to any parametric model (for instance, the two‐parameter logistic model) if it is preferred by an investigator. Moreover, the application of the criterion was demonstrated in the context of a single‐agent trial only. As there are generalizations of the CRM design for more complex studies, it is also of interest to consider the application of the novel allocation rule to dose‐combination and dose‐schedule trials including the case of delayed toxicity responses.
ACKNOWLEDGEMENTS
The authors acknowledge the insightful and constructive comments that were made by the Associate Editor and two reviewers. These comments have greatly helped us to improve the original submission. This project has received funding from the European Union's Horizon 2020 Research and Innovation Programme under the Marie Sklodowska‐Curie grant agreement No 633567 and by Prof Jaki's Senior Research Fellowship (NIHR‐SRF‐2015‐08‐001) supported by the National Institute for Health Research. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health.
| CRM | CIBP | ||||||
|---|---|---|---|---|---|---|---|
| Cohort 1 | 0.031 | 0.012 | 0.002 | 0.62 | 0.47 | 0.45 | |
| Cohort 2 | 0.01 | 0.001 | 0.005 | 0.10 | 0.12 | 0.21 | |
| Cohort 3 | 0.03 | 0.013 | 0.001 | 0.30 | 0.67 | 1.41 | |
| Cohort 4 | 0.003 | 0.002 | 0.003 | ||||
| Cohort 5 | 0.003 | 0.002 | 0.024 | ||||
| Cohort 6 | 0.000 | 0.009 | 0.039 | ||||
| Cohort 7 | 0.014 | 0.048 | 0.096 | ||||
- Note: The value of criterion corresponding to the dose selected for next cohort by each design is in bold.
- Abbreviations: CIBP, convex infinite bounds penalization; CRM, continual reassessment method
| d1 | d2 | d3 | d4 | d5 | d6 | Accuracy | DLTs | |
|---|---|---|---|---|---|---|---|---|
| Scenario 1 | ||||||||
| Toxicity | 25.00 | 35.00 | 37.50 | 40.00 | 45.00 | 50.00 | ||
| CIBP(0.3) | 69.18 | 21.65 | 6.18 | 2.27 | 0.61 | 0.11 | 0.84 | 8.49 |
| CIBP(0.4) | 66.40 | 22.20 | 7.25 | 3.08 | 0.91 | 0.16 | 0.82 | 8.84 |
| CIBP(0.5) | 64.12 | 22.25 | 8.49 | 3.80 | 1.15 | 0.18 | 0.80 | 9.17 |
| CRM | 65.59 | 21.16 | 8.22 | 3.79 | 1.07 | 0.17 | 0.81 | 9.05 |
| Scenario 2 | ||||||||
| Toxicity | 15.00 | 25.00 | 35.00 | 40.00 | 45.00 | 50.00 | ||
| CIBP(0.3) | 24.06 | 47.88 | 21.98 | 5.00 | 0.93 | 0.15 | 0.74 | 7.00 |
| CIBP(0.4) | 24.00 | 46.95 | 22.28 | 5.39 | 1.19 | 0.19 | 0.73 | 7.46 |
| CIBP(0.5) | 23.97 | 46.12 | 22.20 | 6.02 | 1.46 | 0.24 | 0.72 | 7.94 |
| CRM | 25.41 | 45.76 | 21.36 | 5.96 | 1.27 | 0.24 | 0.72 | 7.83 |
| Scenario 3 | ||||||||
| Toxicity | 10.00 | 15.00 | 25.00 | 35.00 | 45.00 | 50.00 | ||
| CIBP(0.3) | 4.26 | 25.61 | 46.48 | 20.20 | 3.16 | 0.28 | 0.71 | 6.27 |
| CIBP(0.4) | 4.08 | 25.53 | 46.25 | 20.57 | 3.23 | 0.34 | 0.71 | 6.77 |
| CIBP(0.5) | 3.77 | 25.64 | 46.49 | 20.46 | 3.31 | 0.32 | 0.71 | 7.26 |
| CRM | 3.91 | 26.66 | 45.62 | 20.37 | 3.06 | 0.37 | 0.71 | 7.19 |
| Scenario 4 | ||||||||
| Toxicity | 5.00 | 10.00 | 15.00 | 25.00 | 35.00 | 45.00 | ||
| CIBP(0.3) | 0.22 | 5.01 | 27.27 | 44.62 | 19.65 | 3.23 | 0.65 | 5.81 |
| CIBP(0.4) | 0.17 | 4.78 | 26.64 | 45.66 | 19.59 | 3.15 | 0.66 | 6.30 |
| CIBP(0.5) | 0.18 | 4.74 | 27.16 | 45.74 | 19.36 | 2.83 | 0.66 | 6.73 |
| CRM | 0.18 | 4.50 | 27.82 | 45.32 | 19.15 | 3.03 | 0.66 | 6.73 |
| Scenario 5 | ||||||||
| Toxicity | 2.50 | 5.00 | 10.00 | 15.00 | 25.00 | 35.00 | ||
| CIBP(0.3) | 0.00 | 0.34 | 6.54 | 27.67 | 43.34 | 22.11 | 0.70 | 5.31 |
| CIBP(0.4) | 0.00 | 0.31 | 5.89 | 27.77 | 44.12 | 21.89 | 0.71 | 5.77 |
| CIBP(0.5) | 0.00 | 0.33 | 5.50 | 28.06 | 44.84 | 21.28 | 0.72 | 6.22 |
| CRM | 0.01 | 0.27 | 5.46 | 28.89 | 44.10 | 21.28 | 0.71 | 6.17 |
| Scenario 6 | ||||||||
| Toxicity | 1.50 | 2.50 | 7.50 | 10.00 | 15.00 | 25.00 | ||
| CIBP(0.3) | 0.00 | 0.04 | 2.97 | 10.42 | 26.84 | 59.72 | 0.79 | 4.58 |
| CIBP(0.4) | 0.00 | 0.05 | 2.30 | 9.55 | 27.53 | 60.58 | 0.80 | 4.96 |
| CIBP(0.5) | 0.00 | 0.04 | 1.68 | 7.31 | 27.71 | 63.26 | 0.82 | 5.39 |
| CRM | 0.00 | 0.05 | 1.88 | 8.65 | 28.89 | 60.53 | 0.81 | 5.20 |
- Note: Results are based on 40 000 simulations.
References
Citing Literature
Number of times cited according to CrossRef: 1
- Pavel Mozgunov, Thomas Jaki, An information theoretic approach for selecting arms in clinical trials, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12391, 0, 0, (2020).




