Volume 196, Issue 5 pp. 1274-1277
Correspondence
Free Access

Synthetic control arms in studies of multiple myeloma and diffuse large B-cell lymphoma

Rahul Banerjee

Rahul Banerjee

Division of Hematology/Oncology, Department of Medicine, University of California San Francisco, San Francisco, CA, USA

Search for more papers by this author
Shonali Midha

Shonali Midha

Division of Myeloma, Department of Medical Oncology, Dana-Farber Cancer Institute, USA

Search for more papers by this author
Amar H. Kelkar

Amar H. Kelkar

Division of Transplantation and Cellular Therapies, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA

Search for more papers by this author
Aaron Goodman

Aaron Goodman

University of California San Diego, San Diego, CA, USA

Search for more papers by this author
Vinay Prasad

Vinay Prasad

Division of Hematology/Oncology, Department of Medicine, University of California San Francisco, San Francisco, CA, USA

Search for more papers by this author
Ghulam Rehman Mohyuddin

Corresponding Author

Ghulam Rehman Mohyuddin

University of Utah, Salt Lake City, UT, USA

Search for more papers by this author
First published: 08 November 2021
Citations: 4

Within the past year, several publications have compared the outcomes of clinical trials in relapsed/refractory (R/R) diffuse large B-cell lymphoma (DLBCL) or R/R multiple myeloma (MM) against retrospective real-world datasets.1-4 Two such datasets include the SCHOLAR-1 cohort in DLBCL (comprising 636 patients analysed between 2001 and 2014) and the MAMMOTH cohort in MM (comprising 275 patients analysed between 2017 and 2018).5, 6 When these observational datasets are compared against single-arm trials in such a manner, their use has been described as constituting a ‘synthetic control’ arm.7, 8

However, synthetic control arms may be inadequate substitutes for true control arms generated via randomised controlled trials (RCTs). Specific critiques of SCHOLAR-1 and MAMMOTH as synthetic controls involve differences in patient eligibility due to (i) slight but significant variations in definitions of R/R status and (ii) general exclusion of unfit patients from trials versus their general inclusion in observational datasets.9, 10 Furthermore, the generalisability of historical observational results as controls for more recent trials is uncertain. Lastly, whether potential conflict of interests exist in these analyses has not been described.

We thus characterised all studies using SCHOLAR-1 or MAMMOTH as synthetic controls by comprehensively reviewing relevant publications or abstracts available on PubMed or Google Scholar before the data cut-off of 30 September 2021. As shown in Fig 1, a total of four studies in DLBCL3, 4, 11, 12 and five studies in MM1, 2, 13-15 were included in our present analysis. These included four peer-reviewed publications and five conference abstracts (Table I).1-4

Details are in the caption following the image
Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram. DLBCL, diffuse large B-cell lymphoma; MM, multiple myeloma.
Table I. Characteristics of included studies.
Novel agent in DLBCL Enrolment for trial of novel agent R/R status differences versus SCHOLAR-1 Adjustment using IPD Response metrics Survival metrics Potential conflicts of interest
Axicabtagene ciloleucel11 2015–2016 None Propensity score matching ORR improved 6-month OS improved Authors include employees of manufacturer
Axicabtagene ciloleucel3 2015–2016 None Propensity score matching ORR improved 2-year OS improved Funding and writing by manufacturer
Lisocabtagene maraleucel4 2016–2019 None, after eligibility adjustment Matching-adjusted indirect comparisons ORR improved 3-year OS improved Funding and writing by manufacturer
Tisagenlecleucel12 2015–2017 PD after first-line therapy not allowed in JULIET5, 16 Restricted mean survival times Not reported No ‘clinically relevant’ improvement None reported
Novel agent in MM Enrolment for trial of novel agent R/R status differences versus MAMMOTH Adjustment using IPD Response metrics Survival metrics Potential conflicts of interest
Belantamab mafodotin13 2018–2019 Treatment intolerance allowable in DREAMM-26, 20 Matching-adjusted indirect comparisons ORR not improved OS but not PFS improved Authors include employees of manufacturer
Ciltacabtagene autoleucel2 2018–2019 CD38 mAb exposure (alone) allowable in CARTITUDE-16, 19 Propensity score matching ORR improved 12-month PFS and OS improved Authors include employees of manufacturer
Idecabtagene vicleucel14 2017–2018 None, after eligibility adjustment Matching-adjusted indirect comparisons ORR improved PFS and OS improved Authors include employees of manufacturer
Melflufen15 2016–2019 None, after eligibility adjustment None reported ORR not improved OS but not PFS improved Authors include employees of manufacturer
Selinexor1 2015–2018 None, after eligibility adjustment None reported ORR not improved OS improved Funding and writing by manufacturer
  • CD38 mAb, cluster of differentiation 38-directed monoclonal antibody (daratumumab or isatuximab); IPD, individual patient data; ORR, objective response rate; OS, overall survival; PD, progressive disease; PFS, progression-free survival; R/R, relapsed or refractory.
  • * When specific dates were not provided, these were derived from related publications of the novel agent.
  • Defined as any listed contribution from the pharmaceutical manufacturer of the novel agent.
  • While Cox regression was performed, no specific techniques using IPD were employed to account for heterogeneity in underlying patient populations.

In DLBCL, the novel-agent arm in all four studies enrolled patients during more recent intervals than the SCHOLAR-1 dataset.3, 4, 11, 12 All four studies adjusted for key potential confounders using individual patient data. However, one study did not adjust for differences in the specific definitions of R/R status in SCHOLAR-1 versus the JULIET trial of tisagenlecleucel.5, 12, 16 Similarly, one study did not adjust for slight differences in clinical response definitions between the TRANSCEND trial of lisocabtagene maraleucel (Lugano criteria) versus SCHOLAR-1 (original International Working Group 2007 criteria).4, 17, 18

Three of four DLBCL studies reported improved overall survival (OS) with the novel agent.3, 4, 11 In the fourth study, the authors acknowledged a statistical difference in OS but did not report further details because they felt that this incremental improvement was clinically insignificant.12 Of note, this negative study did not report any potential conflicts of interest. Among the three studies that showed a benefit to the novel agent, all three reported potential conflicts of interest such as funding or medical-writing assistance by the manufacturer of the novel agent.

In MM, the novel agent arms in all five studies were contemporaneous with the MAMMOTH dataset and analyzed similar patient cohorts.1, 2, 13-15 However, unlike the MAMMOTH dataset, the DREAMM-2 trial of belantamab mafodotin included treatment intolerance as a determinant of R/R status.6, 19 This variation in the definition of R/R status was not commented upon in the corresponding synthetic control analysis.13 Finally, while two studies used regression modeling to compare arms, they did not report any statistical adjustments based on individual patient data.

All five studies in MM reported significantly improved OS with the novel agents. Two studies noted a comparative benefit in OS but no corresponding benefit in progression-free survival.13, 15 Similarly, three studies did not find any significant improvement in overall response rates with the novel agent.1, 13, 15 All five studies reported potential conflicts of interest, most typically in the form of authorship by employees of the manufacturer of the novel agent.

In our present review of studies using the SCHOLAR-1 and MAMMOTH datasets as synthetic control arms, we thus noted several potential items of concern. Firstly, several studies (primarily in DLBCL) compared more recent trials against older control arms. Secondly, statistical adjustments using individual patient data were employed in most but not all studies. Thirdly, most studies reported potential conflicts of interest with the pharmaceutical manufacturers of the novel agent being analysed. In contrast, the only study without any conflicts of interest was also the only study to determine that the novel agent in question did not meaningfully improve OS.12

In addition to these specific issues raised by our present study, broader issues with synthetic control arms include the presence of unmeasured confounders. For example, patients with significant comorbidities are generally included in registries but excluded from trials.9 Similarly, patients in registries only need to be alive at the time of eligibility; in contrast, clinical trials only characterise patients who survived until treatment initiation. This, in turn, may select for more indolent disease among clinical trial participants versus synthetic controls.20 Landmark analyses can potentially correct for these differences in ‘time zero’ but were not performed in any of the studies we identified. Lastly, performing multiple post hoc comparisons against the same static dataset can raise the risk of generating false-positive signals.21, 22

Limitations of our present study include our specific emphases on SCHOLAR-1 and MAMMOTH and not any other observational datasets. We were not able to determine how much study results were meaningfully influenced (if at all) by differences in years of enrolment, R/R status, or the presence of potential conflicts of interest. Lastly, regarding the ZUMA-1 trial of axicabtagene ciloleucel, the use of SCHOLAR-1 as a synthetic control may have been pre-planned in ways that we are unable to fully characterise.23

Nevertheless, when true RCTs are not feasible, we suggest that appropriate steps be taken to ensure the validity of synthetic control arms. For example, as is being done with the observational KarMMa-RW dataset versus the KarMMa trial of idecabtagene vicleucel,24 a contemporaneous study-specific control arm can be compiled prospectively (even if not randomised) to reflect the changing status quo more accurately for standard-of-care therapies. Additionally, observational datasets should be governed by neutral co-operative groups and specialty societies to encourage pre-planned comparisons and to minimise conflicts of interest.

In summary, our present findings highlight several problems with the use of observational datasets as synthetic control arms in malignant haematology. While these analyses may provide additional context regarding novel therapeutic agents, they cannot serve as replacements for RCTs.

Author contributions

Aaron Goodman, Vinay Prasad, and Ghulam Rehman Mohyuddin conceptualised the study. Rahul Banerjee, Shonali Midha, and Amar H. Kelkar performed the research. All authors analysed the data and contributed substantially to writing the manuscript.

Conflict of interest

Rahul Banerjee reports the following disclosures: Guidepoint Global (consulting), Pack Health (research), Sanofi (consulting), SparkCures (consulting). Aaron Goodman reports the following disclosures: EUSA Pharma (consulting), Seattle Genetics (consulting). Vinay Prasad reports the following disclosures: Arnold Ventures (research), Johns Hopkins Press (royalties), United Healthcare (consulting), Evicore (honoraria). The remaining authors do not have any conflicts of interest to report.

  • [Correction added on 13 January 2022, after first online publication: The data and reference citations in Paragraph 6 were corrected and reference 19 was deleted in this version.]

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.