Volume 92, Issue 12 pp. 3246-3253
RESEARCH ARTICLE
Full Access

Accuracy of quantitative HIV-1 RNA test methods at 1000 copies/mL and the potential impact of differences in assay calibration on therapy monitoring of patients

Nico Lelie

Corresponding Author

Nico Lelie

Lelie Research, Alkmaar, The Netherlands

Biologicals Quality Control (BioQControl), Heiloo, The Netherlands

Correspondence Nico Lelie, Lelie Research, Parkstraat 2, 1811 DK Alkmaar, The Netherlands.

Email: [email protected]

Search for more papers by this author
Harry van Drimmelen

Harry van Drimmelen

Biologicals Quality Control (BioQControl), Heiloo, The Netherlands

Search for more papers by this author
First published: 14 April 2020
Citations: 2

Abstract

The World Health Organization (WHO) recommends the clinical use of a human immunodeficiency virus 1 (HIV-1) viral load (VL) threshold level of 1000 copies (cp)/mL in patients on antiretroviral therapy (ART) to distinguish between viral control (VL < 1000 cp/mL) and viral failure or poor adherence (VL > 1000 cp/mL). The accuracy of five quantitative HIV-1 RNA assays at this level was compared by replicate testing (n = 24) of 1000 cp/mL samples prepared from the Viral Quality Control (VQC) HIV-1 subtype B standard, which is in use for validation of nucleic acid testing methods since 1995. Until 2004 the VL assays reported geometric mean (95% confidence interval [CI]) values ranging between 449 (188-1067) and 3162 (3057-2367) cp/mL when using the Siemens bDNA 3.0 assay as reference method for an assigned value of 1000 (962-1038) cp/mL. In 2018, the following values (95% CI) were found by 24 replicate tests in each of the VL assays on the 1000 cp/mL samples: Abbott RealTime 1084 (784-1572), BioMerieux EasyQ 1110 (533-2230), Roche CAP/CTM 1277 (892-1828), Hologic Aptima 1616 (1324-1973), and Cepheid GeneXpert 2502 (1713-3655) cp/mL. Calibration studies involving three consecutive WHO replacement standards showed a significant drift in the amount of RNA copies per International Unit overtime. Heat inactivation of HIV-1 standards was found to cause a destandardizing effect. Our study underlines the limitations in HIV-1 RNA assay calibration based on frequently replaced WHO international standards. It is therefore proposed that clinicians interpret the recommended 1000 cp/mL alert level in therapy monitoring with an inaccuracy range of 500 to 2000 cp/mL.

1 INTRODUCTION

In the last decades, a World Health Organization (WHO) advisory group on HIV infection recommended guidelines for patient management based on viral load (VL) monitoring during antiretroviral therapy (ART). The WHO guideline recommends VL greater than 1000 copies (cp)/mL to be detected in two sequential samples as an indication for lack of viral control by ART, although lower concentrations might already be considered indicative of virologic failure.1 This threshold value was chosen in consideration of the wide utilization of dry blood spots (DBS) as preferred sample format for VL measurement in sub-Saharan Africa (SSA). DBS facilitates sample transport to central laboratories but reduces sensitivity and accuracy of VL measurement in the range around 1000 cp/mL.2-6 In some SSA countries, the threshold value was decreased to 400cp/mL, a level only feasible for VL detection on fresh or frozen stored plasma samples.7 However, in most SSA countries, the clinicians base their therapeutic decisions on whether or not the HIV-1 VL result is above or below 1000cp/mL according to the WHO guideline.1 It is therefore important to determine the accuracy of multiple quantitative HIV-1 RNA assays at this clinically critical level to assess the potential risk of misclassification of ART outcome.

To conduct such an assessment, a well-calibrated HIV-1 subtype B standard established in the 1990s (Viral Quality Control [VQC]; Sanquin, Amsterdam, The Netherlands) was used for preparing 1000cp/mL aliquots (ViraQ HIV-1 Quant 1000; BioQControl, The Netherlands). The VQC standard was calibrated in nucleic acid copies in multiple VL assays as well as in IUs against the first and second International Standards in two WHO collaborative studies.8, 9 Since 1995 the VQC standard was widely used in nucleic acid testing (NAT) proficiency and validation studies (mainly for estimating the analytical sensitivity of blood screening assays and estimation of the infectious window period of NAT-screened blood donations).10-14

In 2018 VQC reference samples of 1000cp/mL were distributed to five laboratories using different commercial VL assay systems. Each laboratory tested 24 samples in four test runs of six aliquots and the HIV-1 VL results on the five assays were compared. The quantitative results were also compared with those obtained on the same VQC HIV-1 subtype B standard by the quantitative assays that were in use in the late 1990s and early 2000s (at the time of introduction of the first and second WHO International Standards). Since then, even without the use of DBS, the variation in VL results reported by the present assays was larger than the expected; we tried to understand the observed differences by reviewing the calibration data in the package inserts of the HIV-1 RNA assays. Some of the assays were calibrated in IUs against the first and second native HIV-1 International Standards and others against the third WHO standard (which like the subsequent fourth WHO standard has been heat-treated before lyophilization15, 16). All manufacturers reported HIV-1 VL in cp/mL and give copy/IU conversion factors in their package insert. Some assays had been calibrated in copies using another widely used standard (VQA, NIH).17 The frequent replacement of WHO standards over the last two decades made a certain drift in the amount of HIV-1 RNA per IU inevitable; this is because of the uncertainty in calibration in the WHO collaborative studies as well as by changing the source material for preparation of the standards (introducing nucleic acid sequence differences or matrix effects by heat treatment and lyophilisation). Therefore, we also examined the impact of heat inactivation of the VQC standard in different assays and compared the data with those reported on the untreated and later heat-inactivated WHO standard batches. Finally, as the commutability of HIV-1 standards to regular clinical samples has not been well examined, we also compared the relative quantification of the native VQC and second WHO standards with the Bland-Altman agreement values reported in large number of patient samples in different VL method comparison studies.18-27 Our study underlines the limitations in calibration of VL assays based on the current system of biological standardization and discusses its impact on the interpretation of a recommended 1000cp/mL threshold value for clinical evaluation of ART efficacy.

2 MATERIALS AND METHODS

2.1 HIV-1 reference sample of 1000cp/mL

The initial study was limited to testing a single preparation of the VQC HIV-1 subtype B standard, which was diluted in gravimetrically recorded steps to 1000 cp/mL in EDTA plasma to obtain run control samples (P0327 ViraQ HIV-1 Quant 1000; BioQControl, Heiloo, The Netherlands). Each identical 1.2mL plasma aliquot was kept frozen at −80°C for less than 6 months before testing.

2.2 Quantitative HIV-RNA assays compared

In 2018, the supervisors of five diagnostic laboratories were approached for testing the P0327 ViraQ HIV-1 Quant 1000cp/mL run control samples using the following quantitative HIV-1 RNA assay systems for VL testing:
  • Abbott RealTime HIV-1 assay (Dr. Suzanne Jurriaans, University Medical Center, Amsterdam, The Netherlands)

  • Hologic Aptima HIV-1 Quant Dx Assay (Dr. David Kwa, Onze Lieve Vrouwen Gasthuis, Amsterdam, The Netherlands)

  • Roche Cobas Ampliprep/Cobas TaqMan (CAP/CTM) HIV-1 2.0 Test (Dr. Boris Hogema, Sanquin Diagnostic Services, Amsterdam, The Netherlands)

  • Cepheid GeneXpert HIV-1 VL (Dr. Josh Brady, The Doctors Laboratory, London, UK)

  • BioMerieux NucliSens EasyQ V1.2 (Dr. Jack Breuil, Centre Hospitalier Intercommunal, Villeneuve St Georges, France)

To each testing site 24 identical vials of the reference standard were shipped on dry ice. Each laboratory tested six of the HIV-1 1000cp/mL run control samples together with routine clinical samples for 4 days. Results of each test system were reported in cp/mL. The calibration in copies against different standards and copy/IU conversion factors is described in the package inserts of the manufacturers (Table S1).

2.3 Historical calibration of VQC and WHO replacement standards

Figure S1 summarizes the historical calibration data of the VQC and WHO standards used in this study. Dilutions of the S0012 VQC HIV-1 subtype B standard were calibrated in IU/mL against the first and second International Standards (97/656 and 97/650) in the first WHO collaborative study8 as well as in cp/mL by several quantitative NAT methods in proficiency and validation studies conducted between 1996 and 2004.28 Table 1 presents an overview of the quantitative results, expressed at a 1000cp/mL level, reported by the assays on this standard at that time. The quantitative results of 58 bDNA 3.0 tests were used for assigning copy numbers to the VQC standard. Accordingly, the calibration in this bDNA 3.0 assay conversion factors (95% confidence interval [CI]) of 0.39 (0.34-0.44) and 0.58 (0.51-0.66)cp/IU were established on the first and second International Standards (97/656 and 97/650) respectively by the WHO collaborative study,8 whereas an opposite shift in copy/IU values was found from 0.80 (0.69-0.92) to 0.43 (0.36-0.50) copy/IU by the NucliSens assay due to nucleic acid sequence differences between the first and second WHO standard (Table 2). Since at the time of assigning the copies and IU values to the VQC standard the second WHO (97/650) was in use, so a conversion factor of 0.58copy/IU was used going forward. The Supporting Information/Table S2 presents stability/analytical sensitivity data on the VQC HIV-1 standard over two decades. The liquid frozen S0012 HIV-1 standard was found to be completely stable when stored at −80°C, but at −30°C an annual decay of 7 (5-10)% was observed.29

Table 1. Quantitative results on VQC standard dilutions ranging from 10000 to 300000 cp/mL in proficiency and validation studies (1996-2004) translated into viral load corresponding to 1000 cp/mL measured by the bDNA 3.0 assay being used as the reference method
Assay n Geomean cp/mL (95% CI) cp/mL
Abbott LCx 18 1819 1752-1895
Chiron bDNA 1.0 13 449 188-1067
Bayer bDNA 2.0 57 1038 1000-1086
Siemens bDNA 3.0 58 1000 962-1038
Organon Teknika NucliSens 119 2295 2171-2419
Organon Teknika QT-NASBA 366 3162 3057-3267
Roche Amplicor Monitor V1.0 437 2143 2095-2181
Roche Amplicor. Monitor mixed primers 63 1457 1390-1514
Roche Amplicor Monitor V1.5 316 1295 1238-1352
Roche Amplicor Monitor Ultra 142 1181 1124-1229
  • Abbreviations: CI, confidence interval; VQC, Viral Quality Control.
Table 2. Calibration of VQC HIV-RNA subtype B standard on the first (97/656) and second (97/650) WHO HIV-1 RNA subtype B standards (containing 100000 and 363078IU/ampoule, respectively) as calculated from individual quantitative test results (kindly provided by Dr H. Holmes) on standard dilutions with five methods used in the late 1990s by the laboratories participating in the first WHO collaborative study8
n assays VQC copies/IU on 1st WHO (97/656) standard VQC copies/IU on 2nd WHO (97/650) standard
Assay 1st WHO 2nd WHO VQC Geomean 95%CI Geomean 95%CI
Siemens bDNA 3.0 64 69 48 0.39 0.34-0.44 0.58 0.51-0.66
Roche Amplicor Monitor 125 134 112 0.70 0.60-0.81 0.93 0.80-1.08
Roche Amplicor Monitor UltraSens 16 15 11 0.51 0.27-0.95 0.86 0.49-1.51
Organon Teknika NucliSens 46 51 36 0.80 0.69-0.92 0.43 0.36-0.50
Abbott LCx 14 15 14 0.76 0.60-0.96 0.69 0.56-0.86

Meanwhile the third and fourth International HIV-1 standards have been introduced, which in contrast to the previous standards had been heat-treated before lyophilisation.15, 16 The Supporting Information (Supporting Information S3 and Tables S3a-S3c) reviews the calibration of the NAT manufacturers’ test systems in copies and IUs on these latter standards as could be deduced from the data reported in the WHO collaborative studies of 2001,8 2011,15 and 2017.16

2.4 Cross calibration of untreated and heat-inactivated VQC and WHO standards in current VL assays

2.4.1 Experiment 1: Impact of heat-inactivation of VQC standard on calibration against native WHO standard in three quantitative HIV-1 RNA assays

Three laboratories using the Abbott RealTime, Hologic Aptima, and Roche CAP/CTM assays respectively tested six 1000 cp/mL samples of the S0012 VQC standard in parallel with six 1000cp/mL samples derived from the heat-inactivated S0041 version of the VQC standard as well as against six 1000 IU/mL samples prepared from the second WHO 97/650 WHO standard. The heat-inactivated VQC standard was pasteurized for 2hours at 65°C in 1:10 dilution in phosphate-buffered saline and was further diluted in EDTA plasma.29 Original calibration of the pasteurized against the native standard dilutions in 18 replicate bDNA 3.0 tests showed a recovery of HIV-RNA of 56% after pasteurization (as compared with 28% after lyophilization).26 The three (1000 cp/mL or IU/mL) reference samples were shipped in 1.2mL aliquots on dry ice to the three laboratories. The quantitative HIV-1 RNA results, as well as the copy/IU and inactivated/native copy conversion factors, were compared in the three VL assays. The differences in relative quantification by the assays were compared with the Bland-Altman agreement/bias values on subtype B (and C) patient samples observed in several clinical evaluation studies of VL assays.18-27

2.4.2 Experiment 2: Cross calibration of VQC and VQA standard against (heat-treated) WHO replacement standards

Dilutions of 10000, 3000, and 1000cp/mL of the VQC and VQA standards were calibrated against 10000, 3000 and 1000IU/mL concentrations of the second (97/650) and the heat-treated third (10/152) and fourth (16/149) WHO standards in the Abbott RealTime assay with six replicate tests per dilution.

2.5 Statistical analyses

The geometric mean VL and standard deviation were calculated on log values for estimating the 95% CIs for each set of VL results as well as for the corresponding copy/IU conversion factors. Relative quantification factors of assays on the standards were compared with the difference in quantification of clinical samples deduced from Bland-Altman analyses in comparative studies.18-27

3 RESULTS

3.1 Quantification of 1000cp/mL reference samples by five HIV-1 RNA assays

Figure 1 shows the distribution of quantitative results of five VL assays on 1000cp/mL VQC reference samples in four runs of six tests. The 24 test results of four of the five assays clustered within the expected range of 1000 ± 0.3log (500-2000) cp/mL except for one outlier value of 4617cp/mL were obtained with the Abbott RealTime Assay. By contrast all but one of the 23 available test results obtained with the Cepheid GeneXpert assay were found above this range. When the geometric mean values (and 95% CI) of all 24 VL tests were calculated (Table 3) it was found that the Cepheid assay reported 2.5 (1.7-3.7)-fold higher values than those assigned to the VQC standard, whereas the Abbott RealTime Assay reported VL results that were closest to the nominal value, that is, 1084 (784-1572) cp/mL (after the one outlier result had been excluded from the data set). The other assays of BioMerieux, Roche, and Hologic reported 1.1 (0.5-2.2), 1.3 (0.9-1.8), and 1.6 (1.3-2.0)-fold higher values than the nominal value, respectively. The Hologic Aptima assay was the most precise showing a 95% CI of 82%-122%, followed by Abbott, Roche, Cepheid, and BioMerieux with 95% CIs of 72%-163%, 70%-143%, 68%-146%, and 50%-201%, respectively.

Details are in the caption following the image
Distribution of quantitative results on 24 run control samples of 1000cp/mL (P0327 ViraQ HIV-1 Quant 1000) obtained in 2018 by five HIV-1 viral load assays in four test runs. Solid line represents value assigned to 1000cp/mL reference standard and dotted lines represents expected maximum range (1000±0.3 logcp/mL) of variation in individual test results (500-2000 cp/mL). cp, copies
Table 3. Geometric mean values and 95% confidence interval (CI) calculated for 24 replicate test results obtained in 2018 by five HIV-1 viral load assays in 4 test runs (individual data shown in Figure 1)
HIV-1 VL assay n Geomean, cp/mL 95% CI, cp/mL 95% CI, %
Abbott RealTime 24 1084 784-1572 72-163
Hologic Aptima 24 1616 1324-1973 82-122
Roche CAP/CTM 24 1277 892-1828 70-143
Cepheid Gene Xpert 23 2502 1713-3655 68-146
BioMerieux NucliSens EasyQ 24 1110 533-2230 50-201
  • a T factor of 2.07 used for calculating 95% confidence bounds using average and SD on log viral load values.
  • b One outlier result of 4617 cp/mL was excluded from the calculations.

3.2 Calibration of untreated and heat-inactivated VQC HIV-1 standards against the second WHO standard in three quantitative assays

To investigate whether the relative quantification of HIV-1 subtype B RNA was not only assay dependent but also standard dependent we compared calibration of native and heat-inactivated VQC standard dilutions of 1000cp/mL against 1000IU/mL samples of the native lyophilized second WHO 97/650 standard in three quantitative assays (Table 4 and Figure S4). There seemed to be more consensus between the Abbott, Hologic, and Roche assays in quantifying the second WHO standard than for the case of the native and heat-inactivated VQC standard. However, when the relative quantification factors were compared with those found on the clinical samples in different comparison studies18-27 the factors on the VQC standards were generally closer to the ones found on clinical samples (Table S4). When compared with the Abbott RealTime assay the Hologic Aptima assay quantified the native VQC standard with 1.4-fold higher relative values than the second WHO standard (1.53 vs 1.10) whereas the Roche CAP/CTM quantified the two standards with equally higher values than the Abbott assay (1.34 vs 1.38). As a result, the VQC copy/IU conversion factor was lower in the Hologic assay (0.37) than in the Abbott and Roche assays (0.51 and 0.53, respectively). These conversion factors were comparable with those reported in the package inserts of the three manufacturers (Table S1) although Abbott and Roche had used the first WHO 97/656 standard and Hologic the third WHO 10/152 standard for calibration. Even though the Abbott RealTime and Roche CAP/CTM 2.0 assays had comparable copy/IU factors according to their package insert the Roche assay reported 1.34- to 1.38-fold higher values on the VQC and second WHO 97/650 standard in our calibration experiment, whereas on the clinical (subtype B) samples 1.44- to 2.33-fold higher values were found in the comparison studies (Table S4).

Table 4. Cross calibration of three HIV-1 subtype B standards in three viral load assays
HIV-1 VL assay VQC native 1000cp/mL, n=6, geomean (95% CI), cp/mL 2nd WHO 1000IU/mL, n=6, geomean (95% CI), cp/mL VQC inact 1000cp/mL, n=6, geomean (95% CI), cp/mL copy/IU, 95% CI Copy inact/native, 95% CI
Abbott RealTime 1038 (582-1852) 533 (424-671) 955 (585-1559) 0.51 (0.31-0.86) 0.92 (0.49-1.73)
Hologic Aptima 1592 (1259-2012) 589 (455-762) 1860 (1251-2765) 0.37 (0.28-0.49) 1.17 (0.80-1.71)
Roche CAP/CTM 1391 (854-2264) 734 (408-1322) 2172 (1709-2760) 0.53 (0.28-0.99) 1.56 (1.00-2.45)
  • a Native VQC standard.
  • b Heat-inactivated VQC standard.
  • c Copies assigned to VQC standard per IU assigned to 2nd WHO 97/650 standard.

In contrast to the second WHO standard, the third and fourth WHO standards were heat-inactivated. We also studied the impact of pasteurization of the VQC standard, which did not significantly change the original quantification when the Abbott RealTime assay was used, but considerably increased the values obtained with the Hologic and Roche CAP/CTM assays by a factor of 1.94 and 2.27, respectively, as compared with the Abbott RealTime assay (Table S4). Interestingly, these latter factors were comparable with 1.89-2.23-fold higher quantification by Hologic and Roche assays as compared with the Abbott assay when the data on the heat-treated third International Standard from the last WHO collaborative study report of 201716 were analyzed (Table S3c).

We did not compare the quantification of the VQC and WHO standards in the Gene Xpert assay that reported 2.31-fold higher values than the Abbott RealTime assay on the native VQC standard, comparable with a factor of 2.24 observed on 277 (likely subtype C) samples from South Africa.21 Like the Hologic Aptima test the Gene Xpert assay was calibrated on the third heat-treated WHO 10/152 standard for which a conversion factor of 0.58 copy/IU is reported in the Cepheid package insert, which is 1.65-fold higher than the copy/IU factor of 0.35 reported in the test package insert of the Hologic Aptima assay. This factor was in agreement with the 1.55-fold higher values reported by Xpert assay on the VQC standard as compared with the Hologic assay in our study (Table 3).

3.3 Cross calibration of VQC and VQA standards against three successive WHO international standards using the Abbott RealTime assay

Table 5 and Supporting information S5 shows the results of a recent cross-calibration experiment of VQC and VQA standards against three successive WHO international standards using the Abbott RealTime assay. The results showed that the VQC copy/IU conversion factor (95%CI) reduced by 39% from 0.41 (0.27-0.63) against the second to 0.25 (0.15-0.41) against the fourth WHO standard in this experiment.

Table 5. Calibration of the VQC standard against three WHO replacement standards and VQA standard in the Abbott RealTime Assay
HIV-1 standard Nominal value of standard n geomean (95% CI) copies/mL Potency (95% CI)
S0012 VQC 1000cp/mL 18 944 (698-1276) 1.00 (reference)
2nd WHO 97/650 1000IU/mL 6 392 (266-577) 0.41 (0.27-0.63)
3rd WHO 10/152 1000IU/mL 18 291 (220-577) 0.31 (0.21-0.45)
4th WHO 16/149 1000IU/mL 18 236 (156-356) 0.25 (0.15-0.41)
VQA 1000cp/mL 18 753 (579-987) 0.80 (0.52-1.21)
  • a Concentrations of 1000, 3000, and 10000copies/mL or IU/mL were prepared and tested in six replicates by Abbott RealTime assays. Of 2nd WHO 97/650 standard only 1000IU/mL concentration was available for testing.
  • b Equivalent to copies/IU based on copies assigned to VQC standard and IUs to WHO standards.
  • c Equivalent to VQA copy/VQC copy conversion factor.

4 DISCUSSION

In SSA countries, the current VL assays are usually applied in combination with DBS,2-6 which increases the 95% LOD from below 50cp/mL to near the 1000cp/mL threshold value recommended by the WHO for making therapeutic decisions in ART monitoring.1 This comparison study indicates that even with direct testing of plasma samples there is substantial variation in quantitative results reported by five widely used VL assays at the 1000cp/mL threshold value. We found up to 50%-200% confidence bounds in assay precision and up to 2.5-fold differences in assay calibration when assessed against the VQC subtype B standard. The geometric mean values (95% CI) reported on 24 replicate tests on the 1000cp/mL run control samples varied from 1084 (784-1572) in the Abbott RealTime assay to 2502 (1713-3655) in the Cepheid Xpert assay. Similar variation in VL results was found in the clinical evaluation studies comparing different VL assays in testing subtype B patient samples, although differences in VL were usually greater with subtype C and other non-B subtypes.18-27

The Hologic Aptima reported 1.5 (1.2-1.9)-fold higher values than the Abbott RealTime assay on the VQC standard, but when the second WHO 97/650 standard was used this difference reduced to 1.1 ((0.8-1.3)-fold. By contrast the Roche CAP/CTM quantified both the VQC and second WHO standard with 1.3-1.4-fold higher values than the Abbott assay. However, the quantification of the heat-inactivated version of the VQC standard was significantly (1.9- to 2.3-fold) higher by the Hologic and Roche assays than by the Abbott assay. Interestingly, the WHO collaborative study report16 also showed 1.9- to 2.2-fold higher copy/mL values with the heat-treated third WHO standard in the Hologic Aptima and Roche CAP/CTM v2.0 assays than in the Abbott RealTime assay. Hence, pasteurization of HIV-1 in VQC and WHO standards had a destandardizing effect. This can be explained by differences in efficiency of nucleic acid extraction methods between assays and by aggregation of viral particles and proteins in the heat-treated standard. In addition, lyophilization of HIV-1 in plasma may affect virion integrity and the natural configuration of the virus in EDTA plasma as we previously observed that VL recovery after lyophilization (and rehydration) was even lower than after pasteurization (28% vs 56%).29 The latest third and fourth WHO standards are both heat-treated and lyophilized,15, 16 and this likely has an impact on the relative quantification with different VL assays and the copy/IU conversion ratios reported in the manufacturers' package inserts. We suspect that the higher copy number reported using the Cepheid Xpert and Hologic Aptima assays is related to calibration against the third heat-treated WHO standard instead of the untreated first and second WHO standards that was used for calibration of the Roche cobas CAP/CTM, Abbott RealTime, and BioMerieux NucliSens assays. Unfortunately, the calibration system of WHO replacement standards does not follow the ISO 17511 guideline for metrological traceability of measurement values in biological samples and does not report uncertainties in calibration of replacement standards in arbitrary IUs.30 The continuity of the amount of HIV-1 RNA per IU cannot be guaranteed as the calibration of a newer standard batch in the WHO collaborative studies does not include the first established standard but only the latest replacement standard. This may be the reason why NAT manufacturers, so far, do not report quantitative results of HIV-1 VL assays in IUs but still calibrate their assays in nucleic acid copy numbers based on their own standards. In one cross-calibration experiment, we observed a significant drift in cp/IU of 19% with the third and 39% with the fourth WHO replacement standards as compared with the second WHO standard in the Abbott RealTime assay. A similar drift in copy/IU ratios as in our calibration experiment was also found in this as well as in other VL assays when reviewing the quantitative data on the third and fourth International Standard in the latest WHO collaborative study report showing 11%-37% lower copy/IU values by different VL assays on the replacement standard. Hence, the system used by WHO for biological standardization with different batches of lyophilized reference ampoules over time is not ideal for consistent calibration of NAT methods. Therefore, we believe that a liquid HIV-1 plasma standard stored at −80°C and available in sufficient supply for many decades (such as the VQC and VQA standards) are instrumental in providing a second anchor in assay calibration.

The data in this study demonstrate that there is not only considerable variation in quantitative values reported by the assays on one standard but also in the relative quantification of different standards (even for standards of the same subtype, ie, HIV-1 subtype B on which all assays are standardized). It is not known to what extent the different viral standards examined in this report are commutable to natural HIV-1 subtype B samples and therefore we compared relative quantification of VQC and WHO standards by different assays with Bland-Altman agreement factors observed in a number of clinical studies.18-27 These studies demonstrated up to twofold differences in relative quantification of subtype B samples and more than threefold on some of the non-B subtypes. To our knowledge there are no data that compare the relative quantification of 1000cp/mL reference samples of different subtypes and circulating recombinant forms (CRFs), but Manak et al22 compared three assays on clinical (and cultured) HIV-1 samples of different subtypes and showed significant differences in relative quantification depending on the subtype. A 20-member panel of different subtypes and CRFs (P0140 HIV 1000cp/mL subtype reference panel, BioQControl) is available and has been used by some manufacturers for performance evaluation without publishing the data. The limited available data on this multiple subtype reference panel show significant differences in relative quantification of subtypes (unpublished results). So far, the Common Technical Specifications of the EU/IVD Directive/Regulation do not set requirements for analytical sensitivity or quantification of NAT methods for different genotypes. The data presented in this report illustrate that the quantification on one WHO standard batch is not predictive for the performance on other standards and clinical samples, even of the same subtype. Clearly, there are fundamental limitations in using only one standard for assay calibration and the WHO standardization system becomes even weaker when international standards are frequently replaced without being able to relate to a higher order standard with a known number of virions (and double amount of nucleic acid copies as HIV is a diploid virus). The VQC and VQA standards have been calibrated 25 years ago in nucleic acid copies and the VQC standard was also calibrated in IUs assigned to the first and second International Standards in the first WHO collaborative study.8 These liquid viral standards in plasma cannot be considered higher order standards but are instrumental for assay calibration based on the copy numbers historically assigned to these standards. The VQC standard is still available for future WHO calibration studies, which—if properly designed—could help minimizing quantification bias between VL assays as much as possible.

What are the implications of these considerations for the interpretation of quantitative results of HIV-1 RNA assays when classifying patients with VL above or below the 1000cp/mL threshold recommended by WHO in therapy monitoring? This study shows that there are several limitations in assay standardization, reducing the accuracy of VL assays in reporting copy numbers up to 2.5-fold, whereas on top of that assay precision (95% CIs) is limited to 50%-200%. We, therefore, propose that clinicians use an inaccuracy range from 500-2000cp/mL rather than a distinct 1000cp/mL cutoff value to identify patients with possible ART breakthrough infection or with poor adherence to drug-taking schemes. Lack of viral control may be confirmed when VL has increased to a level between 500cp/mL to 2000cp/mL. It is proposed that further studies are performed to compare VL assays with and without DBS on 1000cp/mL reference samples of multiple subtypes and CRFs to obtain an even better understanding of how the WHO recommended 1000cp/mL alert level in therapy monitoring should be interpreted by clinicians. If certain assays show deviating results from the others, on multiple subtypes, recalibration should be considered by the manufacturer to harmonize VL detection worldwide.

CONFLICT OF INTERESTS

Authors are owners of a company (BioQControl) that manufactures run controls and reference panels for performance evaluation of infectious disease tests.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.