Volume 87, Issue 3 pp. 1184-1206

GUIDELINES

Free Access

Development, validation, qualification, and dissemination of quantitative MR methods: Overview and recommendations by the ISMRM quantitative MR study group

Sebastian Weingärtner,

Sebastian Weingärtner

orcid.org/0000-0002-0739-6306

Department of Imaging Physics, Delft University of Technology, Delft, The Netherlands

Search for more papers by this author

Kimberly L. Desmond,

Kimberly L. Desmond

orcid.org/0000-0003-0626-9310

Brain Health Imaging Centre, Centre for Addiction and Mental Health, Toronto, Ontario, Canada

Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada

Search for more papers by this author

Nancy A. Obuchowski,

Nancy A. Obuchowski

Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, Ohio, USA

Search for more papers by this author

Bettina Baessler,

Bettina Baessler

Institute of Diagnostic and Interventional Radiology, University Hospital Zurich, Zurich, Switzerland

Search for more papers by this author

Yuxin Zhang,

Yuxin Zhang

orcid.org/0000-0001-9852-7959

Department of Medical Physics, University of Wisconsin-Madison, Madison, Wisconsin, USA

Department of Radiology, University of Wisconsin-Madison, Madison, Wisconsin, USA

Search for more papers by this author

Emma Biondetti,

Emma Biondetti

orcid.org/0000-0001-6727-0935

Department of Neuroscience, Imaging and Clinical Sciences, D'Annunzio University of Chieti and Pescara, Chieti, Italy

Search for more papers by this author

Dan Ma,

Dan Ma

orcid.org/0000-0003-1664-9579

Department of Biomedical Engineering, Case Western Reserve University, Cleveland, Ohio, USA

Search for more papers by this author

Xavier Golay,

Xavier Golay

orcid.org/0000-0002-6447-4446

Brain Repair & Rehabilitation, Institute of Neurology, University College London, United Kingdom

Gold Standard Phantoms Limited, Rochester, United Kingdom

Search for more papers by this author

Michael A. Boss,

Michael A. Boss

orcid.org/0000-0002-9492-767X

Center for Research and Innovation, American College of Radiology, Philadelphia, Pennsylvania, USA

Search for more papers by this author

Jeffrey L. Gunter,

Jeffrey L. Gunter

orcid.org/0000-0001-7813-9078

Department of Radiology, Mayo Clinic, Rochester, Minnesota, USA

Search for more papers by this author

Kathryn E. Keenan,

Kathryn E. Keenan

orcid.org/0000-0001-9070-5255

National Institute of Standards and Technology, Boulder, Colorado, USA

Search for more papers by this author

Diego Hernando,

Corresponding Author

Diego Hernando

orcid.org/0000-0002-0016-0317

Department of Medical Physics, University of Wisconsin-Madison, Madison, Wisconsin, USA

Department of Radiology, University of Wisconsin-Madison, Madison, Wisconsin, USA

Correspondence

Diego Hernando, Department of Radiology, University of Wisconsin-Madison, Madison, WI 53705, USA.

Email: [email protected]

Search for more papers by this author

the ISMRM Quantitative MR Study Group,

the ISMRM Quantitative MR Study Group

Search for more papers by this author

Sebastian Weingärtner,

Sebastian Weingärtner

orcid.org/0000-0002-0739-6306

Department of Imaging Physics, Delft University of Technology, Delft, The Netherlands

Search for more papers by this author

Kimberly L. Desmond,

Kimberly L. Desmond

orcid.org/0000-0003-0626-9310

Brain Health Imaging Centre, Centre for Addiction and Mental Health, Toronto, Ontario, Canada

Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada

Search for more papers by this author

Nancy A. Obuchowski,

Nancy A. Obuchowski

Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, Ohio, USA

Search for more papers by this author

Bettina Baessler,

Bettina Baessler

Institute of Diagnostic and Interventional Radiology, University Hospital Zurich, Zurich, Switzerland

Search for more papers by this author

Yuxin Zhang,

Yuxin Zhang

orcid.org/0000-0001-9852-7959

Department of Medical Physics, University of Wisconsin-Madison, Madison, Wisconsin, USA

Department of Radiology, University of Wisconsin-Madison, Madison, Wisconsin, USA

Search for more papers by this author

Emma Biondetti,

Emma Biondetti

orcid.org/0000-0001-6727-0935

Department of Neuroscience, Imaging and Clinical Sciences, D'Annunzio University of Chieti and Pescara, Chieti, Italy

Search for more papers by this author

Dan Ma,

Dan Ma

orcid.org/0000-0003-1664-9579

Department of Biomedical Engineering, Case Western Reserve University, Cleveland, Ohio, USA

Search for more papers by this author

Xavier Golay,

Xavier Golay

orcid.org/0000-0002-6447-4446

Brain Repair & Rehabilitation, Institute of Neurology, University College London, United Kingdom

Gold Standard Phantoms Limited, Rochester, United Kingdom

Search for more papers by this author

Michael A. Boss,

Michael A. Boss

orcid.org/0000-0002-9492-767X

Center for Research and Innovation, American College of Radiology, Philadelphia, Pennsylvania, USA

Search for more papers by this author

Jeffrey L. Gunter,

Jeffrey L. Gunter

orcid.org/0000-0001-7813-9078

Department of Radiology, Mayo Clinic, Rochester, Minnesota, USA

Search for more papers by this author

Kathryn E. Keenan,

Kathryn E. Keenan

orcid.org/0000-0001-9070-5255

National Institute of Standards and Technology, Boulder, Colorado, USA

Search for more papers by this author

Diego Hernando,

Corresponding Author

Diego Hernando

orcid.org/0000-0002-0016-0317

Department of Medical Physics, University of Wisconsin-Madison, Madison, Wisconsin, USA

Department of Radiology, University of Wisconsin-Madison, Madison, Wisconsin, USA

Correspondence

Diego Hernando, Department of Radiology, University of Wisconsin-Madison, Madison, WI 53705, USA.

Email: [email protected]

Search for more papers by this author

the ISMRM Quantitative MR Study Group,

the ISMRM Quantitative MR Study Group

Search for more papers by this author

First published: 26 November 2021

https://doi.org/10.1002/mrm.29084

Citations: 18

Sebastian Weingärtner and Kimberly L. Desmond contributed equally to this work.

Share a link

Email
Wechat
Bluesky

Abstract

On behalf of the International Society for Magnetic Resonance in Medicine (ISMRM) Quantitative MR Study Group, this article provides an overview of considerations for the development, validation, qualification, and dissemination of quantitative MR (qMR) methods. This process is framed in terms of two central technical performance properties, i.e., bias and precision. Although qMR is confounded by undesired effects, methods with low bias and high precision can be iteratively developed and validated. For illustration, two distinct qMR methods are discussed throughout the manuscript: quantification of liver proton-density fat fraction, and cardiac T₁. These examples demonstrate the expansion of qMR methods from research centers toward widespread clinical dissemination. The overall goal of this article is to provide trainees, researchers, and clinicians with essential guidelines for the development and validation of qMR methods, as well as an understanding of necessary steps and potential pitfalls for the dissemination of quantitative MR in research and in the clinic.

1 INTRODUCTION

MR probes a wide array of tissue contrasts, spectral properties, and anatomical information. Based on this wealth of contrast mechanisms, a variety of quantitative MR (qMR) methods that extract quantifiable information from MR acquisitions^1-3 have been proposed and continue to emerge from the MR research community. Upon successful development and validation, qMR methods enable improved standardization in the detection, staging, and treatment monitoring of diseases, both in research and in clinical practice.^4-7 On behalf of the International Society for Magnetic Resonance in Medicine (ISMRM) Quantitative MR Study Group, we provide an overview of the process of development of qMR methods, as well as guidelines for their technical validation, clinical qualification, application, and dissemination. To illustrate this process, we provide examples from two distinct qMR methods: quantification of liver proton-density fat fraction (PDFF), and T₁ quantification in the myocardium (cardiac T₁ mapping; see Figure 1). These two methods were selected based on their substantial interest within the MR research community, important existing and potential applications, and major advances toward widespread clinical use. Importantly, the current status of development and remaining challenges are different for these two methods, which helps illustrate the diversity in the field of qMR.

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Example quantitative MR methods illustrated in this manuscript. Top, Liver PDFF mapping, with applications in the evaluation of non-alcoholic fatty liver disease (where liver PDFF is an emerging biomarker for the early diagnosis of NAFLD) and non-alcoholic steatohepatitis (where liver PDFF is emerging as a medical research tool in combination with other noninvasive imaging biomarkers). Bottom, Cardiac T₁ mapping, with applications in various ischemic and non-ischemic cardiomyopathies

1.1 Liver PDFF quantification

PDFF has been developed, validated, and applied for the assessment of tissue triglyceride concentration.⁸ Although chemical shift encoded (CSE) fat-water imaging was introduced nearly 40 years ago,⁹ the development of quantitative techniques that measure PDFF has accelerated over the past two decades.^10-17 Using either MRS¹⁶ or MRI⁸ acquisitions, PDFF measures the concentration of MR-visible triglyceride protons relative to all MR-visible protons (from triglycerides and water), which has multiple research and clinical applications. Recent technical developments and validation studies (see below) have led to widely available techniques. These techniques are particularly promising for the quantification of liver fat, e.g., in the assessment of non-alcoholic fatty liver disease (NAFLD).

1.2 Cardiac T₁ mapping

Although cardiac T₁ mapping was first developed in the 1990s,¹⁸ the field accelerated more recently when the promise to enable non-invasive assessment of diffuse fibrosis emerged.¹⁹ As the T₁ relaxation time depends on the mobility in the macromolecular environment, over time cardiac T₁ mapping has proved useful in many clinical applications.²⁰ Initially, semi-quantitative relaxation measurements in the myocardium based on Look-Locker sequences were explored.^{21, 22} However, these methods lacked the reproducibility and reliability to serve as a quantitative tool in clinical application. With the introduction of the Modified Look-Locker Inversion Recovery (MOLLI)²³ and shortened MOLLI (shMOLLI)²⁴ methods, myocardial T₁ mapping became feasible on a voxel-by-voxel basis in a single breath-hold and with high visual T₁ map quality. This facilitated the widespread use and application to numerous ischemic and non-ischemic cardiomyopathies.^{25, 26} Continuous method development and refinement have led to increasingly sensitive and reliable T₁ measurements of the heart, paving the way for routine clinical use.²⁰

2 TECHNICAL PERFORMANCE OF qMR METHODS

The development and validation of qMR methods requires a framework for describing their technical performance. The two major technical performance properties of a qMR method (Figure 2) are: (1) the bias, which includes the properties of linearity, the regression slope and intercept, and the overall bias; and (2) the precision, which is described by the repeatability and reproducibility. These metrics are described in detail below and summarized in Table 1. Previous works in qMR have used deviating terminologies, including “accuracy” or “robustness,” to describe technical performance. However, in this work we use, and encourage others to use, metrics based on bias and precision, as established by the quantitative imaging metrology community.^{27, 28} A glossary of the terminology used throughout this paper can be found in Supporting Information Table S1, which is available online.

TABLE 1. Technical performance metrics

Metric	Definition	Liver PDFF	Cardiac T₁ (MOLLI)
Linearity	Ability to provide measurements that are proportional to the true value as described in Equation (1)	r² = 0.96 and no evidence of significant higher-order terms in regression analysis of measurements vs true value⁴⁸	r² = 0.996, magnitude of higher order terms < 0.0001⁵²
Regression slope	β₁ in Equation (1)	0.975⁴⁸	0.919⁵²
Fixed bias	β₀ in Equation (1)	<0.2%⁴⁸	4.2%⁵²
Bias (or precision) profile	A table or figure illustrating the estimates of the bias (or precision) over the range of true values and/or other relevant characteristics	See Yokoo et al., 2018⁴⁸	See Roujol et al., 2014^{52, 57}
Repeatability	A measure of precision describing the variability in measurements on a subject over a short period of time using the same imaging system and experimental conditions²⁷	Repeatability coefficient of 2.9%⁴⁸	Repeatability coefficient of 2.0%⁵²–4.6%⁵⁸
Reproducibility	A measure of precision describing the variability in measurements on a subject using different experimental conditions (different systems, and/or pulse sequence parameters, and/or measurements separated by a long period of time, etc)²⁷	Reproducibility coefficient (across different hardware systems or reconstruction software) of 4.3%⁴⁸	Highly variable. In tightly controlled studies, 2.1% has been reported,⁵⁹ however, a meta-analysis showed >7% reproducibility in healthy subjects.⁶⁰ For this reason, it is not recommended to compare MOLLI T₁ values across systems and parameters, due to system specific biases²⁰

2.1 Bias

Bias describes the systematic tendency of qMR measurements to differ from the ground-truth value of the measurand (i.e., the underlying quantity of interest). To define bias, let X_i denote the ground-truth value of the measurand for the i-th subject, and Y_ij denote the j-th qMR measurement for the i-th subject. Ideally, the measurements and the ground truth possess an affine linear relationship, as follows^{29, 30}:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0001$ (1)

where $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0002$ is the intercept, $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0003$ is the regression slope, and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0004$ is a random effect, which we assume is independently and identically distributed from a normal distribution with mean zero and variance $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0005$ (which captures the precision).

To measure bias, the ground-truth value can sometimes be ascertained by using a reference method. Although the ground-truth should be conceptually well defined, its estimation via a reference method is generally imperfect and requires careful design. Such a reference method may be invasive or non-invasive, and based on MR or other modalities. Importantly, a reference method should be independent from the qMR method under evaluation (e.g., should not be obtained from the same source data), as any dependence between the two measurements may lead to an underestimation of the qMR method’s bias. Furthermore, to be accepted as a reference method, its measurements must be highly concordant with the ground truth, and its performance (bias and precision) must be substantially better than the performance of the method under evaluation.²⁸ These requirements often complicate the acquisition of a reference method in vivo.

For this reason, investigators often rely on reference objects (“phantoms”) to assess bias. The design of phantoms is driven by the technique they will be testing and the specific tissues or MR properties they will mimic, as well as additional considerations such as traceability and long-term stability.³¹ It is important that phantoms themselves are systematically measured prior to use, which is sometimes achieved using gold standard NMR measurements,³² the best available reference method on MRI systems, or non-MR methods. With a standard phantom, such as the ISMRM/NIST system phantom,³³ or a well-characterized home-built phantom, the technical performance of a qMR method can be estimated, as an approximation of in vivo technical performance. Although phantoms are highly effective in many qMR applications, there are cases where phantoms may be of limited value, as existing phantom designs do not adequately replicate the relevant signal properties, spatial distribution, or temporal dynamics found in tissue.³⁴

In a phantom study (or in vivo, if a suitable reference method is available), measurements are obtained at multiple values X_i (e.g., corresponding to different phantom compartments) over the range of the true value, X. Ideally, at least 10 nearly equally spaced values X_i should be chosen,²⁹ covering the range of values of interest. This range usually includes the normal range expected in a healthy reference cohort as well as values observed under influence of the pathology or condition of interest. For each value i, the individual bias or % bias is calculated as:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0006$ (2)

where $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0007$ is the mean over the potentially repeated measurements on the same phantom compartment (or subject). Over N observations, we can estimate the overall bias:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0010$ (3)

Finally, some qMR methods may present a constant bias that is not dependent on the true value of the measurand. In these cases, the fitted line in Equation (1) will be parallel to the identity line, with regression slope close to one, and regression intercept that provides an estimate of the overall bias.²⁹ A 95% confidence interval (CI) for the estimate of the fixed bias should be reported.³⁵ Note that small values of fixed bias are often well tolerated. For example, under typical conditions, confidence intervals for a new patient's measurement constructed under the no-bias assumption provide nominal coverage as long as the fixed bias is <12% of the within-subject SD (wSD).³⁶

Because the bias sometimes depends on the true value of the measurand, the bias profile can be evaluated by plotting the estimate of bias from each value of X_i (i.e., b_i) against the true values X_i. Note that the relationship between measurements and ground truth may generally be nonlinear, particularly when considering a broad range of measurand values. However, the assumption of linearity is often an appropriate approximation and greatly simplifies the statistical analysis. To assess the property of linearity, we fit an ordinary least squares (OLS) regression of the Y_ijs on X_is. One way to test the appropriateness of a linear model is to formally test for significant non-linearities (curvature).²⁹ Sequential tests can be performed starting with a third-order (cubic) regression: $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0011$ . If the third order coefficient $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0012$ is not significantly different from zero, then the process can be repeated with a second-order (quadratic) regression: $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0013$ . If the quadratic term $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0014$ is not significantly different from zero, then the hypothesis of a linear model cannot be rejected, and a linear fit can be used: $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0015$ .²⁹ Ideally, R-squared (R²) will be greater than 0.90.³⁵ The regression slope, $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0016$ , should be reported along with its 95% CI. As a general rule, a slope in the range [0.95, 1.05] is acceptable.³⁵ Sometimes, the linear relationship in Equation (1) holds only for a certain range of the values of the measurand, so it is important to assess this property over the likely values of the measurand. It may be that linear relationships hold for various ranges of the true value, but $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0017$ and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0018$ differ for each range or even vary by subject characteristics (e.g., age or body mass index).

Finally, once the bias is known, the qMR method could, in principle, be calibrated to the reference to eliminate bias. However, this is not a common approach. Indeed, the bias itself often arises from uncorrected confounding factors that may affect various acquisitions or patients differently.^{37, 38} For this reason, bias is often not reproducible, and calibration-based correction should be approached with caution.

2.2 Precision

Precision describes the tendency of the measurement system, when used repeatedly in several “replicate” measurements on the same subject, to produce different values.²⁷ The precision of a method has enormous practical importance. Indeed, the required number of participants for clinical studies increases with $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0019$ and is, therefore, driven by the precision of a qMR method, which determines their cost and feasibility.³⁹ The precision is also a major factor affecting the method’s ability (including sensitivity and specificity) to detect a specific condition, and determines the minimum detectable change (see Figure 3). In contrast to bias, the evaluation of precision does not require a reference method, and therefore in vivo evaluation of precision is often highly feasible.

Some studies use spatial variability in a homogeneous phantom or tissue as a heuristic to evaluate precision metrics. While this may be an acceptable approximation with certain simple imaging methods, spatial variability of system properties (e.g., B₀ and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0020$ heterogeneities) often render this approximation inadequate, even if the region of interest appears homogeneous to the observer. In particular, this spatial variability method cannot be used to evaluate precision metrics if spatial information is used in the image reconstruction or parametric mapping (e.g., regularization in compressed sensing). Instead, precision metrics should be evaluated in studies that obtain and compare multiple replicate measurements.

Test-retest studies allow estimation of precision. When the same MR system and experimental conditions (including acquisition parameters) are used for all replicate measurements on the same subject over a short span of time, we refer to this as the repeatability condition. When the replicates are obtained under different conditions (e.g., different field strengths, different MRI vendors, platforms, or software versions, different individual scanners, different pulse sequences or acquisition parameters, different image analysis software, different readers, or long delay between acquisitions), we call this the reproducibility condition [3]. With qMR, we often characterize precision by either the wSD, or the within-subject coefficient of variation, denoted wCV. Note from Equation (1), $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0021$ , and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0022$ . The wCV is often used when the variability in the measurements is much higher for large true values or when the measurements are log-normally distributed (thus, X_i and Y_ij in Equation (1) would need to be measured on a logarithmic scale).

Precision studies are often small because of cost, ethical, and technical concerns.^{29, 30, 40} For these reasons, meta-analysis is often required to pool estimates from multiple studies.⁴¹ A general rule of thumb to obtain a reliable estimate of precision is >35 subjects with two or more replicates.³⁶

For each subject in a test-retest study, the qMR measurement is performed at time point 1 (denoted Y_i1) and time point 2 (Y_i2). Additional time points can be included, if available. For each subject, we can calculate the mean and SD of the measurements: $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0023$ and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0024$ . From the N subjects, we estimate the mean wSD or wCV as:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0025$ (4)

Importantly, 95% CIs for wSD and wCV should also be reported.³⁵ Implicit in Equation (4) is the assumption that wSD (or wCV) is constant over the range of measurand values. This assumption should be assessed by calculating the estimates for several ranges of $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0026$ or even for various patient and/or disease characteristics, to determine a precision profile.^{42, 43}

Two useful precision metrics are the repeatability coefficient (RC and %RC), estimated as:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0027$ (5)

and the reproducibility coefficient (RDC and %RDC), estimated analogously. These metrics describe the smallest significant difference between two repeated measurements on a subject and assuming a normal distribution for the replicate measurements.^{27, 30} These metrics can be used as thresholds to discern between differences due to measurement imprecision and differences due to a true change in the measurand.

In evaluating reproducibility, the experimental conditions across replicate measurements can be altered in various different ways (see above). Ultimately, the widespread dissemination of a qMR method will require establishment of reproducibility across conditions such as different centers, MRI vendors, or patient populations. However, such multi-center, multi-vendor studies are expensive and complex, and may not be appropriate for a newly developed qMR method. A practical approach is to evaluate reproducibility in multiple studies of increasing complexity, beginning with relatively simple studies at a single center and vendor,⁴⁴ while building up toward more ambitious studies.⁴⁵

In evaluating precision, it is important to carefully design and describe the specific procedures followed. For example, repeatability may be evaluated by performing consecutive scans within the same scanning session, in order to establish the effects due to MR system adjustments and noise. However, repeatability is often determined by scanning the subject in separate sessions over a short time interval, including repositioning and re-localizing between sessions, in order to capture additional variability due to other factors such as subject positioning.²⁹ In general, the experimental design to study precision may be different for different methods or applications. Thus, it is important to meticulously report the parameters and conditions that are kept identical and those which may have changed between replicate measurements, to enable replication and interpretation of the results.

2.3 Examples

2.3.1 Liver PDFF quantification

Liver PDFF quantification methods have been validated in multiple studies including evaluation of bias in PDFF phantoms (both commercially available and home-built),^{46, 47} in vivo liver imaging,⁴⁸ and ex vivo livers.⁴⁹ In a recent meta-analysis,⁴⁸ liver PDFF had high linearity and low bias with respect to the MRS-determined reference PDFF value in 23 studies, which included a total of 1,679 subjects. Test-retest repeatability studies have also been performed.^{48, 50} Recently, high linearity and low bias of multi-center PDFF measurements⁴⁷ has been demonstrated by shipping a phantom to multiple centers in a “round-robin” study, and evaluating measurements on the same phantom across centers, vendors, platforms, field strengths, and acquisition parameters. In addition, the reproducibility of PDFF measurements in the liver, including across field strengths and MRI vendors, has also been demonstrated in multiple studies.^{48, 51}

2.3.2 Cardiac T₁

Bias and precision have been the dominating criteria in analysis of cardiac T₁ mapping methods [35]. Multiple studies have shown that different T₁ mapping methods provide varying profiles of bias and precision.

Inversion recovery-based methods have been shown to exhibit good repeatability but large biases, while saturation recovery methods have been shown to reduce bias at the cost of reduced repeatability.^52-54 For example, the most commonly used myocardial T₁ mapping technique, the inversion recovery-based method MOLLI, is known to be subject to multiple confounding factors and exhibits substantial bias.⁵⁵ However, given its excellent repeatability and visual image quality, the sequence is highly popular among users.²³ Even though some studies have shown initial evidence of multi-center or multi-vendor reproducibility with tightly controlled protocols,⁵⁶ the reproducibility is generally compromised due to the measurand confounders (see next section). Thus, it is recommended to obtain center and protocol specific reference ranges in healthy subjects, before using MOLLI for quantitative diagnosis.²⁰

3 CONFOUNDING FACTORS IN MR

In qMR, a wide variety of confounding factors may introduce bias or poor precision. Table 2 provides illustrative categories and examples of qMR confounding factors. A poll distributed among the members of the ISMRM Quantitative MR Study Group queried the frequency, relevance, and potential correction mechanisms for confounding factors used in the quantitative MR community. Supporting Information Figures S1–S6, which are available online, summarize the poll results.

TABLE 2. Types of qMR confounding factors and illustrative examples

Hardware/system imperfections	Physiological effects/motion	Signal model imperfections	Other artifacts and noise
B₀ heterogeneities and off-resonance	Respiratory motion	Additional relaxation mechanisms (not included in model)	Partial volume
B₁ heterogeneities	Cardiovascular motion/pulsation		Slice profile imperfections
Eddy currents	Intestinal peristalsis	Spectral complexity (additional resonances, J-coupling, etc.)	Imperfect spoiling
Gradient nonlinearities	Bulk body motion		Parallel imaging artifacts
System drift	Blood flow	Exchange (multi-pool)	Noise

3.1 Hardware and system imperfections

The presence of magnetic field heterogeneities (both B₀ and B₁),⁶¹ gradient nonlinearities,⁶² concomitant gradients,^{63, 64} eddy currents,⁶⁵ system drifts,⁶⁶ timing errors,⁶⁷ and other system imperfections, is unavoidable in MR applications. These effects may result in tolerable artifacts in qualitative MR as long as the relative visual contrast between tissues is preserved, but may introduce substantial bias and poor precision in qMR methods.

3.2 Physiological effects and motion

Physiological motion effects include respiration, cardiovascular motion and pulsation, intestinal peristalsis, and bulk patient motion, among others. These effects often result in artifacts, ghosting, and mis-registration in the acquired images,⁶⁸ which can in turn introduce bias and poor precision in qMR. Blood and tissue motion during the acquisition can also introduce artifacts, phase offsets, and dephasing that confound the quantification.

3.3 Signal model imperfections

Practical qMR methods rely on simplified signal models. The presence of signal effects that are not included in the model introduces bias in qMR measurements. These effects may be due to additional relaxation mechanisms, incomplete approach to steady state, diffusion, spectral complexity, etc. Signal model imperfections can lead to poor reproducibility in qMR, as these effects will often manifest differently for varying experimental conditions, including different systems, field strengths, and acquisition parameters. Ideally, signal models should be based on specific biophysical assumptions about the tissues of interest. However, biophysical modeling is challenging in certain applications, and so signal “representations” are often used, which enable fitting of the acquired data but are not based on specific tissue models.⁶⁹ For example, the diffusion tensor representation provides a useful approximation to the diffusion-weighted MR signal at moderate b-values, but is not based on specific tissue modeling assumptions.⁶⁹ Such signal representations have demonstrated clinical value, but their quantitative performance needs cautious consideration. For example, bias may not be meaningful if the measurand does not represent a physical property of the tissue, and reproducibility across changes in acquisition parameters is often challenging.

3.4 Other artifacts and noise

A variety of additional imaging artifacts, including partial volume, slice profile imperfections, imperfect spoiling, parallel imaging artifacts, and noise can confound qMR methods. For example, imperfect slice profiles due to finite-duration excitation pulses lead to a distribution of flip angles across the slice and may also introduce crosstalk between slices.^{70, 71} In addition, noise in the acquired imaging data propagates into the subsequent qMR measurements. The propagation of noise is generally dependent on the acquisition parameters and the choice of signal model; more complicated models with many free parameters often result in higher noise amplification. Furthermore, manipulation of MR signals prior to qMR measurement can affect the noise distribution, which affects the bias and precision of qMR methods. For example, noise in complex MR data is well modeled by a Gaussian distribution. However, qMR sometimes relies on magnitude images (e.g., in diffusion MRI, or cardiac T₁ mapping as discussed throughout this paper). This magnitude operation has several important effects, including the elimination of phase information, and the introduction of an additional bias. Indeed, regions of low signal magnitude, as commonly observed in methods such as diffusion MRI or relaxometry, deviate substantially from a Gaussian noise distribution.⁷² If subsequent qMR processing implicitly assumes a Gaussian noise distribution (e.g., in methods that rely on least-squares fitting, as described below), bias and poor reproducibility may result from the inaccurate noise assumptions.^{73, 74}

For these reasons, different processing pipelines of the same data can lead to differences in the resulting qMR measurements. For example, even filtering of the image data can introduce biases in the quantification when nonlinear models are being used. Thus, using transparent open-source toolboxes or custom-built processing for qMR should be preferred over black-box tools, when reproducibility is targeted.

Finally, when using MR-based reference methods for validation of qMR, even the reference method itself may not be immune to the presence of confounding factors such as physiological effects and motion. This limitation of the reference method may complicate the evaluation of qMR bias in vivo.

3.5 Examples

3.5.1 Liver PDFF quantification

Quantification of PDFF is affected by multiple confounding factors, including:

T₁ recovery: The short T₁ relaxation time of fat compared to that of water in the liver can lead to bias (overestimation) of PDFF¹² in acquisitions that include T₁ weighting.
$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0028$ relaxation: $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0029$ decay across multiple echoes can appear as interference between fat and water signals, and therefore can introduce bias and poor reproducibility in PDFF quantification (see Figure 4).^{13, 14, 75, 76}
Spectral complexity of fat signals: Unlike water signals, which result in a single MR resonance, fat signals arise from protons located in various positions within the triglyceride molecule. These protons, in turn, lead to a multi-peak spectrum from fat.^{15, 77} If unaccounted for, this spectral complexity leads to bias and poor reproducibility across acquisition parameters, particularly the echo time combination.
Phase errors: Phase errors, such as those arising from eddy current effects, can introduce bias and poor reproducibility in PDFF quantification.^{78, 79}

**FIGURE 4**
Open in figure viewer PowerPoint

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0030$ decay, if uncorrected, can confound PDFF quantification, leading to bias, as well as poor precision (e.g., poor reproducibility across acquisitions with different number of echoes), particularly in patients with elevated liver $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0031$ = 1/ $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0032$ ( $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0033$ = 160 s⁻¹ at 1.5T, corresponding to mild iron overload). As shown though simulation and in vivo, $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0034$ -uncorrected signal fitting results are highly dependent on the choice of echo times. In contrast, $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0035$ -corrected PDFF quantification has low bias and high reproducibility across choices of echo times. For this illustration, a 12-echo liver CSE acquisition in a patient with high liver fat and iron overload was reprocessed retrospectively multiple times, using the first n echoes (for n = 5,…,12). In each case, both $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0036$ -uncorrected and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0037$ -corrected PDFF mapping methods were used

Over the past two decades, these and other confounding factors have been systematically identified, characterized and addressed using various acquisition and post-processing-based approaches (see the Technical Development section below).

3.6 Cardiac T₁

Quantitative cardiac imaging is particularly challenging due to the impact of cardiac and respiratory motion. Confounders, as described above, often have an imaging method-specific impact on the quantification. Additionally, subject-specific effects are often substantial. Various existing and emerging technical developments seek to mitigate these effects.^{55, 57, 80-83} The most relevant confounders for cardiac T₁ mapping include:

Heart rate: Acquisitions commonly need to be synchronized with the heartbeat to minimize cardiac motion effects. The subject-specific heart rate may influence the bias and/or precision. Introduction of alternative mapping schemes or heart rate-resilient timing has helped to alleviate this confounder.^{55, 57, 80}
k-Space acquisition: In some methods, only the effect of the magnetization preparation is modeled. In this case the disruption of the magnetization by radiofrequency (RF) pulses used for the k-space acquisition can cause bias in the quantification. Specifically, this may render the quantification susceptible to physiological factors such as T₂ relaxation⁵⁷ or magnetization transfer,⁸⁴ or system-related properties such as off-resonance⁸⁵ or flip angles.⁸⁶ The dependency on system-related parameters makes it paramount to use identical sequences and sequence parameters in cardiac T₁ mapping, when reproducibility across centers and MRI scanners is desired.
Partial-volume effect: Cardiac acquisitions are commonly limited in the achievable resolution due to motion constraints. This results in partial-voluming, where voxels are partially filled with different tissue types at tissue interfaces. As a result, the area that can reliably be evaluated is further reduced, rendering the quantification dependent on accurate delineation of the region of interest.^{87, 88}

4 TECHNICAL DEVELOPMENT AND VALIDATION

The development of qMR methods is typically an iterative process including design of acquisition, modeling and signal fitting methods. This technical development can be framed as an optimization of bias and precision metrics (see Figure 5), subject to specific constraints such as scan time, or hardware performance.

4.1 Acquisition

Once the basic physical mechanism to be probed has been selected and potential confounders have been identified, aspects of protocol design and optimization can be considered. A common goal is to select pulse sequences and parameters such that the measurand can be determined with low bias and high precision, subject to a set of timing, hardware, and other constraints. Acquisition design will often begin by selecting a pulse sequence where the measurand of interest can be directly probed, while minimizing the effect of confounding factors. Next, an acquisition that includes multiple scans with different parameters can be designed to enable estimation of the measurand. In applications where thermal noise is the dominant source of noise (as opposed to, e.g., physiological noise), acquisitions with higher imaging SNR may substantially improve bias¹² or precision.⁸⁹ The choice of acquisition parameters may be driven by heuristics, and also refined using quantitative tools such as sensitivity analysis,^{90, 91} or noise propagation analysis (e.g., Cramer-Rao lower bounds, CRLB).^{11, 92-94}

4.1.1 Liver PDFF quantification

The choice of pulse sequence for quantification of PDFF is driven by the desire to obtain chemical shift-encoded data with proton-density contrast (e.g., avoiding confounding effects due to T₁ and T₂ relaxation), and with rapid scan times (e.g., to enable whole-liver coverage in a single breath-hold while avoiding motion artifacts). For these reasons, the pulse sequence of choice for MRI-based liver PDFF quantification is a multi-echo spoiled gradient echo (SGRE) sequence, either using 2D multi-slice or 3D imaging.^{8, 17} In addition, small flip angles are used in order to avoid T₁ bias.¹² Other confounding factors are typically addressed by postprocessing/modeling (see below). Optimal acquisition parameters, such as echo times, have been determined using CRLB analysis.¹¹

4.1.2 Cardiac T₁

A multitude of pulse sequences for cardiac T₁ mapping have been proposed and novel method development remains an active area of research. Generally, these acquisitions can be decomposed into three integral parts: 1) contrast sensitization; 2) k-space acquisition; 3) motion compensation.

Typically, preparation pulses are used to sensitize the imaging signal to the T₁ relaxation time of the tissue. Inversion pulses are most widely used in T₁ mapping, including the commonly used MOLLI sequence and its variants.^{23, 24, 55} Saturation recovery has also been proposed with the potential to minimize bias caused by various confounding factors as described above.^{57, 80} However, due to a decreased dynamic range, saturation recovery preparation typically results in lower T₁ mapping precision as compared with inversion preparation.

Cardiac T₁ mapping is typically performed using multiple electrocardiogram (ECG)-triggered snapshot images, with each image obtained during a single diastolic quiescence, i.e., all k-space lines necessary for image reconstruction of one snapshot image are acquired in one heartbeat. To achieve optimal SNR as well as minimal disruption of the longitudinal magnetization recovery curve, balanced steady state free precession (bSSFP) readouts are the method of choice. Spoiled gradient echo readouts have also been explored to minimize sensitivity to off-resonance and field heterogeneities, albeit at the cost of reduced precision.⁹⁵ More recently, continuous imaging throughout the heartbeat have been proposed to allow cardiac phase-resolved T₁ mapping.^{81, 82, 96}

ECG triggering is almost universally used as the means for cardiac motion compensation in T₁ mapping, with few notable exceptions.^{96, 97} Various schemes have been explored for respiratory motion compensation. Clinically available T₁ mapping methods usually acquire a single-slice T₁ map in a single breath-hold.²⁶ However, free-breathing methods have also been explored with diaphragmatic navigator gating, tracking or self-gating.^97-99 Importantly, free-breathing sequences allow for the acquisition of multiple slices or 3D volumes and can be used to enable T₁ mapping with increased spatial resolution.^{83, 100, 101}

4.2 Model selection

Many qMR methods rely on parametric mapping using a signal model that relates the acquired data to the underlying measurand. Selection of a signal model is typically an iterative process and seeks to balance bias and precision. Often the process begins with identifying the relevant degrees of freedom in the underlying tissue,⁶⁹ such that these tissue properties can be related to, and estimated from, acquired MR signals. For example, this step may involve the identification of the major pools of nuclei with shared properties that will reasonably contribute to the signal. These pools can describe physical compartments, such as “intracellular” compartments, or local molecular environments such as lipid protons. The Bloch equations, describing the response to RF energy and relaxation properties, are defined for each pool. Next, a model may consider whether nuclei can travel between pools by chemical exchange or diffusion, or interact magnetically with other pools due to proximity and, thus, define the exchange kinetics. Models commonly describe the signal within a voxel independently of its spatial neighborhood, but one may also need to consider the influence of the neighboring voxels (e.g., in quantitative susceptibility mapping,¹⁰² or electrical properties tomography^{103, 104}).

The next step is often to evaluate the signal model under modifications of the acquisition pulse sequence. It is often helpful to develop a working model for simple excitation-readout with Cartesian acquisition and long repetition time (TR) before considering advanced k-space trajectories or pulse trains. The requirements for each measurand are different, but major considerations in the presence of increasingly advanced pulse sequences may include relaxation effects, B₀ and B₁ heterogeneities, etc. It may also be necessary to consider the need for steady-state or non-steady state modeling. Finally, any signal manipulations that occur before analysis, such as magnitude operation or spatial filtering, need to be included.

In subsequent iterations, confounding factors are often identified and addressed through acquisition- and/or modeling-based refinements. Importantly, qMR methods necessarily use simplified models of the actual underlying physics. For this reason, it is always possible to “enhance” the models by including additional unknown parameters. However, these signal model enhancements generally lead to increased challenges in the parameter estimation (particularly noise amplification, sensitivity to artifacts, and computation time). Practical signal models, therefore, seek a balance between accurately capturing the underlying physics and enabling stable quantification within acceptable computation times. Once a satisfactory model is achieved, this model often needs to be re-evaluated upon subsequent refinements of the qMR method, including accelerated acquisitions.

4.2.1 Liver PDFF quantification

A widely used signal model for PDFF quantification in the liver includes^{8, 15, 105}:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0038$ (6)

where $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0039$ and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0040$ are the proton density-weighted signal amplitudes of water and fat, respectively; fat signals are modeled as a pre-calibrated spectrum including M peaks with known relative amplitudes $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0041$ and frequency offsets $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0042$ ⁷⁷; initial phase $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0043$ ; B₀ related off-resonance frequency $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0044$ ; and transverse relaxation time $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0045$ . Upon data fitting (see below), this signal model allows estimation of $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0046$ and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0047$ , which lead to the calculation of PDFF as:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0048$ (7)

Importantly, the widely used signal model in Equation (6) constitutes a balance between bias and precision (noise performance).¹⁰⁶ For example, this model addresses the spectral complexity of the fat signal by using a multi-peak signal model and also accounts for $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0049$ decay. If unaccounted for, both of these effects have been shown to lead to substantial bias and poor reproducibility in PDFF quantification.¹⁷ However, the model in Equation (6) typically relies on a pre-calibrated multi-peak fat spectrum, where the relative frequencies and amplitudes of the fat peaks are assumed known a priori,¹⁰⁷ and also assumes a common $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0050$ decay time for water and all fat peaks.¹⁰⁶ Each of these approximations help maintain acceptable noise performance and precision for PDFF quantification by limiting the number of unknown parameters, even though they may introduce a small bias in the estimation of liver PDFF when the model assumptions do not hold exactly.

4.2.2 Cardiac T₁

T₁ recovery is thoroughly studied and can be accurately described by the well-known phenomenological Bloch relaxation equations. However, the signal model needs to be adapted to the specific imaging sequence, as summarized next.

In the widely used modified Look-Locker inversion recovery (MOLLI), an adaptation of the standard inversion recovery model is commonly employed to describe the signal S(t) at different inversion times t:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0051$ (8)

Here A and B describe fit parameters and $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0052$ is the apparent relaxation time. The T₁ estimate is then extracted as $urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0053$ . This adaptation is inspired by Deichmann et al¹⁰⁸ and aims to reduce the effect of the RF pulses used for the k-space acquisition on the quantification. However, the acquisition commonly deviates from the assumptions underlying this correction, inducing residual susceptibility to various effects related to the RF pulses during the k-space acquisition. Numerical models have also been proposed to approximate the magnetization evolution during the k-space acquisition, using for example Bloch equation simulations or additional parameters.^{109, 110} While these methods commonly achieve lower bias, their applicability might be limited, and precision may be compromised.

A second class of myocardial T₁ mapping methods is based on saturation recovery, which commonly relies on the following three-parameter model:

$urn:x-wiley:07403194:media:mrm29084:mrm29084-math-0054$ (9)

This saturation recovery-based approach has been shown to compensate for the effects of the RF pulses used for the k-space acquisition,⁵⁷ which in turn enables cardiac T₁ mapping with reduced bias. It has also been suggested to omit the B parameter to obtain a two-parameter model, in order to improve precision at the cost of bias in saturation recovery T₁ mapping.⁵⁵

4.3 Model fitting

Fitting a signal model to acquired data can typically be described in terms of a formulation and an algorithm, as described next.

The formulation is often an optimization problem, which describes in what sense the model should fit the acquired data. For example, least-squares fitting is often used for various linear or nonlinear models in quantitative MR. For nonlinear models based on exponential functions, a logarithm of the data is sometimes calculated to linearize the problem. This linearization simplifies the optimization, although it affects the noise propagation and may require additional manipulations to avoid excessive noise influence from low-SNR data points.¹¹¹ Furthermore, some formulations rely on the acquired complex data, whereas others use magnitude data.⁷⁹ Finally, the formulation may be constrained (where the set of allowable parameters is restricted based on physical or noise propagation considerations) or unconstrained. In addition to least-squares fitting, other formulations can be used, including those required for maximum-likelihood estimation in the presence of non-Gaussian noise.

Once a formulation is selected, an algorithm needs to be selected to solve the corresponding optimization problem. Depending on the formulation, various closed-form or iterative algorithms are typically available. Variations of Newton’s method, including Levenberg-Marquardt and Gauss-Newton algorithms, constitute common choices for iterative optimization.¹¹² An ideal algorithm would be efficient (i.e., fast and requiring low resources) and would lead to the global solution of the optimization problem described in the formulation.

4.3.1 Liver PDFF quantification

Model fitting for PDFF mapping is typically performed using nonlinear least-squares fitting of the signal model (Equation 6) to the acquired multi-echo data, followed by calculation of PDFF at each pixel (Equation 7). Multi-echo data are often corrupted by phase errors that are inconsistent across echoes.^{78, 79} For this reason, some or all of the phase information is often discarded to avoid PDFF bias, and algorithms often rely partly on fitting the signal magnitude, instead of the original complex-valued signals. Magnitude fitting leads to reduced bias by avoiding phase related PDFF errors at the cost of reduced noise performance and precision (by discarding half of the acquired information, i.e., the phase).

4.3.2 Cardiac T₁

Basic model fitting in cardiac T₁ mapping is also most commonly performed using magnitude-based nonlinear least-squares fitting. When unsigned magnitude images are used in an inversion-recovery model, the signal polarity information is lost. To resolve this issue the images are commonly ordered by the inversion time and the polarity can be restored heuristically, by successively flipping the sign in the ordered sequence and accepting the solution with the lowest fit residual.²⁰

However, this process might introduce additional noise variability. It has been proposed to incorporate phase information to perform hybrid fitting on a signed magnitude. Here, the background phase is extracted from a fully relaxed image, and the phase difference to other T₁ weighted images can be used to restore the signal polarity.¹¹³

5 CLINICAL QUALIFICATION

For qMR methods, in addition to technical validation, which measures the bias and precision of the quantitative measurements, it is essential to perform clinical qualification (Figure 6).¹¹⁴ Clinical qualification seeks to establish the relationship between the qMR measurement and biological processes or clinical endpoints, as needed to determine the clinical utility of the method, e.g., whether it enables screening, diagnosis, staging, prognosis, or treatment monitoring for a particular condition and target population.^115-119 For example, rather than focusing on technical performance metrics of bias and precision, clinical qualification may focus on metrics such as sensitivity, specificity, negative / positive predictive value, prediction accuracy, or odds ratio.^{114, 120} Upon successful clinical evaluation for a specific application, qMR methods may lead to qualified quantitative biomarkers (see Glossary in Supporting Information Table S1, which is available online).⁶

This marks an important distinction: while a qMR measurand usually relates to a physical property, this measurand, when being used as a clinical biomarker, indicates pathophysiological alterations or other changes in the physiological state. Often numerous biological and physiological processes affect the underlying physical property. Thus, a single qMR measurand can be qualified as a biomarker for multiple disease entities. In this case, while being sensitive to multiple diseases, the measurand may not be specific to any one physiological alteration.

There are strong connections between technical validation and clinical qualification. For example, a qMR method with poor precision (e.g., poor test-retest repeatability) will likely also have poor sensitivity and specificity for detection of a specific condition. However, these are also important distinctions between both types of evaluation. For example, it is possible to develop a qMR method with excellent technical performance (low bias and high precision) for quantifying a measurand; however, this method may have poor clinical performance for a specific application, e.g., due to underlying biological variability that complicates the relationship between the measurand and the clinical endpoint of interest, such as survival, disease-free survival, or various surrogate endpoints.^121-123 Furthermore, a biased method may reduce the desired effect size, as the bias itself may be different for various patient populations. Alternatively, it is possible that a biased (confounded) qMR method provides larger effect sizes for a specific disease entity than an unbiased measurement, e.g., if the confounders themselves are sensitive to the physiological alteration (see Figure 7). However, it is important to note that this enhancement usually comes at the cost of strongly reduced reproducibility as the variability in the bias is difficult to control.

For these reasons, it is essential to conduct both technical validation and clinical qualification of qMR methods. This need further highlights the importance of multi-disciplinary collaboration between technical imaging researchers, translation-focused radiologists, and other clinicians.

5.1 Examples

5.1.1 Liver PDFF quantification

Liver PDFF has been shown to be correlated with histologic steatosis grade. For example, PDFF can classify histologic steatosis (grade 0 vs. 1–3) with sensitivity 0.93 and specificity 0.94.¹²⁴ Also, MRI-based liver PDFF quantification is emerging as a useful biomarker to assess longitudinal changes in liver fat within clinical trials.¹²⁵ Furthermore, a reduction of MRI-PDFF by 30% is associated (odds ratio 6.98) with histologic improvement in NAFLD Activity Score.¹²⁶

Liver PDFF values may be predictive of pediatric metabolic syndrome.¹²⁷ In addition, liver steatosis is associated with cardiovascular diseases^{128, 129}; for instance, liver fat is an independent risk factor (odds ratio 2.1) for high-risk plaque¹²⁸ and other cardiovascular risk factors.¹²⁹ Importantly, now that the required MRI technical development is mature and PDFF mapping methods are widely available, determination of the association between liver PDFF and various clinical outcomes constitutes an active area of research.

5.1.2 Cardiac T₁

Native myocardial T₁ times, in the absence of a contrast agent, have been evaluated against histologically determined fibrosis from myocardial biopsies in vivo and total collagen volume in animal studies¹³⁰ and heart transplant patients.¹³¹ Variable degrees of correlation ranging from moderate to high have been reported depending on the disease model,¹³¹ indicating that T₁ times are not reflective of fibrosis alone but of a number of factors. Extracellular volume (ECV) calculated from native T₁, post-contrast T₁, and hematocrit, generally showed better correlation to the amount of fibrosis but variability among disease models and studies remains. Accordingly, the clinical context needs to be considered in the interpretation of both native T₁ and ECV, and alteration in either measurand cannot be directly linked to a single specific physiological process.^{20, 132-134}

Nonetheless, cardiac T₁ mapping-related markers have demonstrated high clinical diagnostic and prognostic value in an unexpectedly wide range of disease entities.^{25, 135} For example, in cardiac amyloidosis ECV showed excellent sensitivity and specificity (0.93 and 0.87, respectively) and an odds-ratio of 84.6.¹³⁶ In patients with an acute infarct quantitative assessment of normal appearing myocardium in patients with an acute infarct using native T₁ or ECV has proven to be a better predictor for all-cause mortality or major cardiac events than any other cardiac MRI marker,¹³¹ and accurate differentiator between reversible and irreversible myocardial damage (96.7% prediction accuracy).

Interestingly, MOLLI T₁ mapping, which is most common in clinical use, is known to exhibit a large bias. However, it has been suggested that certain confounders to MOLLI T₁ measurements may enhance clinical sensitivity.¹³⁷ As illustrated in Figure 7, this somewhat counter-intuitive phenomenon arises because some confounders (e.g., magnetization transfer^{84, 137, 138}) are sensitive to pathological alterations, leading to inflated effect size in certain disease entities compared to unbiased measurements. However, as a result of these confounders, the measurand in MOLLI is highly dependent on the sequence parameters (e.g., TR, flip angle, slice profile), the scanner specifications and tissue properties that are not related to the tissue T₁ time. Hence, this inflated effect size is obtained at the cost of reduced reproducibility.

6 DISSEMINATION IN RESEARCH AND IN THE CLINIC

Although qMR methods have shown great potential to guide clinical decision-making and patient management for improved patient care and outcomes, very few qMR methods are used in routine clinical practice. Many promising qMR methods are only described in the scientific literature, without translation in clinical research studies or clinical use.

As recently stated in the Imaging Biomarker Roadmap for Cancer Studies,¹³⁹ all imaging biomarkers, including quantitative MRI biomarkers, need to cross two “translational gaps” before they are ready to guide clinical decision making. These gaps are crossed through increasing technical validation and clinical qualification, as well as assessment of cost effectiveness and other considerations. Once technical and clinical performance evaluation demonstrate the reliability of a qMR biomarker to test medical research hypotheses, this biomarker can cross the first gap and become a useful “medical research tool”. At this stage, substantial additional validation and qualification are still needed in order to achieve clinical impact as a screening, diagnostic, or predictive biomarker. This may include validation of multi-center reproducibility,¹⁴⁰ and large prospective clinical trials to demonstrate improved clinical diagnosis or outcomes. In combination with cost effectiveness and other considerations, these activities enable a biomarker to cross the second translational gap and become a “clinical decision-making tool” that influences patient care. Importantly, application of qMR methods in research or in the clinic requires rigorous quality assurance and quality control procedures.^141-143

During the three processes of (1) technical development and validation, (2) clinical qualification, and (3) dissemination, additional substantial challenges (including regulatory issues and market-related factors) often arise before clinical dissemination is achieved (Figure 6). For example, the ability of healthcare providers to obtain reimbursement or take charge of the costs associated with quantitative imaging biomarkers may drive the clinical use of these tools. Oftentimes, a lack of CE/FDA (or equivalent) labeling limits the ability to apply a biomarker in clinical practice.

6.1 Examples

6.1.1 Liver PDFF quantification

CSE-based liver PDFF quantification has emerged as a major clinical and research tool to determine liver fat content. Importantly, this qMR method is commercially available on systems from various MRI vendors, including regulatory approvals such as FDA clearance and CE mark. A liver PDFF quantification profile is currently being developed by the Radiological Society of North America’s Quantitative Imaging Biomarkers Alliance (RSNA QIBA).¹⁴⁴ Now that liver PDFF quantification methods have shown excellent technical performance (low bias, high precision), clinically relevant results are emerging, including population studies measuring prevalence in various populations¹⁴⁵ and clinical studies showing the prognostic value of PDFF.¹⁴⁶

6.1.2 Cardiac T₁

In recent years, cardiac T₁ mapping has become widely available on most clinical MRI systems. Some vendors have released dedicated product packages comprising one or more T₁ mapping methods, while others have provided prototype methods. Several cardiac T₁ mapping methods have regulatory approval such as FDA clearance and CE mark.

T₁ mapping is widely used in cardiac MRI in academic centers and beyond. It has been successfully applied to an unexpectedly large spectrum of ischemic and non-ischemic cardiomyopathies^{20, 131} and is established as part of routine scanning in numerous clinical cardiac MR protocols. The effect of most heart diseases on myocardial T₁ has been investigated, mostly in single center studies. Select pathologies have been studied in large cohorts or multi-center studies, including studies on amyloidosis¹⁴⁷ and Anderson-Fabry disease.¹⁴⁸ Additionally, cardiac T₁ mapping has been adopted in multiple national cohorts, including the UK biobank protocol and the German national cohort.^{149, 150} These studies are some of the largest ongoing MRI projects to date. Following the clinical success demonstrated in the literature, cardiac T₁ mapping was adopted in disease specific clinical guidelines.¹⁵¹ Further increases in clinical integration and use in a growing number of cardiac MRI protocols are likely.

7 RELATED INITIATIVES, CHALLENGES, AND OPPORTUNITIES

As described above, substantial efforts are needed for the development, validation and dissemination of quantitative MR techniques. These efforts require collaboration between technical researchers, translational researchers and clinicians, industry, and initiatives and institutions dedicated to the regulation and guidance of quantitative imaging measurements. Such initiatives and institutions include authorities for standardization of measurements such as Italy’s Istituto Nazionale di Ricerca Metrologica (INRIM), the Korea Research Institute of Science and Standards (KRISS), the U.S.’s National Institute of Standards and Technology (NIST), the UK’s National Physical Laboratory (NPL), Germany’s Physikalisch-Technische Bundesanstalt (PTB), and for the advancement of the development and use of imaging biomarkers, as performed by QIBA, the US National Cancer Institute through the Quantitative Imaging Network (QIN), the European Imaging Biomarkers Alliance (EIBALL), Japan-QIBA, the European Society of Radiology (ESR), and the ISMRM. Specifically, the major goal of the ISMRM Quantitative MR Study Group is to promulgate documentary and measurement standards for qMR methods in collaboration with national metrology institutes, academic and clinical MR sites, and through collaboration with existing study groups. Furthermore, ongoing qMR improvements occur in the context of broad efforts to evaluate and optimize the value of MRI in medicine.¹⁵² Ultimately, efforts to develop qMR methods should have broad value in medicine across countries and populations, beyond specialized research centers.

In addition, the development and validation of qMR methods is closely connected to the improvement of the reproducibility of MR research itself. There are multiple existing and emerging initiatives in this area, including the ISMRM Reproducible Research Study Group, and reproducibility has recently been emphasized by major journals, such as Magnetic Resonance in Medicine or the Journal of Magnetic Resonance Imaging. A related set of benchmarks for validation of quantitative imaging tools has been described by the Quantitative Imaging Network of the US’ National Cancer Institute.¹⁵³ Importantly, multiple consensus efforts and community challenges have emerged in recent years for specific qMR methods or applications, as well as for general optimization of qMR.^{6, 20, 37, 142, 154-163} Data standards are also essential for reproducibility and interoperability, making it easier to create transparent qMR workflows. Two standards that are relevant for qMR are the ISMRM-Raw Data format (https://ismrmrd.github.io/apidocs/1.5.0/), and the Brain Imaging Data Structure (BIDS) extension proposal for quantitative MRI (https://github.com/bids-standard/bids-specification/pull/508).

Furthermore, various software packages developed, maintained and used by the community enable improved reproducibility by standardizing data processing pipelines. Supporting Information Figure S6 gives an overview of the user base of publicly available software with applications in qMR. Examples include various toolboxes hosted on the Matlab Central File exchange (https://www.mathworks.com/matlabcentral/fileexchange/), FSLTools (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FslTools), OsiriX or Horos plugins, ImageJ (https://imagej.nih.gov/ij/), Bay Area Reconstruction Toolbox (BART, https://mrirecon.github.io/bart/), qMRlab (https://qmrlab.org),¹⁶⁴ Gadgetron (http://gadgetron.github.io/), Quantitative Imaging Tools (https://github.com/spinicist/QUIT), Michigan Image Reconstruction Toolbox (MIRT, http://github.com/JeffFessler/MIRT.jl), hMRI (https://hmri-group.github.io/hMRI-toolbox/), LCModel (http://s-provencher.com/lcmodel.shtml), Total Mapping Toolbox (TOMATO) (https://mrkonrad.github.io/TOMATO/html), QMRI Tools (https://mfroeling.github.io/QMRITools/), and others (e.g., vendor proprietary software and in-house or personal code). RSNA QIBA and NIBIB have also sponsored the development of digital reference objects (DROs), which enable the testing of analysis tools to assess their bias and precision when working with quantitative data obtained with different acquisition parameters and varying levels of SNR. Example DROs for DCE-MRI and DWI are available from QIBA (https://qidw.rsna.org).

The transformation of MR into a truly quantitative diagnostic modality has enormous potential to impact research and clinical care. However, the development, validation, and dissemination of quantitative MR methods is faced with multiple challenges, particularly the complexity and cost of the required validation studies, as highlighted by the above networks and initiatives. These challenges reinforce the need for collaboration between technical MR researchers, academic radiologists, and other clinicians, as well as industry, such as original equipment manufacturers (OEMs)—vendors of MR systems and other MR equipment and software, pharmaceutical companies, contract research organizations, and others.

Finally, substantial recent efforts from the qMR community have focused on rapid multi-parametric mapping and machine learning (ML). Emerging multi-parametric mapping methods such as MR fingerprinting¹⁶⁵ and multi-tasking⁸¹ enable quantitative mapping of several parameters with short scan times. These methods are highly promising for a variety of applications, and require careful development and validation to address a large number of potential confounding factors. ML methods, including radiomics and deep learning, have recently gained enormous interest in the field. Indeed, ML may contribute to different stages of the qMR pipeline, including image prescription, acquisition, reconstruction, post-processing, measurement, and analysis. Despite the potential impact of these methods, rigorous development and validation of ML-enabled qMR is needed. This development and validation pose new challenges and opportunities for ML-enabled qMR, including how to quantify and address confounding factors to achieve low bias and high reproducibility across patients, sites, and vendors, in much the same way as the more ‘traditional’ qMR methods highlighted in the present manuscript.

8 SUMMARY AND CONCLUSIONS

On behalf of the International Society for Magnetic Resonance in Medicine (ISMRM) Quantitative MR Study Group, this manuscript describes a framework for the development and validation of quantitative MR methods. With a focus on technical performance metrics (bias and precision), this framework highlights the challenges as well as the research opportunities associated with quantitative MR methods. Overall, rigorous development and validation are critical components of the transformation of MR into a truly quantitative diagnostic modality. A summary of concluding recommendations to achieve this aim is provided in Table 3. Upon successful implementation of qMR methods, as well as clinical qualification of qMR-based biomarkers, qMR has the potential to substantially advance imaging in clinical applications and clinical research, and build a cornerstone of precision radiology.

TABLE 3. Recommendation for qMR development, validation and application

Definitions	The measurand of interest needs to be clearly defined. How does the targeted measurand relate to other physical properties? For example, if a coefficient is determined, what is the reference quantity (e.g., MR-visible protons)?
Choice of pulse sequence	Select pulse sequences and parameters such that the measurand can be determined with low bias and high precision, subject to a set of timing, hardware, and other constraints. Acquisition design will often begin by selecting a pulse sequence where the measurand of interest can be directly probed, while minimizing the effect of confounding factors
Choice of models	Proper biophysical modeling is difficult, but may avoid the pitfalls of various signal representations. Indeed, various models can often fit the data, but models that are not grounded on specific tissue assumptions are often more difficult to validate, and are also likely to suffer from poor reproducibility
Rigorous validation	It is critical to perform systematic validation of the technical performance of emerging qMR methods. Importantly, even though early stage validation is often focused on bias, evaluation of precision (repeatability and reproducibility) is essential to enable further clinical qualification and dissemination
Structured evaluation	Well-structured reporting of the validation is an essential component of establishing a qMR method. The standard metrics being evaluated should be described clearly as discussed in the section “Technical Performance of qMR Methods” above. Future work from the community may establish a standardized structure for the Methods and Results sections of qMR manuscripts
Real-world validation	Even at the stage of technical validation, it is important to evaluate the performance of qMR methods under conditions that are relevant to the real-world clinical environment. For example, performance may depend on the hardware available at different sites (e.g., academic vs non-academic). In addition, technical validation in a relevant patient population helps pave the way for subsequent clinical qualification and application
Focus on reproducibility	Optimization and characterization of reproducibility across acquisition protocols, field strength, vendor, platform, etc, is critical in qMR. Indeed, in qualitative MR methods development, one is often interested in finding the optimal set of acquisition and processing parameters to maximize imaging performance (e.g., resolution, SNR). Although this optimal set of parameters is also relevant in qMR, the development of qMR methods that are reproducible across variations in the acquisition parameters is arguably even more important than the identification of the optimal parameters. This way, qMR methods are best suited for widespread dissemination across sites that may not be able to implement exactly optimized acquisitions
Reproducibility vs. standardization	Certain qMR methods are highly reproducible across variations in acquisition parameters (within a certain range). For example, this is the case for PDFF measurement in the liver: when correcting for all relevant confounders, PDFF measurement is highly reproducible across field strength, echo time combinations, spatial resolution, and various other acquisition parameters. However, other qMR methods have poorer reproducibility, and their widespread dissemination would benefit highly from standardization of acquisitions (as well as processing) across sites and systems, as well as harmonization (see below)
Harmonization	Quantitative MR can benefit from harmonized acquisitions and tools. For example, standardized reference objects and tool validation methods, such as the use of DROs, provide common ground for comparison of imaging protocols across sites, vendors, and software analysis packages
Realistic time-horizon	Development, validation, qualification, and dissemination of a qMR method is a slow, iterative process that may take more than a decade
Consider the end goal	In qMR, the end goal is often to enable improved diagnosis, staging, and/or treatment monitoring of disease and generate increased value in the clinical work up. This goal is generally relevant even for technically focused researchers
Mind the translational gap	Establishing new quantitative imaging biomarkers in actual clinical use is a lengthy process and requires many steps that may not be amenable to funding by traditional science-oriented grants or tenders. This may require creativity about the path to clinical integration and real-world use
Clinical qualification is key	Clinical qualification is critical to achieve translation of a quantitative MR method to the clinic. This may be the most time-consuming step in the entire pipeline of qMR method development and evaluation
Collaboration	Working together with stakeholders (technical, clinical, industrial) and across imaging modalities or scientific disciplines is critical. For example, accurate biophysical modelling will benefit from collaboration between clinical and preclinical MR scientists, but also between MR researchers and scientists studying tissues at smaller scales (e.g., cell cultures) or using different imaging technology (e.g., X-ray phase contrast imaging for tissue structure, near infrared spectroscopy for blood oxygenation properties, or microscopy). Collaboration with clinicians is of enormous value in qMR technique development, and helps create a virtuous loop of refinement of existing methods and conception of new methods that address existing clinical needs. Furthermore, early stage discussion and cooperation with industry is especially relevant, since CE/FDA-labeling is mandatory for clinical translation. A technique without labelling will not be widely adopted in clinical practice due to ethical concerns and regulatory issues

ACKNOWLEDGMENTS

The authors thank the entire ISMRM Quantitative MR Study Group, including those who responded to the online poll, as well as those who provided feedback during the public presentation on June 23, 2021, for their thoughtful feedback, which has led to an improved manuscript. We also acknowledge the efforts of the endorsers, as listed in the Supporting Information Table S2. We also thank the ISMRM Publications Committee for reviewing the work and coordinating the ISMRM Board of Trustees approval process.

CONFLICT OF INTEREST

Nancy A. Obuchowski is a paid consultant for RSNA’s Quantitative Imaging Biomarker Alliance. Bettina Baessler is co-founder of Lernrad GmbH, Germany. Xavier Golay is a co-founder, CEO and shareholder of Gold Standard Phantoms. Diego Hernando is co-founder of Calimetrix, LLC.

Open Research

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article as no new data were obtained or analyzed in this study.

Supporting Information

REFERENCES

1 Biomarkers Definitions Working G. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther. 2001; 69: 89-95.
10.1067/mcp.2001.113989
PubMed Web of Science® Google Scholar
2Abramson RG, Burton KR, Yu JP, et al. Methods and challenges in quantitative imaging biomarker development. Acad Radiol. 2015; 22: 25-32.
10.1016/j.acra.2014.09.001
PubMed Web of Science® Google Scholar
3 M Cercignani, N Dowell, P Tofts, editors. Quantitative MRI of the brain: principles of Physical measurement, 2nd ed. CRC Press; 2018.
Google Scholar
4Cui Y, Zhang XP, Sun YS, Tang L, Shen L. Apparent diffusion coefficient: potential imaging biomarker for prediction and early detection of response to chemotherapy in hepatic metastases. Radiology. 2008; 248: 894-900.
10.1148/radiol.2483071407
PubMed Web of Science® Google Scholar
5Abramson RG, Arlinghaus LR, Dula AN, et al. MR imaging biomarkers in oncology clinical trials. Magn Reson Imaging Clin N Am. 2016; 24: 11-29.
10.1016/j.mric.2015.08.002
PubMed Web of Science® Google Scholar
6deSouza NM, Achten E, Alberich-Bayarri A, European Society of R. et al. Validated imaging biomarkers as decision-making tools in clinical trials and routine practice: current status and recommendations from the EIBALL* subcommittee of the European Society of Radiology (ESR). Insights Imaging. 2019; 10: 87.
10.1186/s13244-019-0764-0
PubMed Web of Science® Google Scholar
7Modell B, Khan M, Darlison M, Westwood MA, Ingram D, Pennell DJ. Improved survival of thalassaemia major in the UK and relation to T2* cardiovascular magnetic resonance. J Cardiovasc Magn Reson. 2008; 10: 42.
10.1186/1532-429X-10-42
PubMed Web of Science® Google Scholar
8Reeder SB, Sirlin CB. Quantification of liver fat with magnetic resonance imaging. Magn Reson Imaging Clin N Am. 2010; 18: 337-357, ix.
10.1016/j.mric.2010.08.013
PubMed Web of Science® Google Scholar
9Dixon WT. Simple proton spectroscopic imaging. Radiology. 1984; 153: 189-194.
10.1148/radiology.153.1.6089263
CAS PubMed Web of Science® Google Scholar
10Reeder SB, Wen Z, Yu H, et al. Multicoil Dixon chemical species separation with an iterative least-squares estimation method. Magn Reson Med. 2004; 51: 35-45.
10.1002/mrm.10675
CAS PubMed Web of Science® Google Scholar
11Pineda AR, Reeder SB, Wen Z, Pelc NJ. Cramer-Rao bounds for three-point decomposition of water and fat. Magn Reson Med. 2005; 54: 625-635.
10.1002/mrm.20623
PubMed Web of Science® Google Scholar
12Liu CY, McKenzie CA, Yu H, Brittain JH, Reeder SB. Fat quantification with IDEAL gradient echo imaging: correction of bias from T(1) and noise. Magn Reson Med. 2007; 58: 354-364.
10.1002/mrm.21301
CAS PubMed Web of Science® Google Scholar
13Yu H, McKenzie CA, Shimakawa A, et al. Multiecho reconstruction for simultaneous water-fat decomposition and T2* estimation. J Magn Reson Imaging: JMRI. 2007; 26: 1153-1161.
10.1002/jmri.21090
CAS PubMed Web of Science® Google Scholar
14O'Regan DP, Callaghan MF, Wylezinska-Arridge M, et al. Liver fat content and T2*: simultaneous measurement by using breath-hold multiecho MR imaging at 3.0 T–feasibility. Radiology. 2008; 247: 550-557.
10.1148/radiol.2472070880
PubMed Web of Science® Google Scholar
15Yu H, Shimakawa A, McKenzie CA, Brodsky E, Brittain JH, Reeder SB. Multiecho water-fat separation and simultaneous R2* estimation with multifrequency fat spectrum modeling. Magn Reson Med. 2008; 60: 1122-1134.
10.1002/mrm.21737
CAS PubMed Web of Science® Google Scholar
16Hamilton G, Middleton MS, Bydder M, et al. Effect of PRESS and STEAM sequences on magnetic resonance spectroscopic liver fat quantification. J Magn Reson Imaging. 2009; 30: 145-152.
10.1002/jmri.21809
CAS PubMed Web of Science® Google Scholar
17Meisamy S, Hines CD, Hamilton G, et al. Quantification of hepatic steatosis with T1-independent, T2-corrected MR imaging with spectral modeling of fat: blinded comparison with MR spectroscopy. Radiology. 2011; 258: 767-775.
10.1148/radiol.10100708
PubMed Web of Science® Google Scholar
18Schwarzbauer C, Syha J, Haase A. Quantification of regional blood volumes by rapid T1 mapping. Magn Reson Med. 1993; 29: 709-712.
10.1002/mrm.1910290521
CAS PubMed Web of Science® Google Scholar
19Everett RJ, Stirrat CG, Semple SI, Newby DE, Dweck MR, Mirsadraee S. Assessment of myocardial fibrosis with T1 mapping MRI. Clin Radiol. 2016; 71: 768-778.
10.1016/j.crad.2016.02.013
CAS PubMed Web of Science® Google Scholar
20Messroghli DR, Moon JC, Ferreira VM, et al. Clinical recommendations for cardiovascular magnetic resonance mapping of T1, T2, T2* and extracellular volume: a consensus statement by the Society for Cardiovascular Magnetic Resonance (SCMR) endorsed by the European Association for Cardiovascular Imaging (EACVI). J Cardiovasc Magn Reson. 2017; 19: 75.
10.1186/s12968-017-0389-8
PubMed Web of Science® Google Scholar
21Mewton N, Liu CY, Croisille P, Bluemke D, Lima JA. Assessment of myocardial fibrosis with cardiovascular magnetic resonance. J Am Coll Cardiol. 2011; 57: 891-903.
10.1016/j.jacc.2010.11.013
PubMed Web of Science® Google Scholar
22Amano Y, Takeda M, Tachi M, Kitamura M, Kumita S. Myocardial fibrosis evaluated by Look-Locker and late gadolinium enhancement magnetic resonance imaging in apical hypertrophic cardiomyopathy: association with ventricular tachyarrhythmia and risk factors. J Magn Reson Imaging. 2014; 40: 407-412.
10.1002/jmri.24357
PubMed Web of Science® Google Scholar
23Messroghli DR, Radjenovic A, Kozerke S, Higgins DM, Sivananthan MU, Ridgway JP. Modified Look-Locker inversion recovery (MOLLI) for high-resolution T1 mapping of the heart. Magn Reson Med. 2004; 52: 141-146.
10.1002/mrm.20110
PubMed Web of Science® Google Scholar
24Piechnik SK, Ferreira VM, Dall'Armellina E, Cochlin LE, Greiser A, Neubauer S, Robson MD. Shortened Modified Look-Locker Inversion recovery (ShMOLLI) for clinical myocardial T1-mapping at 1.5 and 3 T within a 9 heartbeat breathhold. J Cardiovasc Magn Reson. 2010; 12: 69.
10.1186/1532-429X-12-69
PubMed Web of Science® Google Scholar
25Radenkovic D, Weingartner S, Ricketts L, Moon JC, Captur G. T1 mapping in cardiac MRI. Heart Fail Rev. 2017; 22: 415-430.
10.1007/s10741-017-9627-2
PubMed Web of Science® Google Scholar
26Higgins DM, Moon JC. Review of T1 mapping methods: comparative effectiveness including reproducibility issues. Curr Cardiovasc Imaging Rep. 2014; 7: 9252.
10.1007/s12410-013-9252-y
Google Scholar
27Kessler LG, Barnhart HX, Buckler AJ, Group QTW, et al. The emerging science of quantitative imaging biomarkers terminology and definitions for scientific studies and regulatory submissions. Stat Methods Med Res. 2015; 24: 9-26.
10.1177/0962280214537333
PubMed Web of Science® Google Scholar
28Sullivan DC, Obuchowski NA, Kessler LG, Group R-QMW, et al. Metrology standards for quantitative imaging biomarkers. Radiology. 2015; 277: 813-825.
10.1148/radiol.2015142202
PubMed Web of Science® Google Scholar
29Raunig DL, McShane LM, Pennello G, Group QTPW, et al. Quantitative imaging biomarkers: a review of statistical methods for technical performance assessment. Stat Methods Med Res. 2015; 24: 27-67.
10.1177/0962280214537344
PubMed Web of Science® Google Scholar
30Obuchowski NA, Reeves AP, Huang EP, Algorithm Comparison Working G, et al. Quantitative imaging biomarkers: a review of statistical methods for computer algorithm comparisons. Stat Methods Med Res. 2015; 24: 68-106.
10.1177/0962280214537390
PubMed Web of Science® Google Scholar
31Keenan KE, Ainslie M, Barker AJ, et al. Quantitative magnetic resonance imaging phantoms: a review and the need for a system phantom. Magn Reson Med. 2018; 79: 48-61.
10.1002/mrm.26982
PubMed Web of Science® Google Scholar
32Boss M, Dienstfrey A, Gimbutas Z, et al. Magnetic Resonance Imaging Biomarker Calibration Service: Proton Spin Relaxation Times. National Institute of Standards and Technology; 2018.
Google Scholar
33Stupic KF, Ainslie M, Boss MA, et al. A standard system phantom for magnetic resonance imaging. Magn Reson Med. 2021; 86: 1194-1211.
10.1002/mrm.28779
PubMed Web of Science® Google Scholar
34Marques JP, Meineke J, Milovic C, et al. QSM reconstruction challenge 2.0: a realistic in silico head phantom for MRI data simulation and evaluation of susceptibility mapping procedures. Magn Reson Med. 2021; 86: 526-542.
10.1002/mrm.28716
PubMed Web of Science® Google Scholar
35Obuchowski NA, Buckler A, Kinahan P, et al. Statistical issues in testing conformance with the quantitative imaging biomarker alliance (QIBA) profile claims. Acad Radiol. 2016; 23: 496-506.
10.1016/j.acra.2015.12.020
PubMed Web of Science® Google Scholar
36Obuchowski NA, Bullen J. Quantitative imaging biomarkers: effect of sample size and bias on confidence interval coverage. Stat Methods Med Res. 2018; 27: 3139-3150.
10.1177/0962280217693662
PubMed Web of Science® Google Scholar
37Captur G, Bhandari A, Brühl R, et al. T1 mapping performance and measurement repeatability: results from the multi-national T1 mapping standardization phantom program (T1MES). J Cardiovasc Magn Reson. 2020; 22: 31.
10.1186/s12968-020-00613-3
PubMed Web of Science® Google Scholar
38Hernando D, Kuhn JP, Mensel B, et al. R2* estimation using “in-phase” echoes in the presence of fat: the effects of complex spectrum of fat. J Magn Reson Imaging. 2013; 37: 717-726.
10.1002/jmri.23851
PubMed Web of Science® Google Scholar
39Obuchowski NA, Mozley PD, Matthews D, Buckler A, Bullen J, Jackson E. Statistical considerations for planning clinical trials with quantitative imaging biomarkers. J Natl Cancer Inst. 2019; 111: 19-26.
10.1093/jnci/djy194
PubMed Google Scholar
40Shukla-Dave A, Obuchowski NA, Chenevert TL, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE-MRI derived biomarkers in multicenter oncology trials. J Magn Reson Imaging. 2019; 49: e101-e121.
10.1002/jmri.26518
PubMed Web of Science® Google Scholar
41Huang EP, Wang X-F, Choudhury KR, et al. Meta-analysis of the technical performance of an imaging procedure: guidelines and statistical methodology. Stat Methods Med Res. 2015; 24: 141-174.
10.1177/0962280214537394
PubMed Web of Science® Google Scholar
42Obuchowski NA, Barnhart HX, Buckler AJ, et al. Statistical issues in the comparison of quantitative imaging biomarker algorithms using pulmonary nodule volume as an example. Stat Methods Med Res. 2015; 24: 107-140.
10.1177/0962280214537392
PubMed Web of Science® Google Scholar
43 RSNA. QIBA Profile: Diffusion-Weighted Magnetic Resonance Imaging (DWI). QIBA; 2019.
Google Scholar
44Hernando D, Cook RJ, Qazi N, Longhurst CA, Diamond CA, Reeder SB. Complex confounder-corrected R2* mapping for liver iron quantification with MRI. Eur Radiol. 2021; 31: 264-275.
10.1007/s00330-020-07123-x
PubMed Web of Science® Google Scholar
45Hernando D, Zhao R, Mattison R, et al. Multi-center, multi-vendor reproducibility of confounder-corrected R2* mapping for liver iron quantification at 1.5T and 3T: interim results. 2019 March 17, 2019; Orlando, FL. Society of Abdominal Imaging Annual Meeting.
Google Scholar
46Hernando D, Sharma SD, Aliyari Ghasabeh M, et al. Multisite, multivendor validation of the accuracy and reproducibility of proton-density fat-fraction quantification at 1.5T and 3T using a fat-water phantom. Magn Reson Med. 2017; 77: 1516-1524.
10.1002/mrm.26228
CAS PubMed Web of Science® Google Scholar
47Hu HH, Yokoo T, Bashir MR, et al. Linearity and bias of proton density fat fraction as a quantitative imaging biomarker: a multicenter, multiplatform, multivendor phantom study. Radiology. 2021; 298: 640-651.
10.1148/radiol.2021202912
PubMed Web of Science® Google Scholar
48Yokoo T, Serai SD, Pirasteh A, et al. Linearity, bias, and precision of hepatic proton density fat fraction measurements by using MR imaging: a meta-analysis. Radiology. 2018; 286: 486-498.
10.1148/radiol.2017170550
PubMed Web of Science® Google Scholar
49Bannas P, Kramer H, Hernando D, et al. Quantitative magnetic resonance imaging of hepatic steatosis: validation in ex vivo human livers. Hepatology. 2015; 62: 1444-1455.
10.1002/hep.28012
CAS PubMed Web of Science® Google Scholar
50Hines CD, Frydrychowicz A, Hamilton G, et al. T(1) independent, T(2) (*) corrected chemical shift based fat-water separation with multi-peak fat spectral modeling is an accurate and precise measure of hepatic steatosis. J Magn Reson Imaging: JMRI. 2011; 33: 873-881.
10.1002/jmri.22514
PubMed Web of Science® Google Scholar
51Serai SD, Dillman JR, Trout AT. Proton density fat fraction measurements at 1.5- and 3-T hepatic MR imaging: same-day agreement among readers and across two imager manufacturers. Radiology. 2017; 284: 244-254.
10.1148/radiol.2017161786
PubMed Web of Science® Google Scholar
52Roujol S, Weingartner S, Foppa M, et al. Accuracy, precision, and reproducibility of four T1 mapping sequences: a head-to-head comparison of MOLLI, ShMOLLI, SASHA, and SAPPHIRE. Radiology. 2014; 272: 683-689.
10.1148/radiol.14140296
PubMed Web of Science® Google Scholar
53Weingartner S, Messner NM, Budjan J, et al. Myocardial T1-mapping at 3T using saturation-recovery: reference values, precision and comparison with MOLLI. J Cardiovasc Magn Reson. 2016; 18: 84.
10.1186/s12968-016-0302-x
PubMed Web of Science® Google Scholar
54Fontana M, White SK, Banypersad SM, et al. Comparison of T1 mapping techniques for ECV quantification. Histological validation and reproducibility of ShMOLLI versus multibreath-hold T1 quantification equilibrium contrast CMR. J Cardiovasc Magn Reson. 2012; 14: 88.
10.1186/1532-429X-14-88
PubMed Web of Science® Google Scholar
55Kellman P, Hansen MS. T1-mapping in the heart: accuracy and precision. J Cardiovasc Magn Reson. 2014; 16: 2.
10.1186/1532-429X-16-2
PubMed Web of Science® Google Scholar
56Graham-Brown MP, Rutherford E, Levelt E, et al. Native T1 mapping: inter-study, inter-observer and inter-center reproducibility in hemodialysis patients. J Cardiovasc Magn Reson. 2017; 19: 21.
10.1186/s12968-017-0337-7
PubMed Web of Science® Google Scholar
57Chow K, Flewitt JA, Green JD, Pagano JJ, Friedrich MG, Thompson RB. Saturation recovery single-shot acquisition (SASHA) for myocardial T(1) mapping. Magn Reson Med. 2014; 71: 2082-2095.
10.1002/mrm.24878
CAS PubMed Web of Science® Google Scholar
58Raman FS, Kawel-Boehm N, Gai N, et al. Modified look-locker inversion recovery T1 mapping indices: assessment of accuracy and reproducibility between magnetic resonance scanners. J Cardiovasc Magn Reson. 2013; 15: 64.
10.1186/1532-429X-15-64
PubMed Web of Science® Google Scholar
59Dabir D, Child N, Kalra A, et al. Reference values for healthy human myocardium using a T1 mapping methodology: results from the international T1 multicenter cardiovascular magnetic resonance study. J Cardiovasc Magn Reson. 2014; 16: 69.
10.1186/s12968-014-0069-x
PubMed Web of Science® Google Scholar
60Popescu IA, Werys K, Zhang Q, et al. Standardization of T1-mapping in cardiovascular magnetic resonance using clustered structuring for benchmarking normal ranges. Int J Cardiol. 2021; 326: 220-225.
10.1016/j.ijcard.2020.10.041
PubMed Web of Science® Google Scholar
61Glover GH, Schneider E. Three-point Dixon technique for true water/fat decomposition with B0 inhomogeneity correction. Magn Reson Med. 1991; 18: 371-383.
10.1002/mrm.1910180211
CAS PubMed Web of Science® Google Scholar
62Malyarenko DI, Ross BD, Chenevert TL. Analysis and correction of gradient nonlinearity bias in apparent diffusion coefficient measurements. Magn Reson Med. 2014; 71: 1312-1323.
10.1002/mrm.24773
PubMed Web of Science® Google Scholar
63Bernstein MA, Zhou XJ, Polzin JA, et al. Concomitant gradient terms in phase contrast MR: analysis and correction. Magn Reson Med. 1998; 39: 300-308.
10.1002/mrm.1910390218
CAS PubMed Web of Science® Google Scholar
64Colgan TJ, Hernando D, Sharma SD, Reeder SB. The effects of concomitant gradients on chemical shift encoded MRI. Magn Reson Med. 2017; 78: 730-738.
10.1002/mrm.26461
CAS PubMed Web of Science® Google Scholar
65Jezzard P, Barnett AS, Pierpaoli C. Characterization of and correction for eddy current artifacts in echo planar diffusion imaging. Magn Reson Med. 1998; 39: 801-812.
10.1002/mrm.1910390518
CAS PubMed Web of Science® Google Scholar
66Benner T, van der Kouwe AJ, Kirsch JE, Sorensen AG. Real-time RF pulse adjustment for B0 drift correction. Magn Reson Med. 2006; 56: 204-209.
10.1002/mrm.20936
CAS PubMed Web of Science® Google Scholar
67Peters DC, Derbyshire JA, McVeigh ER. Centering the projection reconstruction trajectory: reducing gradient delay errors. Magn Reson Med. 2003; 50: 1-6.
10.1002/mrm.10501
PubMed Web of Science® Google Scholar
68Zaitsev M, Maclaren J, Herbst M. Motion artifacts in MRI: a complex problem with many partial solutions. J Magn Reson Imaging. 2015; 42: 887-901.
10.1002/jmri.24850
PubMed Web of Science® Google Scholar
69Novikov DS, Kiselev VG, Jespersen SN. On modeling. Magn Reson Med. 2018; 79: 3172-3193.
10.1002/mrm.27101
PubMed Web of Science® Google Scholar
70McRobbie D, Lerski R, Straughan K. Slice profile effects and their calibration and correction in quantitative NMR imaging. Phys Med Biol. 1987; 32: 971.
10.1088/0031-9155/32/8/002
Web of Science® Google Scholar
71Malik SJ, Kenny GD, Hajnal JV. Slice profile correction for transmit sensitivity mapping using actual flip angle imaging. Magn Reson Med. 2011; 65: 1393-1399.
10.1002/mrm.22739
PubMed Web of Science® Google Scholar
72Gudbjartsson H, Patz S. The Rician distribution of noisy MRI data. Magn Reson Med. 1995; 34: 910-914.
10.1002/mrm.1910340618
CAS PubMed Web of Science® Google Scholar
73Jones DK, Basser PJ. “Squashing peanuts and smashing pumpkins”: how noise distorts diffusion-weighted MR data. Magn Reson Med. 2004; 52: 979-993.
10.1002/mrm.20283
PubMed Web of Science® Google Scholar
74Hernando D, Kramer JH, Reeder SB. Multipeak fat-corrected complex R2* relaxometry: theory, optimization, and clinical validation. Magn Reson Med. 2013; 70: 1319-1331.
10.1002/mrm.24593
PubMed Web of Science® Google Scholar
75Bydder M, Yokoo T, Hamilton G, et al. Relaxation effects in the quantification of fat using gradient echo imaging. Magn Reson Imaging. 2008; 26: 347-359.
10.1016/j.mri.2007.08.012
PubMed Web of Science® Google Scholar
76Horng DE, Hernando D, Reeder SB. Quantification of liver fat in the presence of iron overload. J Magn Reson Imaging. 2017; 45: 428-439.
10.1002/jmri.25382
PubMed Web of Science® Google Scholar
77Hamilton G, Yokoo T, Bydder M, et al. In vivo characterization of the liver fat (1)H MR spectrum. NMR Biomed. 2011; 24: 784-790.
10.1002/nbm.1622
PubMed Web of Science® Google Scholar
78Yu H, Shimakawa A, Hines CD, et al. Combination of complex-based and magnitude-based multiecho water-fat separation for accurate quantification of fat-fraction. Magn Reson Med. 2011; 66: 199-206.
10.1002/mrm.22840
CAS PubMed Web of Science® Google Scholar
79Hernando D, Hines CD, Yu H, Reeder SB. Addressing phase errors in fat-water imaging using a mixed magnitude/complex fitting method. Magn Reson Med. 2012; 67: 638-644.
10.1002/mrm.23044
CAS PubMed Web of Science® Google Scholar
80Weingartner S, Akcakaya M, Basha T, et al. Combined saturation/inversion recovery sequences for improved evaluation of scar and diffuse fibrosis in patients with arrhythmia or heart rate variability. Magn Reson Med. 2014; 71: 1024-1034.
10.1002/mrm.24761
CAS PubMed Web of Science® Google Scholar
81Christodoulou AG, Shaw JL, Nguyen C, et al. Magnetic resonance multitasking for motion-resolved quantitative cardiovascular imaging. Nat Biomed Eng. 2018; 2: 215-226.
10.1038/s41551-018-0217-y
CAS PubMed Web of Science® Google Scholar
82Weingartner S, Shenoy C, Rieger B, Schad LR, Schulz-Menger J, Akcakaya M. Temporally resolved parametric assessment of Z-magnetization recovery (TOPAZ): dynamic myocardial T1 mapping using a cine steady-state look-locker approach. Magn Reson Med. 2018; 79: 2087-2100.
10.1002/mrm.26887
PubMed Web of Science® Google Scholar
83Guo R, Cai X, Kucukseymen S, et al. Free-breathing simultaneous myocardial T1 and T2 mapping with whole left ventricle coverage. Magn Reson Med. 2021; 85: 1308-1321.
10.1002/mrm.28506
PubMed Web of Science® Google Scholar
84Robson MD, Piechnik SK, Tunnicliffe EM, Neubauer S. T1 measurements in the human myocardium: the effects of magnetization transfer on the SASHA and MOLLI sequences. Magn Reson Med. 2013; 70: 664-670.
10.1002/mrm.24867
CAS PubMed Web of Science® Google Scholar
85Kellman P, Herzka DA, Arai AE, Hansen MS. Influence of Off-resonance in myocardial T1-mapping using SSFP based MOLLI method. J Cardiovasc Magn Reson. 2013; 15: 63.
10.1186/1532-429X-15-63
PubMed Web of Science® Google Scholar
86Cooper MA, Nguyen TD, Spincemaille P, Prince MR, Weinsaft JW, Wang Y. How accurate is MOLLI T1 mapping in vivo? Validation by spin echo methods. PLoS One. 2014; 9:e107327.
10.1371/journal.pone.0107327
PubMed Web of Science® Google Scholar
87Cameron D, Vassiliou VS, Higgins DM, Gatehouse PD. Towards accurate and precise T1 and extracellular volume mapping in the myocardium: a guide to current pitfalls and their solutions. MAGMA. 2018; 31: 143-163.
10.1007/s10334-017-0631-2
PubMed Google Scholar
88Weingartner S, Messner NM, Zollner FG, Akcakaya M, Schad LR. Black-blood native T1 mapping: blood signal suppression for reduced partial voluming in the myocardium. Magn Reson Med. 2017; 78: 484-493.
10.1002/mrm.26378
CAS PubMed Web of Science® Google Scholar
89Motosugi U, Hernando D, Wiens C, Bannas P, Reeder SB. High SNR acquisitions improve the repeatability of liver fat quantification using confounder-corrected chemical shift-encoded MR imaging. Magn Reson Med Sci. 2017; 16: 332-339.
10.2463/mrms.mp.2016-0081
CAS PubMed Web of Science® Google Scholar
90Boudreau M, Pike GB. Sensitivity regularization of the Cramér-Rao lower bound to minimize B1 nonuniformity effects in quantitative magnetization transfer imaging. Magn Reson Med. 2018; 80: 2560-2572.
10.1002/mrm.27337
PubMed Web of Science® Google Scholar
91Drakesmith M, Harms R, Rudrapatna SU, Parker GD, Evans CJ, Jones DK. Estimating axon conduction velocity in vivo from microstructural MRI. NeuroImage. 2019; 203: 116186.
10.1016/j.neuroimage.2019.116186
PubMed Web of Science® Google Scholar
92Scharf LL, LT M. Geometry of the Cramér-Rao bound. In: Proceedings of the IEEE Sixth SP Workshop on Statistical Signal and Array Processing. Victoria, BC, Canada. 1992. p 5-8.
Google Scholar
93Akcakaya M, Weingartner S, Roujol S, Nezafat R. On the selection of sampling points for myocardial T1 mapping. Magn Reson Med. 2015; 73: 1741-1753.
10.1002/mrm.25285
PubMed Web of Science® Google Scholar
94Kellman P, Xue H, Chow K, Spottiswoode BS, Arai AE, Thompson RB. Optimized saturation recovery protocols for T1-mapping in the heart: influence of sampling strategies on precision. J Cardiovasc Magn Reson. 2014; 16: 55.
10.1186/s12968-014-0055-3
PubMed Web of Science® Google Scholar
95Jang J, Bellm S, Roujol S, et al. Comparison of spoiled gradient echo and steady-state free-precession imaging for native myocardial T1 mapping using the slice-interleaved T1 mapping (STONE) sequence. NMR Biomed. 2016; 29: 1486-1496.
10.1002/nbm.3598
CAS PubMed Web of Science® Google Scholar
96Shaw JL, Yang Q, Zhou Z, et al. Free-breathing, non-ECG, continuous myocardial T1 mapping with cardiovascular magnetic resonance multitasking. Magn Reson Med. 2019; 81: 2450-2463.
10.1002/mrm.27574
PubMed Web of Science® Google Scholar
97Qi H, Jaubert O, Bustin A, et al. Free-running 3D whole heart myocardial T1 mapping with isotropic spatial resolution. Magn Reson Med. 2019; 82: 1331-1342.
10.1002/mrm.27811
CAS PubMed Web of Science® Google Scholar
98Chow K, Yang Y, Shaw P, Kramer CM, Salerno M. Robust free-breathing SASHA T1 mapping with high-contrast image registration. J Cardiovasc Magn Reson. 2016; 18: 47.
10.1186/s12968-016-0267-9
PubMed Web of Science® Google Scholar
99Weingartner S, Roujol S, Akcakaya M, Basha TA, Nezafat R. Free-breathing multislice native myocardial T1 mapping using the slice-interleaved T1 (STONE) sequence. Magn Reson Med. 2015; 74: 115-124.
10.1002/mrm.25387
CAS PubMed Web of Science® Google Scholar
100Weingartner S, Akcakaya M, Roujol S, et al. Free-breathing post-contrast three-dimensional T1 mapping: volumetric assessment of myocardial T1 values. Magn Reson Med. 2015; 73: 214-222.
10.1002/mrm.25124
PubMed Web of Science® Google Scholar
101Nordio G, Bustin A, Henningsson M, et al. 3D SASHA myocardial T1 mapping with high accuracy and improved precision. MAGMA. 2019; 32: 281-289.
10.1007/s10334-018-0703-y
CAS PubMed Google Scholar
102Wang Y, Liu T. Quantitative susceptibility mapping (QSM): decoding MRI data for a tissue magnetic biomarker. Magn Reson Med. 2015; 73: 82-101.
10.1002/mrm.25358
CAS PubMed Web of Science® Google Scholar
103Hancu I, Liu J, Hua Y, Lee SK. Electrical properties tomography: available contrast and reconstruction capabilities. Magn Reson Med. 2019; 81: 803-810.
10.1002/mrm.27453
PubMed Web of Science® Google Scholar
104Liu J, Wang Y, Katscher U, He B. Electrical properties tomography based on B₁ maps in MRI: principles, applications, and challenges. IEEE Trans Biomed Eng. 2017; 64: 2515-2530.
10.1109/TBME.2017.2725140
PubMed Web of Science® Google Scholar
105Bydder M, Yokoo T, Yu H, Carl M, Reeder SB, Sirlin CB. Constraining the initial phase in water-fat separation. Magn Reson Imaging. 2011; 29: 216-221.
10.1016/j.mri.2010.08.011
PubMed Web of Science® Google Scholar
106Horng DE, Hernando D, Hines CD, Reeder SB. Comparison of R2* correction methods for accurate fat quantification in fatty liver. J Magn Reson Imaging. 2013; 37: 414-422.
10.1002/jmri.23835
PubMed Web of Science® Google Scholar
107Wang X, Hernando D, Reeder SB. Sensitivity of chemical shift-encoded fat quantification to calibration of fat MR spectrum. Magn Reson Med. 2016; 75: 845-851.
10.1002/mrm.25681
PubMed Web of Science® Google Scholar
108Deichmann R, Haase A. quantification of T1 values by SNAPSHOT-FLASH NMR imaging. J Magn Reson (1969). 1992; 96: 608-612.
10.1016/0022-2364(92)90347-A
CAS Web of Science® Google Scholar
109Shao J, Rapacchi S, Nguyen KL, Hu P. Myocardial T1 mapping at 3.0 tesla using an inversion recovery spoiled gradient echo readout and bloch equation simulation with slice profile correction (BLESSPC) T1 estimation algorithm. J Magn Reson Imaging. 2016; 43: 414-425.
10.1002/jmri.24999
PubMed Web of Science® Google Scholar
110Xanthis CG, Bidhult S, Kantasis G, Heiberg E, Arheden H, Aletras AH. Parallel simulations for QUAntifying RElaxation magnetic resonance constants (SQUAREMR): an example towards accurate MOLLI T1 measurements. J Cardiovasc Magn Reson. 2015; 17: 104.
10.1186/s12968-015-0206-1
PubMed Web of Science® Google Scholar
111Veraart J, Sijbers J, Sunaert S, Leemans A, Jeurissen B. Weighted linear least squares estimation of diffusion MRI parameters: strengths, limitations, and pitfalls. NeuroImage. 2013; 81: 335-346.
10.1016/j.neuroimage.2013.05.028
PubMed Web of Science® Google Scholar
112Bertsekas DP. Nonlinear Programming. Athena Scientific. 2016.
Google Scholar
113Xue H, Greiser A, Zuehlsdorff S, et al. Phase-sensitive inversion recovery for myocardial T1 mapping with motion correction and parametric fitting. Magn Reson Med. 2013; 69: 1408-1420.
10.1002/mrm.24385
PubMed Web of Science® Google Scholar
114Hunter DJ, Losina E, Guermazi A, Burstein D, Lassere MN, Kraus V. A pathway and approach to biomarker validation and qualification for osteoarthritis clinical trials. Curr Drug Targets. 2010; 11: 536-545.
10.2174/138945010791011947
CAS PubMed Web of Science® Google Scholar
115Califf RM. Biomarker definitions and their applications. Exp Biol Med (Maywood). 2018; 243: 213-221.
10.1177/1535370217750088
CAS PubMed Web of Science® Google Scholar
116Rosenkrantz AB, Mendiratta-Lala M, Bartholmai BJ, et al. Clinical utility of quantitative imaging. Acad Radiol. 2015; 22: 33-49.
10.1016/j.acra.2014.08.011
PubMed Web of Science® Google Scholar
117Brisman JL, Pile-Spellman J, Konstas AA. Clinical utility of quantitative magnetic resonance angiography in the assessment of the underlying pathophysiology in a variety of cerebrovascular disorders. Eur J Radiol. 2012; 81: 298-302.
10.1016/j.ejrad.2010.12.079
PubMed Web of Science® Google Scholar
118Eskreis-Winkler S, Zhang Y, Zhang J, et al. The clinical utility of QSM: disease diagnosis, medical management, and surgical planning. NMR Biomed. 2017; 30:e3668.
10.1002/nbm.3668
CAS Web of Science® Google Scholar
119Gallagher EJ. Clinical utility of likelihood ratios. Ann Emerg Med. 1998; 31: 391-397.
10.1016/S0196-0644(98)70352-X
CAS PubMed Web of Science® Google Scholar
120Dobbin KK, Cesano A, Alvarez J, et al. Validation of biomarkers to predict response to immunotherapy in cancer: volume II—clinical validation and regulatory considerations. J Immunother Cancer. 2016; 4: 77.
10.1186/s40425-016-0179-0
PubMed Web of Science® Google Scholar
121Padhani AR, Liu G, Koh DM, et al. Diffusion-weighted magnetic resonance imaging as a cancer biomarker: consensus and recommendations. Neoplasia. 2009; 11: 102-125.
10.1593/neo.81328
CAS PubMed Web of Science® Google Scholar
122Baessler B, Schaarschmidt F, Dick A, et al. Mapping tissue inhomogeneity in acute myocarditis: a novel analytical approach to quantitative myocardial edema imaging by T2-mapping. J Cardiovasc Magn Reson. 2015; 17: 115.
10.1186/s12968-015-0217-y
PubMed Web of Science® Google Scholar
123Baessler B, Schaarschmidt F, Stehning C, et al. Reproducibility of three different cardiac T2 -mapping sequences at 1.5T. J Magn Reson Imaging. 2016; 44: 1168-1178.
10.1002/jmri.25258
PubMed Web of Science® Google Scholar
124Gu J, Liu S, Du S, et al. Diagnostic value of MRI-PDFF for hepatic steatosis in patients with non-alcoholic fatty liver disease: a meta-analysis. Eur Radiol. 2019; 29: 3564-3573.
10.1007/s00330-019-06072-4
PubMed Web of Science® Google Scholar
125Loomba R, Neuschwander-Tetri BA, Sanyal A, et al. Multicenter validation of association between decline in MRI-PDFF and histologic response in NASH. Hepatology. 2020; 72: 1219-1229.
10.1002/hep.31121
CAS PubMed Web of Science® Google Scholar
126Stine JG, Munaganuru N, Barnard A, et al. Change in MRI-PDFF and histologic response in patients with nonalcoholic steatohepatitis: a systematic review and meta-analysis. Clin Gastroenterol Hepatol. 2021; 19: 2274–2283.
10.1016/j.cgh.2020.08.061
PubMed Web of Science® Google Scholar
127Rehm JL, Wolfgram PM, Hernando D, Eickhoff JC, Allen DB, Reeder SB. Proton density fat-fraction is an accurate biomarker of hepatic steatosis in adolescent girls and young women. Eur Radiol. 2015; 25: 2921-2930.
10.1007/s00330-015-3724-1
PubMed Web of Science® Google Scholar
128Puchner SB, Lu MT, Mayrhofer T, et al. High-risk coronary plaque at coronary CT angiography is associated with nonalcoholic fatty liver disease, independent of coronary plaque and stenosis burden: results from the ROMICAT II trial. Radiology. 2015; 274: 693-701.
10.1148/radiol.14140933
PubMed Web of Science® Google Scholar
129Chalasani N, Younossi Z, Lavine JE, et al. The diagnosis and management of nonalcoholic fatty liver disease: practice guidance from the American Association for the Study of Liver Diseases. Hepatology. 2018; 67: 328-357.
10.1002/hep.29367
PubMed Web of Science® Google Scholar
130Faragli A, Merz S, Muzio FPL, et al. Estimation of total collagen volume: a T1 mapping versus histological comparison study in healthy Landrace pigs. Int J Cardiovasc Imaging. 2020; 36: 1761-1769.
10.1007/s10554-020-01881-x
CAS PubMed Web of Science® Google Scholar
131Reiter U, Reiter C, Krauter C, Fuchsjager M, Reiter G. Cardiac magnetic resonance T1 mapping. Part 2: diagnostic potential and applications. Eur J Radiol. 2018; 109: 235-247.
10.1016/j.ejrad.2018.10.013
PubMed Web of Science® Google Scholar
132Hamilton-Craig CR, Strudwick MW, Galloway GJ. T1 Mapping for myocardial fibrosis by cardiac magnetic resonance relaxometry-a comprehensive technical review. Front Cardiovasc Med. 2016; 3: 49.
PubMed Web of Science® Google Scholar
133Lurz JA, Luecke C, Lang D, et al. CMR-derived extracellular volume fraction as a marker for myocardial fibrosis: the importance of coexisting myocardial inflammation. JACC Cardiovasc Imaging. 2018; 11: 38-45.
10.1016/j.jcmg.2017.01.025
PubMed Web of Science® Google Scholar
134Fehrmann A, Treutlein M, Rudolph T, et al. Myocardial T1 and T2 mapping in severe aortic stenosis: potential novel insights into the pathophysiology of myocardial remodelling. Eur J Radiol. 2018; 107: 76-83.
10.1016/j.ejrad.2018.08.016
PubMed Web of Science® Google Scholar
135Aherne E, Chow K, Carr J. Cardiac T1 mapping: techniques and applications. J Magn Reson Imaging. 2020; 51: 1336-1356.
10.1002/jmri.26866
PubMed Web of Science® Google Scholar
136Pan JA, Kerwin MJ, Salerno M. Native T1 mapping, extracellular volume mapping, and late gadolinium enhancement in cardiac amyloidosis: a meta-analysis. JACC Cardiovasc Imaging. 2020; 13: 1299-1310.
10.1016/j.jcmg.2020.03.010
PubMed Web of Science® Google Scholar
137Duan C, Zhu Y, Jang J, et al. Non-contrast myocardial infarct scar assessment using a hybrid native T1 and magnetization transfer imaging sequence at 1.5T. Magn Reson Med. 2019; 81: 3192-3201.
10.1002/mrm.27636
CAS PubMed Web of Science® Google Scholar
138Lopez K, Neji R, Mukherjee RK, et al. Contrast-free high-resolution 3D magnetization transfer imaging for simultaneous myocardial scar and cardiac vein visualization. MAGMA. 2020; 33: 627-640.
10.1007/s10334-020-00833-9
PubMed Google Scholar
139O'Connor JP, Aboagye EO, Adams JE, et al. Imaging biomarker roadmap for cancer studies. Nat Rev Clin Oncol. 2017; 14: 169-186.
10.1038/nrclinonc.2016.162
CAS PubMed Web of Science® Google Scholar
140Tofts PS, Collins DJ. Multicentre imaging measurements for oncology and in the brain. Br J Radiol. 2011; 84: S213-S226.
10.1259/bjr/74316620
PubMed Web of Science® Google Scholar
141Captur G, Gatehouse P, Keenan KE, et al. A medical device-grade T1 and ECV phantom for global T1 mapping quality assurance-the T1 mapping and ECV standardization in cardiovascular magnetic resonance (T1MES) program. J Cardiovasc Magn Reson. 2016; 18: 58.
10.1186/s12968-016-0280-z
PubMed Web of Science® Google Scholar
142Keenan KE, Biller JR, Delfino JG, et al. Recommendations towards standards for quantitative MRI (qMRI) and outstanding needs. J Magn Reson Imaging. 2019; 49: e26-e39.
10.1002/jmri.26598
PubMed Web of Science® Google Scholar
143Buckler AJ, Bresolin L, Dunnick NR, et al. Quantitative imaging test approval and biomarker qualification: interrelated but distinct activities. Radiology. 2011; 259: 875-884.
10.1148/radiol.10100800
PubMed Web of Science® Google Scholar
144Radiological Society of North America Quantitative Imaging Biomarkers Alliance. https://www.rsna.org/research/quantitative-imaging-biomarkers-alliance. Accessed November 21, 2021.
Google Scholar
145Kuhn JP, Meffert P, Heske C, et al. Prevalence of fatty liver disease and hepatic iron overload in a northeastern German population by using quantitative MR imaging. Radiology. 2017; 284: 706-716.
10.1148/radiol.2017161228
PubMed Web of Science® Google Scholar
146Ajmera V, Park CC, Caussy C, et al. Magnetic resonance imaging proton density fat fraction associates with progression of fibrosis in patients with nonalcoholic fatty liver disease. Gastroenterology. 2018; 155: 307-310 e302.
10.1053/j.gastro.2018.04.014
PubMed Web of Science® Google Scholar
147Baggiano A, Boldrini M, Martinez-Naharro A, et al. Noncontrast magnetic resonance for the diagnosis of cardiac amyloidosis. JACC Cardiovasc Imaging. 2020; 13: 69-80.
10.1016/j.jcmg.2019.03.026
PubMed Web of Science® Google Scholar
148Augusto JB, Nordin S, Vijapurapu R, et al. Myocardial edema, myocyte injury, and disease severity in fabry disease. Circ Cardiovasc Imaging. 2020; 13:e010171.
10.1161/CIRCIMAGING.119.010171
PubMed Web of Science® Google Scholar
149Petersen SE, Matthews PM, Francis JM, et al. UK Biobank's cardiovascular magnetic resonance protocol. J Cardiovasc Magn Reson. 2016; 18: 8.
10.1186/s12968-016-0227-4
PubMed Web of Science® Google Scholar
150Bamberg F, Kauczor HU, Weckbach S, et al. German national cohort MRISI. Whole-body MR imaging in the German national cohort: rationale, design, and technical background. Radiology. 2015; 277: 206-220.
10.1148/radiol.2015142272
PubMed Web of Science® Google Scholar
151Ferreira VM, Schulz-Menger J, Holmvang G, et al. Cardiovascular magnetic resonance in nonischemic myocardial inflammation: expert recommendations. J Am Coll Cardiol. 2018; 72: 3158-3176.
10.1016/j.jacc.2018.09.072
PubMed Web of Science® Google Scholar
152van Beek EJR, Kuhl C, Anzai Y, et al. Value of MRI in medicine: more than just another test? J Magn Reson Imaging. 2019; 49: e14-e25.
10.1002/jmri.26211
PubMed Web of Science® Google Scholar
153Farahani K, Tata D, Nordstrom RJ. QIN benchmarks for clinical translation of quantitative imaging tools. Tomography. 2019; 5: 1-6.
10.18383/j.tom.2018.00045
PubMed Web of Science® Google Scholar
154Wilson M, Andronesi O, Barker PB, et al. Methodological consensus on clinical proton MRS of the brain: review and recommendations. Magn Reson Med. 2019; 82: 527-550.
10.1002/mrm.27742
PubMed Web of Science® Google Scholar
155Near J, Harris AD, Juchem C, et al. Preprocessing, analysis and quantification in single-voxel magnetic resonance spectroscopy: experts' consensus recommendations. NMR Biomed. 2021; 34:e4257.
10.1002/nbm.4257
PubMed Web of Science® Google Scholar
156Alsop DC, Detre JA, Golay X, et al. Recommended implementation of arterial spin-labeled perfusion MRI for clinical applications: a consensus of the ISMRM perfusion study group and the European consortium for ASL in dementia. Magn Reson Med. 2015; 73: 102-116.
10.1002/mrm.25197
PubMed Web of Science® Google Scholar
157Taouli B, Beer AJ, Chenevert T, et al. Diffusion-weighted imaging outside the brain: consensus statement from an ISMRM-sponsored workshop. J Magn Reson Imaging. 2016; 44: 521-540.
10.1002/jmri.25196
PubMed Web of Science® Google Scholar
158Ljimani A, Caroli A, Laustsen C, et al. Correction to: consensus-based technical recommendations for clinical translation of renal diffusion-weighted MRI. MAGMA. 2020; 33: 197-198.
10.1007/s10334-020-00828-6
PubMed Google Scholar
159Ljimani A, Caroli A, Laustsen C, et al. Consensus-based technical recommendations for clinical translation of renal diffusion-weighted MRI. MAGMA. 2020; 33: 177-195.
10.1007/s10334-019-00790-y
CAS PubMed Google Scholar
160Baltzer P, Mann RM, Iima M, et al. Diffusion-weighted imaging of the breast-a consensus and mission statement from the EUSOBI International Breast Diffusion-Weighted Imaging working group. Eur Radiol. 2020; 30: 1436-1450.
10.1007/s00330-019-06510-3
PubMed Web of Science® Google Scholar
161Shukla-Dave A, Obuchowski NA, Chenevert TL, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE-MRI derived biomarkers in multicenter oncology trials. J Magn Reson Imaging. 2019; 49: e101-e121.
10.1002/jmri.26518
PubMed Web of Science® Google Scholar
162Dyverfeldt P, Bissell M, Barker AJ, et al. 4D flow cardiovascular magnetic resonance consensus statement. J Cardiovasc Magn Reson. 2015; 17: 72.
10.1186/s12968-015-0174-5
PubMed Web of Science® Google Scholar
163Committee QSMCO, Bilgic B, Langkammer C, et al. QSM reconstruction challenge 2.0: design and report of results. Magn Reson Med. 2021; 86: 1241-1255.
10.1002/mrm.28754
PubMed Web of Science® Google Scholar
164Karakuzu A, Boudreau M, Duval T, et al. qMRLab: quantitative MRI analysis, under one umbrella. J Open Source Softw. 2020; 5: 2343.
10.21105/joss.02343
Google Scholar
165Ma D, Gulani V, Seiberlich N, et al. Magnetic resonance fingerprinting. Nature. 2013; 495: 187-192.
10.1038/nature11971
CAS PubMed Web of Science® Google Scholar

Citing Literature

Volume87, Issue3

March 2022

Pages 1184-1206

This article also appears in:

Development, validation, qualification, and dissemination of quantitative MR methods: Overview and recommendations by the ISMRM quantitative MR study group

Abstract

1 INTRODUCTION

1.1 Liver PDFF quantification

1.2 Cardiac T1 mapping

2 TECHNICAL PERFORMANCE OF qMR METHODS

2.1 Bias

2.2 Precision

2.3 Examples

2.3.1 Liver PDFF quantification

2.3.2 Cardiac T1

3 CONFOUNDING FACTORS IN MR

3.1 Hardware and system imperfections

3.2 Physiological effects and motion

3.3 Signal model imperfections

3.4 Other artifacts and noise

3.5 Examples

3.5.1 Liver PDFF quantification

3.6 Cardiac T1

4 TECHNICAL DEVELOPMENT AND VALIDATION

4.1 Acquisition

4.1.1 Liver PDFF quantification

4.1.2 Cardiac T1

4.2 Model selection

4.2.1 Liver PDFF quantification

4.2.2 Cardiac T1

4.3 Model fitting

4.3.1 Liver PDFF quantification

4.3.2 Cardiac T1

5 CLINICAL QUALIFICATION

5.1 Examples

5.1.1 Liver PDFF quantification

5.1.2 Cardiac T1

6 DISSEMINATION IN RESEARCH AND IN THE CLINIC

6.1 Examples

6.1.1 Liver PDFF quantification

6.1.2 Cardiac T1

7 RELATED INITIATIVES, CHALLENGES, AND OPPORTUNITIES

8 SUMMARY AND CONCLUSIONS

ACKNOWLEDGMENTS

CONFLICT OF INTEREST

Open Research

DATA AVAILABILITY STATEMENT

Supporting Information

REFERENCES

Citing Literature

Figures

References

Related

Information

1.2 Cardiac T₁ mapping

2.3.2 Cardiac T₁

3.6 Cardiac T₁

4.1.2 Cardiac T₁

4.2.2 Cardiac T₁

4.3.2 Cardiac T₁

5.1.2 Cardiac T₁

6.1.2 Cardiac T₁