Chemical shift encoding based double bonds quantification in triglycerides using deep image prior
Introduction
Chemical-shift encoding-based water-fat separation methods have been developed to quantify fat content (1,2). Recently, there has been growing interest in quantifying the fatty acid composition of fat due to its potential in evaluating metabolic disorders and inflammatory conditions (3). Previous studies indicate that the key to quantifying the fatty acid composition lies in determining the number of double bonds (ndb) in triglycerides (4-7). Optimization algorithms have been proposed to quantify this variable using chemical-shift encoded multi-echo methods (4,5,8).
In recent years, there is strong interest in applying deep neural networks in quantitative magnetic resonance imaging (qMRI) for tasks such as image reconstruction and parameter mapping as they are more robust compared to conventional fitting methods (9). qMRI collects data in higher dimensions than the conventional anatomical imaging. Deep learning shows promising performance in handling high-dimensional data (9). However, most deep learning-based methods require a substantial amount of data to train the neural network. For many medical imaging tasks, it is challenging or even impractical to collect sufficient amount of training data. Deep image prior (DIP) method (10) has been proposed as an unsupervised learning method to address ill-posed inverse problems using deep network without requiring any training data. The original DIP approach was found promising in image denoising and restoration tasks (10-12). The recent work demonstrated the potential of DIP in medical imaging tasks where training data is difficult to acquire, including PET image reconstruction and restoration (13-15) and magnetic resonance (MR) image reconstruction (16-19).
One challenge for quantifying the ndb in triglycerides is that it is highly susceptible to signal perturbations, making it an ill-posed problem. Deep learning approaches were previously reported promising in qMRI in the presence of signal perturbations (20,21). However, training a deep neural network requires a sufficient amount of data and the data available for quantifying fatty acid is scarce in the community. The properties of DIP make it a proper choice to address these issues in quantifying the numbers of double bonds in triglycerides via neural networks. In this work, we investigated DIP for mapping the ndb in triglycerides from multi-echo magnetic resonance imaging (MRI). We demonstrated the feasibility using both phantom and in vivo experiments.
Methods
Signal model
The multi-peak multi-echo signal with water and fat contents at echo time t can be expressed as (4):
Where W, F are water and fat signal, respectively; ϕ is a complex map with the real and imaginary component representing the sum of field map and R2* map; is the normalization factor; αm is the amplitude of the mth fat peak and is the function of the ndb and the number of methylene-interrupted double bonds (nmidb); and ωm is the known chemical shift of the mth fat peak. We adopt the eight-peak fat model in the study by Trinh et al. (4), which is also referred as the “free model”. In free model, ndb and nmidb are estimated as independent parameters. More details of the free model can be found in Tab. 1 in the study by Trinh et al. (4).
Network fitting
We first recap the basic concept of DIP and present its extension to the mapping of numbers of double bonds in triglyceride. The DIP method is unsupervised and interprets the output of a deep network as a parametrization of an image (10). In the original DIP method, the deep network with network weight θ takes a randomized noise map Z as input and map it to a denoised image with the same size. Given a noisy image I, the denoised image I* can be obtained by minimizing the following equation:
The network is enforced to reconstruct the noisy image I through iterations and early stopping is applied so that the output tensor is a denoised image rather than the noisy one. The DIP is based on an assumption that the structure of an autoencoder-like architecture already captures the main image statistics, which is independent of learning. Those statistics priors are associated with low-level image features, and it is the must-need information to recover an image with high fidelity from a degraded one. Intuitively, the clean image with high fidelity can emerge during the process of reconstructing the noisy image by using those image statistics.
As for quantifying the parametric maps in our task, the network is defined as a parametric map generator and the optimization can be defined as follows:
where A is the estimated parametric map; e is the index of the echo and ye stands for the acquired signal under the corresponding time of echo; p is the index of the pixels in an image; and L is the cost function and L1 norm is chosen as the cost function.
The network takes the randomized map as input and the network parameters are updated iteratively following the optimization in Eq. [3]. The result arrives once the loss function is converged, and early stopping is applied. There is no training for the neural network as the parameters of the network are updated solely for the MR parameters of a single slice, and the optimization itself can be regarded as the inference. The network has five outputs, including W, Ff, ϕ, ndb and nmidb. The complex images W, Ff, ϕ have two output channels, one for the real part and the other for the imaginary part. The ndb and nmidb outputs have one channel. We group the fat signal term F and the normalization factor term f into one term as we empirically find it more stable to fit the network. The fat signal can be recovered once ndb and nmidb are obtained as they can be used to compute the normalization factor. The fat fraction (FF) is defined as . Sigmoid function is placed at the channel of ndb to limit its range from 0 to 6 as this is the typical range of the numbers of double bonds (5). We choose not to limit the range of nmidb as we empirically found that it did not create much difference to the convergence. We adopt a UNet like architecture for the deep network (22). The optimization process is shown in Figure 1.
Data acquisition
The in-vivo study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by ethics review board at the Chinese University of Hong Kong (No. 2016.150) and informed consent was taken from all individual participants.
We employed the multi-echo gradient echo sequence protocol described in (23) to acquire the images. Scans were performed using a Philips Elition 3T MRI scanner (Philips Healthcare, Best, the Netherlands). A 32-channel head coil and a 32-channel cardiac coil were used as the receiver for phantom and abdominal imaging, respectively. A total of 14 echoes were acquired with the following parameter settings: TE1/ΔTE =1.20/0.7 ms, repetition time (TR) =10 ms, flip angle =20 degrees, matrix size =160×120, field of view (FOV) =400 mm × 300 mm, and slice thickness =6 mm. Seven slices were acquired. Frequency encoding direction is from right to left. All data were collected in axial plane. For in vivo scan, the data of each slice was acquired in a single breath hold of 17 seconds.
Implementation details
The experiments were conducted in a computing environment running Windows 11 with Python 3.7. All experiments were implemented using PyTorch 1.9 (24). The computational tasks were performed on a system equipped with an RTX 4090 GPU (24 GB) and an i9-13900 CPU. For comparative analysis, we also implemented the algorithm described in the study by Trinh et al. (4) to quantify the ndb by fitting the data to the eight-peak free model. In the subsequent discussion, we refer to this method as least square fitting (LSF), as it is based on the LSF method of water-fat separation (2). All images were cropped to ensure the samples occupied the majority part of the image. The Adadelta optimizer (25) was employed, and the learning rate was set to 5e−2. The fitting of DIP was conducted with 180,000 iterations and it took around an hour to complete. The fitting of the LSF took 15 to 30 minutes in our experiments.
Phantom experiment
To validate the measured ndb values of different vegetable oils by comparing them to the literature values previously published (5,26), we prepared 50 mL tubes of various kinds of pure vegetable oil (olive, peanut, safflower, walnut, grapeseed, canola, and corn oil). These tubes were immersed in water bath in our phantom. In addition to the pure oil phantoms, we also used corn oil and created a representative water-fat phantom with an FF of approximately 60% to evaluate the algorithm’s ability to quantify the ndb in the presence of mixed water and fat. The creation of the water-fat phantom followed the protocol outlined in (27). The agar solution was prepared by heating and mixing distilled water, agar powder, sodium dodecyl sulfate, and sodium benzoate (all sourced from Sigma Aldrich, St. Louis, MO, USA). The agar and oil solutions were blended, stirred, heated, and subsequently cooled to form the water-fat phantom.
The analysis of the results was performed within regions of interest (ROIs). Circular ROIs were placed at the center of each tube. The mean and the standard deviation of all pixel values within the ROIs of all slices were calculated.
In-vivo experiment
We performed an in-vivo abdomen scan on a healthy volunteer. The ROIs were chosen on the area of subcutaneous fat. The mean value within the ROIs of all the slices was compared with previously published literature values (5,26).
Results
Table 1 shows the fitted result compared with the literature values of both the DIP and the LSF method.
Table 1
Oil category | DIP | LSF | Literature values |
---|---|---|---|
ndb | |||
Safflower | 4.95±0.24 | 4.95±0.23 | 5.14 |
Walnut | 4.82±0.23 | 4.80±0.25 | 5.02 |
Grapeseed | 4.74±0.22 | 4.64±0.25 | 4.55 |
Canola | 3.98±0.24 | 3.70±0.24 | 3.62 |
Corn | 4.54±0.30 | 4.29±0.24 | 4.31 |
Olive | 2.87±0.19 | 2.63±0.23 | 2.89 |
Peanut | 3.34±0.20 | 3.13±0.23 | 3.48 |
nmidb | |||
Safflower | 2.28±0.12 | 2.27±0.10 | 2.34 |
Walnut | 2.16±0.13 | 2.14±0.16 | 2.32 |
Grapeseed | 2.08±0.11 | 2.00±0.14 | 2.3 |
Canola | 1.47±0.09 | 1.27±0.11 | 1.14 |
Corn | 1.92±0.17 | 1.71±0.15 | 1.75 |
Olive | 0.77±0.12 | 0.64±0.13 | 0.35 |
Peanut | 1.04±0.13 | 0.91±0.23 | 1.01 |
The reference value of the grapeseed oil is from the study by Trinh et al. (26) while the rest are from the study by Bydder et al. (5). The values are presented as mean ± SD. ndb, number of double bonds; nmidb, number of methylene-interrupted double bonds; DIP, deep image prior; LSF, least square fitting; SD, standard deviation.
Figure 2 shows the maps of the ndb and the nmidb from the pure oil phantoms. Note the measured values are close to the expected values, and all the tubes show an FF of nearly 100%.
Figure 3 illustrates the regression plot depicting the correlation between the measured values and the literature values of ndb and nmidb for both the DIP method and the LSF method. The plot reveals a strong correlation, and the regression lines closely align with the reference plot of y=x. For the LSF method, the Pearson correlation coefficient between the measured values and the literature values is 0.98 (P=8.89e−5) for ndb and 0.98 (P=9.34e−5) for nmidb. The regression analysis yields a slope of 1.04 and an intercept of −0.29 for ndb and a slope of 0.98 and an intercept of −0.04 for nmidb. Similarly, the DIP method exhibits a reasonable correlation performance, with a Pearson correlation coefficient of 0.96 (P=0.0005) for ndb and 0.96 (P=0.0006) for nmidb. The regression analysis of DIP yields a slope of 0.92 and an intercept of 0.35 for ndb and a slope of 0.90 and an intercept of 0.20 for nmidb.
Figure 4 shows the results from the phantom with mixed water and fat made from corn oil. The DIP method yields consistent measurement in two phantoms (pure oil ndb =4.50±0.21, mixed phantom ndb =4.43±0.24, pure oil nmidb =1.88±0.12, mixed phantom nmidb =1.83±0.15). The LSF method shows inconsistent measurement in two phantoms (pure oil ndb =4.32±0.22, mixed phantom ndb =5.27±0.26, pure oil nmidb =1.72±0.13, mixed phantom nmidb =2.58±0.16), respectively. The robustness of DIP method can also be appreciated from its closer FF result (62.76%) to the reference value (60%). In contrast, the LSF method yields FF with a larger deviation (66.70%). For DIP, there is no significant differences between the result of pure oil and mixed phantom (P=0.20 for ndb and P=0.15 for nmidb).
Figure 5 shows the in-vivo results obtained from a typical slice using both methods. Note the published reference value of ndb and nmidb of fat in the subcutaneous region are 2.88 and 0.70, respectively (5). The measured values of the healthy volunteer are shown in Table 2, indicating a reasonable alignment of the measured values and the literature values. It is also observed that there is left-right variation of the quantification map along the frequency encoding direction, likely due to echo shift along frequency encoding.
Table 2
Variables | DIP | LSF |
---|---|---|
ndb | 2.83±0.74 | 3.10±1.96 |
nmidb | 0.74±0.35 | 0.89±0.15 |
The values are presented as mean ± SD. ndb, number of double bonds; nmidb, number of methylene-interrupted double bonds; DIP, deep image prior; LSF, least square fitting; SD, standard deviation.
Discussion
We demonstrated the feasibility of using the DIP-based method to measure the numbers of double bonds in triglycerides from chemical shift-encoded multi-echo gradient echo images. The results obtained from pure oil phantoms and subcutaneous fat show agreement between the measured values and the literature values. We observed that the DIP method demonstrated a superior performance compared to LSF method in the mixed water-fat phantom. We attribute the better performance to the denoising ability of the DIP method. It is likely that the LSF method requires exceedingly high FF to provide sufficient signal-to-noise ratio (SNR) for reliable quantification of the ndb. The convolutional neural network architecture itself is assumed to have the ability to capture implicit prior of the image, filter out noise and generate the main content of the image with high fidelity. In the original DIP method for denoising application, the network is enforced to reconstruct a noisy image while the image with reduced noise emerges after early-stop of iterations. In our work, the DIP network is enforced to reconstruct the gradient echo images from the output parametric maps. It is likely inherent denoising effect in the signal domain contribute to the fitting of the parametric maps, and thus improves robustness to estimate ndb and nmidb map compared to the LSF method when the fat signal is reduced.
It is important to note that the DIP-based method for fitting the numbers of double bonds is unsupervised and does not need pretraining the neural network. Self-supervised learning methods have been proposed for parametric MRI, which do not require ground truth for training but still require a substantial number of unlabeled images as the training data (28,29). In contrast, DIP only trains the network on a single dataset and the training itself is the inference. It may have potential significance in other learning-based qMRI mapping tasks, in which it is challenging to acquire training data.
While our initial validation of using DIP to fit the numbers of double bonds is promising, this work has limitations. The network needs to be retrained each time for a new mapping task, which was also pointed out in the original DIP work (10). This increases the computation cost of inference. New methods for accelerating the optimization process have been proposed recently (30) and it is worth looking into the application in qMRI. Quantifying fatty acid content is closely associated with water fat separation, and we may study the performance of the algorithms on more challenging scenarios that may produce water-fat swap and consider applying advanced correction method (31-34). In addition, the performance of the fitting algorithm can be studied on different water-fat models (7). Further work includes looking into the exact reason of left-right variation of in-vivo quantification maps and the corresponding measures to alleviate those impact. The impact of complex field map needs to be explored in the future as and field inhomogeneity may provide more clinical information. This study uses measurements from previous publication as reference value for validation. It is worthy of conducting gas chromatography and use it as a reference value in future studies. It is also important to conduct more in vivo studies on patients with various FFs to understand the feasibility of the DIP approach in clinical imaging.
Conclusions
We demonstrated the feasibility of using DIP to fit the ndb in triglycerides from chemical shift-encoded multi-echo gradient echo MRI. Further studies are needed to validate this approach on more types of oils and a larger cohort of subjects.
Acknowledgments
Funding: This study was supported by grants from
Footnote
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://qims.amegroups.com/article/view/10.21037/qims-24-1507/coif). Q.C. serves as an unpaid editorial board member of Quantitative Imaging in Medicine and Surgery and is the employee of Philips Electronics Hong Kong Limited. V.W.W. served as a consultant or advisory board member for AbbVie, AstraZeneca, Boehringer Ingelheim, Echosens, Gilead Sciences, Intercept, Inventiva, Merck, Novo Nordisk, Pfizer, Sagimet Biosciences, TARGET PharmaSolutions, and Visirna; and a speaker for Abbott, AbbVie, Echosens, Gilead Sciences, Novo Nordisk, and Unilab. He has received a grant from Gilead Sciences, and is a cofounder and a shareholder of Illuminatio Medical Technology. W.C. is a co-founder and a shareholder of Illuminatio Medical Technology Limited and PrivacyPro Medical Technology Limited. The other authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by ethics review board of the Chinese University of Hong Kong (No. 2016.150) and informed consent was taken from all individual participants.
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Dixon WT. Simple proton spectroscopic imaging. Radiology 1984;153:189-94. [Crossref] [PubMed]
- Reeder SB, McKenzie CA, Pineda AR, Yu H, Shimakawa A, Brau AC, Hargreaves BA, Gold GE, Brittain JH. Water-fat separation with IDEAL gradient-echo imaging. J Magn Reson Imaging 2007;25:644-52. [Crossref] [PubMed]
- Reddy JK, Rao MS. Lipid metabolism and liver inflammation. II. Fatty liver disease and fatty acid oxidation. Am J Physiol Gastrointest Liver Physiol 2006;290:G852-8. [Crossref] [PubMed]
- Trinh L, Peterson P, Leander P, Brorson H, Månsson S. In vivo comparison of MRI-based and MRS-based quantification of adipose tissue fatty acid composition against gas chromatography. Magn Reson Med 2020;84:2484-94. [Crossref] [PubMed]
- Bydder M, Girard O, Hamilton G. Mapping the double bonds in triglycerides. Magn Reson Imaging 2011;29:1041-6. [Crossref] [PubMed]
- Hamilton G, Yokoo T, Bydder M, Cruite I, Schroeder ME, Sirlin CB, Middleton MS. In vivo characterization of the liver fat 1H MR spectrum. NMR Biomed 2011;24:784-90. [Crossref] [PubMed]
- Berglund J, Ahlström H, Kullberg J. Model-based mapping of fat unsaturation and chain length by chemical shift imaging--phantom validation and in vivo feasibility. Magn Reson Med 2012;68:1815-27. [Crossref] [PubMed]
- Peterson P, Månsson S. Simultaneous quantification of fat content and fatty acid composition using MR imaging. Magn Reson Med 2013;69:688-97. [Crossref] [PubMed]
- Zhu Y, Cheng J, Cui ZX, Zhu Q, Ying L, Liang D. Physics-Driven Deep Learning Methods for Fast Quantitative Magnetic Resonance Imaging: Performance improvements through integration with deep neural networks. IEEE Signal Processing Magazine 2023;40:116-28.
- Ulyanov D, Vedaldi A, Lempitsky V, editors. Deep image prior. Proceedings of the IEEE conference on computer vision and pattern recognition 2018. Available online: https://github.com/DmitryUlyanov/deep-image-prior
- Liu J, Sun Y, Xu X, Kamilov US, editors. Image restoration using total variation regularized deep image prior. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); IEEE 2019:7715-9.
- Liu Y, Pan J, Ren J, Su Z, editors. Learning deep priors for image dehazing. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); 2019:2492-500.
- Hashimoto F, Ote K, Onishi Y. PET image reconstruction incorporating deep image prior and a forward projection model. IEEE Trans Radiat Plasma Med Sci 2022;6:841-6.
- Liu Q, Tsai YJ, Gallezot JD, Guo X, Chen MK, Pucar D, Young C, Panin V, Casey M, Miao T, Xie H, Chen X, Zhou B, Carson R, Liu C. Population-based deep image prior for dynamic PET denoising: A data-driven approach to improve parametric quantification. Med Image Anal 2024;95:103180. [Crossref] [PubMed]
- Kaplan S, Zhu YM. Full-Dose PET Image Estimation from Low-Dose PET Image Using Deep Learning: a Pilot Study. J Digit Imaging 2019;32:773-8. [Crossref] [PubMed]
- Lin YC, Huang HM. Denoising of multi b-value diffusion-weighted MR images using deep image prior. Phys Med Biol 2020;65:105003. [Crossref] [PubMed]
- Darestani MZ, Heckel R. Accelerated MRI with un-trained neural networks. IEEE Trans Comput Imaging 2021;7:724-33.
- Yoo J, Jin KH, Gupta H, Yerly J, Stuber M, Unser M. Time-Dependent Deep Image Prior for Dynamic MRI. IEEE Trans Med Imaging 2021;40:3337-48. [Crossref] [PubMed]
- Xiong Z, Gao Y, Liu Y, Fazlollahi A, Nestor P, Liu F, Sun H. Quantitative susceptibility mapping through model-based deep image prior (MoDIP). Neuroimage 2024;291:120583. [Crossref] [PubMed]
- Huang C, Wong VW, Chan Q, Chu WC, Chen W. An uncertainty aided framework for learning based liverT(1ρ)mapping and analysis. Phys Med Biol 2023;
- Shih SF, Kafali SG, Calkins KL, Wu HH. Uncertainty-aware physics-driven deep learning network for free-breathing liver fat and R(2) * quantification using self-gated stack-of-radial MRI. Magn Reson Med 2023;89:1567-85. [Crossref] [PubMed]
- Buda M, Saha A, Mazurowski MA. Association of genomic subtypes of lower-grade gliomas with shape features automatically extracted by a deep learning algorithm. Comput Biol Med 2019;109:218-25. [Crossref] [PubMed]
- Available online: https://imagemed.univ-rennes1.fr/en/mrquantif/protocols
- Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. Pytorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems (NeurIPS); 2019;32.
- Zeiler MD. ADADELTA: an adaptive learning rate method. arXiv preprint. 2012. arXiv:1212.5701.
- Trinh L. Quantification of Fatty Acid Composition Using MRI: Comparison of Accuracy at 1.5, 3 and 7 T. MSc Thesis, Lund University, Sweden; 2013.
- Hines CD, Yu H, Shimakawa A, McKenzie CA, Brittain JH, Reeder SB. T1 independent, T2* corrected MRI with accurate spectral modeling for quantification of fat: validation in a fat-water-SPIO phantom. J Magn Reson Imaging 2009;30:1215-22. [Crossref] [PubMed]
- Liu F, Kijowski R, El Fakhri G, Feng L. Magnetic resonance parameter mapping using model-guided self-supervised deep learning. Magn Reson Med 2021;85:3211-26. [Crossref] [PubMed]
- Huang C, Qian Y, Yu SC, Hou J, Jiang B, Chan Q, Wong VW, Chu WC, Chen W. Uncertainty-aware self-supervised neural network for liverT1ρmapping with relaxation constraint. Phys Med Biol 2022; [Crossref]
- Li T, Wang H, Zhuang Z, Sun J, editors. Deep random projector: Accelerated deep image prior. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2023.
- Peterson P, Månsson S. Fat quantification using multiecho sequences with bipolar gradients: investigation of accuracy and noise performance. Magn Reson Med 2014;71:219-29. [Crossref] [PubMed]
- Colgan TJ, Hernando D, Sharma SD, Reeder SB. The effects of concomitant gradients on chemical shift encoded MRI. Magn Reson Med 2017;78:730-8. [Crossref] [PubMed]
- Ruschke S, Eggers H, Kooijman H, Diefenbach MN, Baum T, Haase A, Rummeny EJ, Hu HH, Karampinos DC. Correction of phase errors in quantitative water-fat imaging using a monopolar time-interleaved multi-echo gradient echo sequence. Magn Reson Med 2017;78:984-96. [Crossref] [PubMed]
- Soliman AS, Wiens CN, Wade TP, McKenzie CA. Fat quantification using an interleaved bipolar acquisition. Magn Reson Med 2016;75:2000-8. [Crossref] [PubMed]