1.IntroductionThe tumor microenvironment (TME) is compositionally and structurally heterogeneous, hosting a complex network of biomolecules that encode a variety of biological signals, and metabolic and immune interactions. Collagen is the dominant structural protein in the extracellular matrix (ECM) of the TME.1 Among the 28 types of collagen, fibrillar type I collagen forms a triple-helix structure and organizes itself into a fiber-like structure,2 being the primary collagen in the ECM. It has been shown that the changes in the collagen (especially type I collagen) structure or distribution are linked to many diseases including cancer. Specifically, parameters such as fiber density and orientation of collagen fibers have a significant impact on the progression and treatment of cancer.3–6 For example, the stroma in the TME of pancreatic ductal adenocarcinoma (PDAC) is highly fibrotic, constituting up to 85% of the tumor volume.7 PDAC fibrosis affects the efficacy of cytotoxic therapies and can compromise drug delivery.2 Therefore, comprehensive analysis of fibrillar collagen in the TME is critical for understanding the role of collagen’s architectural changes in carcinogenesis and metastasis, which have implications for the development of effective cancer therapies and personalized treatments. Conventional histological staining agents, such as Masson’s Trichrome, Movat’s Pentachrome, Van Gieson’s stain, and Picrosirius Red, can be used for collagen imaging.2,8 While these labeled techniques enable inspection of collagen in tissue samples using standard widefield microscopes, they are fairly laborious and time-consuming, requiring specialized protocols to overcome stain variation effects. Moreover, histological staining-based techniques are limited in resolving collagen fibers with high resolution and providing quantifiable metrics needed for prognosis and treatment studies. The implementation of artificial intelligence models to enhance the diagnostic potential of collagen-stained images has shown promise, but it is limited by the lack of insight into the biochemistry and the detailed morphology of the ECM.9,10 Label-free collagen-specific imaging modalities, such as polarized light11–13 and second harmonic generation (SHG) microscopy, have been demonstrated to overcome shortcomings of conventional stain-based imaging. Specifically, SHG is a second-order nonlinear coherent scattering process that is highly specific to non-centrosymmetric fibrillar collagen and its supramolecular fiber morphology. In addition, SHG microscopy has sub-micrometer resolution and can perform optical sectioning of tissue with an imaging depth of up to hundreds of micrometers.14,15 SHG has been used to study tumor-associated collagen signatures3,16 (TACS); therefore, it is suitable to be ground truth in our study. In particular, TACS-3, a pattern exhibiting high fiber alignment perpendicular to tumor boundaries, has been shown to be a negative prognostic factor in breast cancer.17 Moreover, similar morphological signatures have also been found in other types of cancer such as skin,18,19 ovarian,4,20 prostate,21 and pancreas.6,22 While SHG imaging has elucidated the biomedical consequences of architectural changes in TME, the molecular mechanisms that drive collagen alterations are still poorly understood. Thus, there is a need for a multimodal approach where SHG and chemical imaging methods sensitive to molecular changes are employed together in TME investigations. Infrared absorption spectroscopy is a label-free analytical technique that provides quantitative biochemical information by probing the vibrational bands of functional biomolecules. Specifically, in the mid-infrared (MIR) fingerprint region ( to ), spectrochemical analysis of measured transmittance through a specimen reveals compositional information. Recently, tunable quantum cascade lasers (QCL) with high spectral power output enabled the development of wide-field MIR spectral imaging systems that can operate at room temperature with compact footprints.23–25 The QCL-based MIR spectral imaging (MIRSI) can rapidly collect hyperspectral datasets from whole-slide tissue sections and has the unique capability to combine spatial and chemical information.26 Previous important studies revealed that the MIR spectral analysis can detect different collagen types,27,28 identify fibrosis in numerous tissues, including the liver,29 heart,30 and bone marrow,31 and provide critical prognostic information based on reactive stroma in TME-based investigations.32,33 In these past reports, the MIRSI-identified fibrotic regions were primarily referenced to stained images of adjacent tissue sections. However, an objective benchmarking of MIRSI-detected collagen fibers with respect to the ground truth SHG microscopy on the same tissue sample has not been performed. Here, we present a new label-free multimodal imaging approach using MIRSI and SHG imaging modalities to sequentially acquire complementary chemical and morphological information from the same biological tissue sample (Fig. 1). We first developed a protocol that enabled reliable image acquisition from the same tissue section using two different microscopes, which employ two distinct frequency regions of the electromagnetic spectrum, i.e., visible-near-infrared () and MIR ( to ). To classify fibrotic regions in pancreatic tissue samples based on collagen’s spectral signatures, we trained a random forest (RF) model using large hyperspectral MIRSI datasets and SHG images of the same tissues as the ground truth. This RF model, which we named RF-MIRSI, was then used to identify regions of high collagen composition from the whole-tissue sections. Finally, we validated the RF-MIRSI fibrotic collagen segmentation method, referencing our findings to the SHG images. The RF-MIRSI method achieved a high average boundary -score (0.8 at 4-pixel thresholds) in the collagen distribution, high correlation (Pearson’s 0.82) in the collagen orientation, and similarly high correlation (Pearson’s 0.66) in the collagen alignment. Our label-free multimodal collagen fiber imaging approach is a key step towards future comprehensive tumor tissue investigations where both morphometric and chemometric information are considered to better study tumor-promoting ECM alterations. 2.Materials and Methods2.1.Tissue Sample and Pure Collagen Sample PreparationFive human pancreas tissue samples were selected from archival, formalin-fixed paraffin-embedded (FFPE) blocks that were created at the Translational Science Biocore BioBank of the University of Wisconsin Carbone Cancer Center. Each FFPE block contains one pancreas tissue sample of one patient. The diagnosis of the patients was not retrieved since it was not needed for this proof-of-concept imaging study. The Translational Research Initiatives in Pathology Laboratory of the University of Wisconsin-Madison prepared the tissue sections for this study using Richard-Allan Scientific Cytoseal 60 Histology Mounting Media. Specifically, two adjacent -thick sections were cut from each block and then deparaffinized. Among these two tissue sections, one was first hematoxylin and eosin (H&E) stained and then enclosed with #1.5 coverslip for later use in bright-field imaging; the other section was unstained, first put on a substrate for MIRSI imaging and then #1.5 cover-slipped for the following SHG imaging. was used as a substrate for MIRSI imaging because it does not have any spectral signature in our MIR spectral window. For the spectrum in Fig. S2 in the Supplementary Material, pure human type I collagen (Cat # CC050, Sigma Aldrich, St. Louis, Missouri, United States) of volume was drop casted on substrate, was dried for 2 h, and was acquired using the same MIRSI microscope described in Sec. 2.3. 2.2.Second Harmonic Generation MicroscopyAll slides in this study were imaged with a custom-built SHG backward detection multiphoton microscope described previously.34 In this system, a Coherent Chameleon Ultra II Ti:Sapphire laser (Coherent, Santa Clara, California, United States) was used to deliver 890 nm light to the sample using a air immersion objective (Nikon, Melville, New York). The backward SHG signal was filtered with a bandpass filter (, Semrock, Rochester, New York) and collected with a H7422P-40 GaAsP photomultiplier tube (Hamamatsu, Hamamatsu, Japan). Circular polarization was implemented for the SHG light source. All images were collected at resolution () with consistent acquisition settings using in-house developed acquisition software WiscScan.35 The images were then stitched together for each slide using the Fiji Grid/Collection Stitching plugin.36 2.3.Mid-infrared Spectral ImagingThe MIR spectral images were acquired using the Spero-QT (DRS Daylight Solutions Inc., San Diego, California, United States), which uses four tunable QCLs that span the spectral region of 950 to with resolution. The microscope was run in transmission mode using a 12.5× air objective (0.7 NA). The linearly polarized infrared light transmitted by the tissue sample was measured at room temperature by a microbolometer focal plane array (image pixel size: ). The sample chamber was kept inert with nitrogen gas. To scan an entire tissue sample, a translational stage was used to image the region of interest (ROI) consisting of multiple field of views (FOVs) as illustrated in Fig. S1 in the Supplementary Material. At the time of the measurements, the device suffered some instability at the spectral region of 1440 to , where the switching between two different QCLs happens. Therefore, the region was omitted from further analysis. For this work, MIRSI data from five tissue sections were collected, totaling FOVs for validation and FOVs for training, each with , summing up to million spectra and million spectra for validation and training, respectively. Overall, we combined RF-MIRSI FOVs encompassing all five tissues into 33 large ROIs, each of size ( hyperspectral pixels). In each ROI, spectra from non-tissue regions (on average about 20%) were excluded. 2.4.Spectral Data ProcessingThe infrared spectral images were processed with MATLAB (MathWorks, Natick, Massachusetts, United States) using the methods described before.26,37 The raw hyperspectral images consist of data cubes with dimensions of for each FOV. To exclude spectral data from the non-tissue regions, the protein-associated amide I absorption peak was considered because proteins are omnipresent in any tissue sample. The presence of the amide I spectral peak was determined by taking the intensity difference between (amide I) and , and if the value lies between 0.1 and 2.0 a.u., then it implies the presence of a significant amide I infrared spectral peak and that pixel was labeled as tissue associated and considered in the following workflow. Subsequently, all the spectra excluding the spectral region of 1440 to were normalized so that its Euclidean norm was set to unity. To reduce the noise in the hyperspectral MIRSI data, a principal component analysis (PCA) was first performed on the raw datasets. Only the first 40 principal components were kept (average 99.8% of variance) to reconstruct the spectra in the original wavenumber space to reduce the noise.26 The PCA noise reduction procedure was done separately for training and validation datasets. 2.5.Random Forest ModelThe training and validation datasets were labeled based on the ground truth SHG images. SHG images were first registered to MIRSI hyperspectral images using landmark registrations in MATLAB as described in Ref. 38. Briefly, matching points were selected manually to indicate common points between both images using MATLAB’s Control Point Selection (cpselect) function.39 Then, the affine transformation that matches their scaling, translation, and rotation was determined using MATLAB’s fitgeoform2d function40 and applied to the SHG image using MATLAB’s imwarp function.41 The matched SHG images were then binarized to provide a classification label (collagen/non-collagen) for every MIRSI spectra. The SHG images were binarized by first choosing a proper background threshold. The threshold within an image was set to a level manually such that dark counts (signals from outside of tissue region) were mostly removed while the remaining true collagen signals were maximized. Across all used SHG images, the threshold was set to be of the maximum intensity. Next, a density filter was applied to filter out isolated collagen pixels from the binarized SHG images, i.e., those with less than six neighboring collagen pixels, and henceforth is referred to as non-structural collagen. For RF model training, two MIRSI ROIs from two different tissue samples deemed representative of the whole dataset (assessed through their corresponding SHG images) were chosen and yielded million spectra. There was no overlap between training and validation ROIs. From the training spectra, those that were not from tissues and surpluses were discarded to avoid class imbalance between collagen and non-collagen. In total, 1.4 million collagen and 1.4 million non-collagen labeled spectra were used for training. An overview of the training and validation data can be found in Table S1 in the Supplementary Material. The TreeBagger class42 from MATLAB was used to train the RF model. The number of trees was set to 50. Increasing the number of trees to 500 did not yield any perceptible advantage; therefore, 50 was kept. As per the default recommendation, the number of features used for each decision split was set to 21 by rounding up the square root of the number of features (). The accuracy of the RF model was quantified using out-of-bag error, which is equivalent to cross-validation for the RF algorithm.43 The model achieved an accuracy of 79% calculated using the MATLAB function oobError.44 Furthermore, the importance of each feature for the classification was determined by taking the feature out of the model and calculating the model performance drop based on a randomly selected training dataset (out-of-bag dataset). This calculation of the feature importance was done using the built-in implementation in MATLAB by setting the “OOBPredictorImportance” parameter to on, as detailed in Ref. 42. 2.6.Image Processing and RF Model EvaluationBased on the RF model training, we first generated a new set of RF-MIRSI images of all five tissue sections, where each pixel value yields collagen probability. These probability maps, where collagen and non-collagen classes for each pixel are specified, were used in our method validation. The pixels whose probability of being collagen was higher than 50% were classified as collagen pixels and the rest as non-collagen pixels. This rather low probability threshold was chosen to accurately detect collagen-containing pixels, and a 50% threshold value was used throughout our analysis unless otherwise noted. However, this threshold resulted in many isolated pixels from non-structural collagen that can be considered noise in our study. To remove this noise from RF-MIRSI images, we applied a density filter, where only pixels with at least six surrounding predicted collagen pixels were kept, filtering out non-structural collagen. For validation, the corresponding binary images of RF-MIRSI were compared with the binarized SHG images (details on binarization were given in Sec. 2.5). To validate the collagen distribution from the RF-MIRSI model, we calculated a metric named boundary -score (BF-score). The -score is the harmonic mean of the precision and recall values and is defined as with TP, FP, and FN true positive, false positive, and false negative, respectively. -score ranges from 0 to 1, and 1 indicates perfect classification.45 The BF-score is an extension of the -score that takes distance error tolerance into consideration by defining a boundary to determine whether a point matches the ground truth or not. It was calculated using MATLAB’s BF-score function46 for all 33 ROIs that encompass all five tissues as described in Sec. 2.4. The BF-scores were calculated with various pixel thresholds up to 8 pixels (1 pixel represents , and the largest diffraction limited point is ). Similar calculations were done using RF-MIRSI images that were obtained at different collagen probability thresholds. To examine the capabilities of our technique for studying collagen morphology, we calculated both dominant orientation and coherency for each selected ROI. Both metrics were obtained by calculating the structure tensor of each ROI using the Fiji plugin OrientationJ.47 The dominant direction is defined as the primary orientation of the ROI, ranging from 0 to 180 deg. The coherency is an indicator of fiber alignment within an ROI, ranging from 0 to 1, with “0” indicating isotropic fiber orientations and “1” indicating fibers aligned in one direction. For both metrics, the size of ROI was set to () to obtain the granularity of the direction. In total, 297 ROIs were studied. Pearson’s -value was calculated using MATLAB’s corrcoef function48 to evaluate the correlation between the RF-MIRSI images and SHG images for both dominant orientation and coherency. Pearson’s -value is a measure of the strength of the linear correlation between variables, ranging from to 1, where indicates a perfect negative linear relationship, 0 indicates no linear relationship, and 1 indicates a perfect positive linear relationship. 3.Results3.1.RF Training and RF-MIRSIMIRSI data of five tissues were collected, constituting FOVs of or around 68 million spectra. To illustrate our raw MIRSI spectral data, Fig. 2 shows a typical unprocessed MIRSI image collected at illumination that corresponds to the protein amide I band along with 10 randomly selected spectra for each of the three outlined ROIs. The example MIRSI image consists of four () stitched FOVs collected from a pancreatic tissue section. To train the RF, 2.8 million spectra, including collagen and non-collagen pixels (1.4 million each), were used. These training spectra were pre-processed to exclude non-tissue regions and reduce noise. For the ML training, SHG image pixels were used as classification labels. MIRSI images are registered to SHG images using landmark registration, which matches their scaling, translation, and rotation. Subsequently, the trained RF model was applied to the rest of the spectra collected from the tissues. For each pixel-associated spectrum, the RF outputs the probability of how likely that pixel contains collagen. More details about the data pre-processing workflow and the training of the RF can be found in Sec. 2. 3.2.Correlation Between Binarized RF-MIRSI and SHG ImagesFigure 3 presents the results of our correlation investigation between the RF-MIRSI-predicted collagen pixels and the SHG ground truth. A representative MIRSI image of a tissue region collected at protein amide I band illumination is shown in Fig. 3(a). The RF-MIRSI predicted collagen pixels from the same tissue region are shown in green in Fig. 3(b). To qualitatively illustrate the correlation between RF-MIRSI-predicted and SHG-identified collagen pixels, Fig. 3(b) shows the binarized SHG image in pink and the pixels that are identified as fibrillar collagen by both methods in white. Collagen fiber’s dominant direction and coherency were also calculated for a randomly chosen subregion, outlined with a white box in Fig. 3(b), and the findings were presented in Fig. 3(c). For reference, an unprocessed SHG image (raw) counterpart of the same subregion is shown in Fig. 3(d). A similar analysis using the rest of the data was also performed and will be explained below. For the quantitative evaluation of our multimodal imaging technique, we used BF-score, dominant angle, and coherency as metrics. We first divided RF-MIRSI data from five tissues into 33 large ROIs of size ( hyperspectral pixels). The average BF-scores calculated for each of the 33 ROIs are shown in Fig. 4(a) for various pixels and collagen probability thresholds. For the dominant direction and coherency calculations, we used smaller ROIs () to preserve granularity. The Bland-Altman plot shown in Fig. 4(b) depicts the agreement of the dominant direction for both SHG and RF-MIRSI measurements with Pearson’s of 0.82. Figure 4(c) depicts the distribution of the absolute angle difference between the two techniques with a mean of and a standard deviation of 14.9 deg. The coherency calculated from both techniques is shown in Fig. 4(d) with Pearson’s of 0.66. 3.3.Wavenumbers Significant in Detecting Collagen as Identified by Random ForestFigure 5(a) shows the wavenumbers as a function of their importance in detecting collagen as identified by the RF model. The top 20 best predictor wavenumbers are colored in maroon. To further examine this, the average and standard deviation of collagen and non-collagen spectra from the training data are shown in Fig. 5(b) with the top 20 wavenumber predictors indicated with stars. 4.DiscussionIn cancer, the growth of fibrous tissue around the tumor (also referred to as a desmoplastic reaction) has been shown to be an important hallmark of TME, where it presents distinct similarities to the wound healing response.49–51 Previously, highly organized fibrillar collagen patterns in TME were identified as negative cancer prognostic markers using SHG imaging, which is the current “gold standard” in fibrillar collagen studies.6 Despite their important prognostic potential, stromal-based biomarkers are not yet part of the current clinical histopathology because SHG is a low-throughput imaging technique and requires extensive expertise and costly instrumentation. MIRSI is a label-free and rapid hyperspectral digital tissue imaging technique, which can collect whole-slide tissue images within minutes. Here, we trained a machine learning (ML) algorithm using the gold-standard SHG images to generate fibrillar collagen maps using the rapid and high-throughput MIRSI modality. Our qualitative (Fig. 3) and quantitative (Fig. 4) investigations revealed that RF-MIRSI-predicted fibrotic collagen tissue regions correlate well with the gold-standard SHG imaging. Therefore, our method has the potential to contribute to the histopathology workflow by providing fibrillar collagen maps of whole-tissue pathology sections. In addition to providing fibrillar collagen maps, where there has been significant effort using other emerging technologies such as polarization microscopy12,13 to emulate SHG with lower cost, MIRSI takes advantage of potential biochemical signatures associated with specific diseases. Moreover, infrared photonics is a blooming field, and the costs of IR light sources and detectors are decreasing. Due to the emerging developments in the biophotonics field, the technology transfer of our proposed technique, collecting structural and chemical information from tissue samples, has a strong potential to make an impact in clinical settings. Notably, we report a significant difference between an MIR image of a tissue section collected at a single wavelength illumination associated with the protein band [Fig. 3(a)] and the collagen map predicted by the RF-MIRSI [Fig. 3(b)]. This highlights the strength of our RF-MIRSI method in parsing out most of the tissue regions as non-collagen areas, even though they contain significant amounts of protein as evidenced in Fig. 3(a), the amide I absorption signal. Moreover, contrary to traditional single-channel spectrometry, hyperspectral imaging modalities can collect spatially and spectrally rich datasets. When evaluated using ML and artificial intelligence models, MIR hyperspectral imaging can provide versatile information that is critical in the biomedical field. In multimodal imaging, accurate image registration is inherently challenging, especially when images are collected by different instruments with distinct spatial resolution limits, as in the case of our SHG/RF-MIRSI approach. To tackle the unavoidable image registration errors, we used the BF-score instead of the standard -score as a validation metric. The average BF-score is calculated for all 33 ROIs encompassing all five tissues with various pixel and collagen probability thresholds as shown in Fig. 4(a). Our results show that the average BF-score reaches within 4-pixel thresholds for the collagen probability threshold of 50% and 55%, which indicates a good correlation between both sets of images. Below 4-pixel thresholds, the average -score is lower due to the misalignment in the registration of the images. The misalignment sources can include (i) human error while picking common registration landmarks for registration, which is amplified because SHG and MIRSI images are collected at different magnifications (20× versus 12.5×), and (ii) mismatching non-affine deformations that come from the different imaging system such as barrel distortion as well as the cover slipping process for the SHG imaging (sample for MIRSI imaging is uncovered). While different factors impact the multimodal image alignment accuracy, using 4-pixel thresholds (equivalent to , which is close to the spatial resolution limit of MIRSI) in BF-score calculation successfully addresses this issue. Apart from misalignment, other factors that contribute to the deviations between the SHG and RF-MIRSI results can also be explained by either the inherent differences between the two processes (SHG versus absorption), such as their cross-sections and depth of field, or the difference in our implementation (SHG with circularly polarized and MIRSI with linearly polarized light), and finally, the performance of the algorithm itself. Moreover, our quantitative validation results in Fig. 4(a) show that the average BF-score decreases with increasing collagen probability threshold. This can be explained by the lower true positive classification outcomes when a larger threshold is applied to RF-MIRSI images. Therefore, in this work, we first detected as many collagen-classified pixels as possible using a 50% collagen probability threshold, then minimized the non-structural collagen using a density filter (see Sec. 2). A similar strategy and threshold were used while binarizing the SHG images. We also compared the morphology in the RF-MIRSI collagen maps and SHG images via the dominant angular direction parameter calculated in individual ROIs [Fig. 4(b)] and identified a high correlation (Pearson’s of 0.82). This is also evident in Fig. 4(c), where the histogram distribution of the absolute angle differences between the imaging modalities peaks close to zero. Furthermore, the alignment (coherency) calculated for both techniques showed a high degree of correlation [Pearson’s 0.66, Fig. 4(d)]. The outliers in Figs. 4(b)–4(d) can be mainly attributed to the following three factors: (1) the performance mismatch between the two imaging modalities, (2) the performance of the RF algorithm itself, and (3) the coverslip effect where MIRSI acquisition was done on non-coverslipped samples and SHG was on coverslipped samples. Application of the coverslip can alter the geometry in some of the regions of the tissue and can give an uncorrelated result between both modalities. Regardless, the overall results indicate that the RF-MIRSI images generated from MIRSI spectral data correlate well with the SHG images. Among the ML models, RF has the advantage of quantifying the importance of data features in the decision-making process. It has this advantage, because it keeps the linear independence of the features, unlike many other ML models that mix multiple features, thus abstracting its physical interpretation. Moreover, RF is also a nonlinear algorithm, which is suitable for our molecular fingerprint dataset. We showcase this advantage by calculating the wavenumber importance by quantifying the increase in error generated by excluding a specific spectral feature (see Sec. 2). Figure 5(a) shows the 20 highest-ranked predictors. In MIRSI-based collagen studies, the spectral focus is usually on the protein-associated amide I and amide II bands.52–59 Our RF model also identified four features falling within the amide I 1600 to range among the top 20 highest ranked. Interestingly, the RF model heavily relied on the spectral region between 1360 and , with 10 out of the top 20 predictors residing there, even though the average collagen and non-collagen spectra are not different in that region [Fig. 5(b)]. A small window (1360 to ) within this region contains the wagging vibration of the proline side chains present in type I collagen, found in biological tissues.56 Therefore, the dominant dependence of the RF model on this spectral region must be due to the abundance of proline and 4-hydroxyproline in collagen triple-helix ( occurrence of each in type I collagen60). This underscores the critical role of our holistic TME analysis via multimodal imaging, as it can provide access to biochemical information from the structurally altered tissue regions. Moreover, the vibrational fingerprints of molecules depend on many factors such as the molecules’ surrounding environment, concentration, temperature, and many more. We have shown that our tandem method selectively captures the molecular fingerprint of collagen in its relevant, native tissue environment. To demonstrate this point, the spectra of pure human type I collagen are shown in Fig. S2 in the Supplementary Material, and it is distinct from the collagen spectrum detected in the tissue with the help of SHG and RF, highlighting the strength of our approach. In future studies, specific biochemical information such as the integrity of collagen’s triple helix structure,52 cross-linking collagen concentrations,52,57 the collagen quality associated with non-enzymatic cross-linking,59 and many more leveraging existing databases and literature61 can be investigated in the context of diseases, especially those that have been previously studied using SHG or staining methods such as various organ fibrosis62,63 and cancers.64 Such investigations can help elucidate the molecular drivers behind the morphological alterations in the TME observed in various cancer grades. Similarly, MIRSI can be used to complement SHG by analyzing interactions of collagen with other important ECM molecules such as fibronectin, which cannot be detected by SHG.65 Our results can be further improved by employing recently developed advanced laser scanning-based MIRSI methods or photothermal imaging,66,67 which can achieve a higher spatial resolution and better match SHG imaging.68 Moreover, metasurface-enhanced MIRSI can be used to improve the sensitivity and selectivity of the absorption spectra by benefiting from the light-matter interactions at the photonic cavities with resonances tuned to the region with a high density of important predictors.69 In conclusion, the proof-of-concept RF-MIRSI model was successfully used to detect fibrillar collagen based on the MIRSI spectral data from pancreatic tissue samples. This technique can be adapted to other tissue types and can complement state-of-the-art imaging modalities and analytical techniques to further investigate the complex nature of fibrillar collagen in the TME. Code and Data AvailabilityThe code, data, and materials relevant to this work can be shared upon reasonable request. AcknowledgmentsF.Y. acknowledges financial support from the National Institutes of Health (NIH) (Grant Nos. R61CA281795 and R21EB034411). K.W.E. also acknowledges financial support from the NIH (Grant Nos. R01CA238191, U54CA268069, and P41GM135019). B.R.P. acknowledges financial support from the UW-Madison College of Engineering Graduate Engineering Research Scholars (GERS) Fellowship. Moreover, the author(s) thank Toshi Kinoshita for his assistance in the sample preparation and the University of Wisconsin Translational Research Initiatives in Pathology Laboratory, supported by the UW Department of Pathology and Laboratory Medicine, UWCCC (Grant No. P30 CA014520) and the Office of The Director-NIH (Grant No. S10 OD023526) for use of its facilities and services. Finally, the authors thank Prof. Eduardo R. Arvelo from the Electrical and Computer Engineering Department at the UW-Madison for guidance in ML methods. ReferencesE. Henke, R. Nandigama and S. Ergün,
“Extracellular matrix in the tumor microenvironment and its impact on cancer therapy,”
Front. Mol. Biosci., 6 160 https://doi.org/10.3389/fmolb.2019.00160
(2020).
Google Scholar
J. N. Ouellette et al.,
“Navigating the collagen jungle: the biomedical potential of fiber organization in cancer,”
Bioengineering, 8
(2), 17 https://doi.org/10.3390/bioengineering8020017 BENGEQ 0178-2029
(2021).
Google Scholar
P. P. Provenzano et al.,
“Collagen reorganization at the tumor-stromal interface facilitates local invasion,”
BMC Med., 4
(1), 38 https://doi.org/10.1186/1741-7015-4-38
(2006).
Google Scholar
O. Nadiarnykh et al.,
“Alterations of the extracellular matrix in ovarian cancer studied by second harmonic generation imaging microscopy,”
BMC Cancer, 10
(1), 94 https://doi.org/10.1186/1471-2407-10-94 BCMACL 1471-2407
(2010).
Google Scholar
C. J. Hanley et al.,
“A subset of myofibroblastic cancer-associated fibroblasts regulate collagen fiber elongation, which is prognostic in multiple cancers,”
Oncotarget, 7
(5), 6159
–6174 https://doi.org/10.18632/oncotarget.6740
(2015).
Google Scholar
C. R. Drifka et al.,
“Highly aligned stromal collagen is a negative prognostic factor following pancreatic ductal adenocarcinoma resection,”
Oncotarget, 7
(46), 76197
–76213 https://doi.org/10.18632/oncotarget.12772
(2016).
Google Scholar
K. J. Lafaro and L. G. Melstrom,
“The paradoxical web of pancreatic cancer tumor microenvironment,”
Am. J. Pathol., 189
(1), 44
–57 https://doi.org/10.1016/j.ajpath.2018.09.009 AJPAA4 0002-9440
(2019).
Google Scholar
R. Lattouf et al.,
“Picrosirius red staining,”
J. Histochem. Cytochem., 62
(10), 751
–758 https://doi.org/10.1369/0022155414545787 JHCYAS 0022-1554
(2014).
Google Scholar
B. V. Janssen et al.,
“Artificial intelligence-based segmentation of residual tumor in histopathology of pancreatic cancer after neoadjuvant treatment,”
Cancers, 13
(20), 5089 https://doi.org/10.3390/cancers13205089
(2021).
Google Scholar
B. Li et al.,
“Differentiation of pancreatic ductal adenocarcinoma and chronic pancreatitis using graph neural networks on histopathology and collagen fiber features,”
J. Pathol. Inform., 13 100158 https://doi.org/10.1016/j.jpi.2022.100158
(2022).
Google Scholar
A. Keikhosravi et al.,
“Real-time polarization microscopy of fibrillar collagen in histopathology,”
Sci. Rep., 11
(1), 19063 https://doi.org/10.1038/s41598-021-98600-w SRCEC3 2045-2322
(2021).
Google Scholar
A. Keikhosravi et al.,
“Quantification of collagen organization in histopathology samples using liquid crystal based polarization microscopy,”
Biomed. Opt. Express, 8
(9), 4243 https://doi.org/10.1364/boe.8.004243 BOEICL 2156-7085
(2017).
Google Scholar
P. Kirya et al.,
“Leveraging optical anisotropy of the morpho butterfly wing for quantitative, stain-free, and contact-free assessment of biological tissue microstructures,”
(2024). Google Scholar
P. Campagnola,
“Second harmonic generation imaging microscopy: applications to diseases diagnostics,”
Anal. Chem., 83
(9), 3224
–3231 https://doi.org/10.1021/ac1032325 ANCHAM 0003-2700
(2011).
Google Scholar
M. S. Nelson et al.,
“The tumor microenvironment, methods and protocols,”
Methods Mol. Biol., 2614 187
–235 https://doi.org/10.1007/978-1-0716-2914-7_13
(2023).
Google Scholar
P. P. Provenzano et al.,
“Collagen density promotes mammary tumor initiation and progression,”
BMC Med., 6
(1), 11 https://doi.org/10.1186/1741-7015-6-11
(2008).
Google Scholar
M. W. Conklin et al.,
“Aligned collagen is a prognostic signature for survival in human breast carcinoma,”
Am. J. Pathol., 178
(3), 1221
–1232 https://doi.org/10.1016/j.ajpath.2010.11.076 AJPAA4 0002-9440
(2011).
Google Scholar
R. Cicchi et al.,
“Multidimensional non-linear laser imaging of Basal Cell Carcinoma,”
Opt. Express, 15
(16), 10135 https://doi.org/10.1364/oe.15.010135 OPEXFF 1094-4087
(2007).
Google Scholar
S. Y. Xiong, J. G. Yang and J. Zhuang,
“Nonlinear spectral imaging of human normal skin, basal cell carcinoma and squamous cell carcinoma based on two-photon excited fluorescence and second-harmonic generation,”
Laser Phys., 21
(10), 1844 https://doi.org/10.1134/s1054660x11170269 LAPHEJ 1054-660X
(2011).
Google Scholar
A. J. Fleszar et al.,
“The extracellular matrix of ovarian cortical inclusion cysts modulates invasion of fallopian tube epithelial cells,”
APL Bioeng., 2
(3), 031902 https://doi.org/10.1063/1.5022595
(2018).
Google Scholar
Y. Huang and Z. Zhuang,
“Second harmonic microscopic imaging and spectroscopic characterization in prostate pathological tissue,”
Scanning, 36
(3), 334
–337 https://doi.org/10.1002/sca.21116 SCNNDF 0161-0457
(2014).
Google Scholar
C. R. Drifka et al.,
“Periductal stromal collagen topology of pancreatic ductal adenocarcinoma differs from that of normal and chronic pancreatitis,”
Mod. Pathol., 28
(11), 1470
–1480 https://doi.org/10.1038/modpathol.2015.97 MODPEO 0893-3952
(2015).
Google Scholar
Y. Yao, A. J. Hoffman and C. F. Gmachl,
“Mid-infrared quantum cascade lasers,”
Nat. Photonics, 6
(7), 432
–439 https://doi.org/10.1038/nphoton.2012.143 NPAHBY 1749-4885
(2012).
Google Scholar
A. Schwaighofer et al.,
“Beyond Fourier transform infrared spectroscopy: external cavity quantum cascade laser-based mid-infrared transmission spectroscopy of proteins in the amide I and amide II region,”
Anal. Chem., 90
(11), 7072
–7079 https://doi.org/10.1021/acs.analchem.8b01632
(2018).
Google Scholar
M. Schnell et al.,
“All-digital histopathology by infrared-optical hybrid microscopy,”
Proc. Natl. Acad. Sci. U. S. A., 117
(7), 3388
–3396 https://doi.org/10.1073/pnas.1912400117
(2020).
Google Scholar
M. J. Pilling, A. Henderson and P. Gardner,
“Quantum cascade laser spectral histopathology: breast cancer diagnostics using high throughput chemical imaging,”
Anal. Chem., 89
(14), 7348
–7355 https://doi.org/10.1021/acs.analchem.7b00426 ANCHAM 0003-2700
(2017).
Google Scholar
K. Belbachir et al.,
“Collagen types analysis and differentiation by FTIR spectroscopy,”
Anal. Bioanal. Chem., 395
(3), 829
–837 https://doi.org/10.1007/s00216-009-3019-y ABCNBP 1618-2642
(2009).
Google Scholar
A. Hanifi et al.,
“Fourier transform infrared imaging and infrared fiber optic probe spectroscopy identify collagen type in connective tissues,”
PLoS ONE, 8
(5), e64822 https://doi.org/10.1371/journal.pone.0064822 POLNCL 1932-6203
(2013).
Google Scholar
B. Bird and J. Rowlette,
“A protocol for rapid, label-free histochemical imaging of fibrotic liver,”
Analyst, 142
(8), 1179
–1184 https://doi.org/10.1039/c6an02080a ANLYAG 0365-4885
(2016).
Google Scholar
E. Zimmermann et al.,
“Detection and quantification of myocardial fibrosis using stain-free infrared spectroscopic imaging,”
Arch. Pathol. Lab. Med., 145
(12), 1526
–1535 https://doi.org/10.5858/arpa.2020-0635-oa APLMAS 0003-9985
(2021).
Google Scholar
R. Mankar et al.,
“Polarization sensitive photothermal mid-infrared spectroscopic imaging of human bone marrow tissue,”
Appl. Spectrosc., 76
(4), 508
–518 https://doi.org/10.1177/00037028211063513 APSPA4 0003-7028
(2021).
Google Scholar
S. Tiwari et al.,
“INFORM: infrared-based organizational measurements of tumor and its microenvironment to predict patient survival,”
Sci. Adv., 7
(6), eabb8292 https://doi.org/10.1126/sciadv.abb8292 STAMCV 1468-6996
(2021).
Google Scholar
S. Tiwari et al.,
“Infrared spectroscopic imaging visualizes a prognostic extracellular matrix-related signature in breast cancer,”
Sci. Rep., 10
(1), 5442 https://doi.org/10.1038/s41598-020-62403-2 SRCEC3 2045-2322
(2020).
Google Scholar
D. C. LeBert et al.,
“Second harmonic generation microscopy in zebrafish,”
Methods Cell Biol., 133 55
–68 https://doi.org/10.1016/bs.mcb.2016.01.005 MCBLAG 0091-679X
(2016).
Google Scholar
S. Preibisch, S. Saalfeld and P. Tomancak,
“Globally optimal stitching of tiled 3D microscopic image acquisitions,”
Bioinformatics, 25
(11), 1463
–1465 https://doi.org/10.1093/bioinformatics/btp184 BOINFP 1367-4803
(2009).
Google Scholar
M. J. Baker et al.,
“Using Fourier transform IR spectroscopy to analyze biological materials,”
Nat. Protoc., 9
(8), 1771
–1791 https://doi.org/10.1038/nprot.2014.110 1754-2189
(2014).
Google Scholar
“Find image rotation and scale—MATLAB and simulink example,”
https://www.mathworks.com/help/images/find-image-rotation-and-scale.html
().
Google Scholar
“Control point selection tool—MATLAB cpselect,”
https://www.mathworks.com/help/images/ref/cpselect.html
().
Google Scholar
“Fit 2-D geometric transformation to control point pairs—MATLAB fitgeotform2d,”
https://www.mathworks.com/help/images/ref/fitgeotform2d.html
().
Google Scholar
“Apply geometric transformation to image—MATLAB imwarp,”
https://www.mathworks.com/help/images/ref/imwarp.html
().
Google Scholar
“Ensemble of bagged decision trees—MATLAB,”
https://www.mathworks.com/help/stats/treebagger.html
().
Google Scholar
L. Breiman,
“Random forests,”
Mach. Learn., 45
(1), 5
–32 https://doi.org/10.1023/a:1010933404324 MALEEZ 0885-6125
(2001).
Google Scholar
“Out-of-bag error—MATLAB,”
https://www.mathworks.com/help/stats/treebagger.ooberror.html
().
Google Scholar
G. Csurka, D. Larlus and F. Perronnin,
“What is a good evaluation measure for semantic segmentation?,”
in British Machine Vision Conf.,
(2013). Google Scholar
“Contour matching score for image segmentation—MATLAB bfscore,”
https://www.mathworks.com/help/images/ref/bfscore.html
().
Google Scholar
Z. Püspöki et al.,
“Focus on bio-image informatics,”
Adv. Anat. Embryol. Cell Biol., 219 69
–93 https://doi.org/10.1007/978-3-319-28549-8_3 AAEBDS 0301-5556
(2016).
Google Scholar
“Correlation coefficients—MATLAB corrcoef,”
https://www.mathworks.com/help/matlab/ref/corrcoef.html
().
Google Scholar
L. Landolt et al.,
“Fibrosis and cancer: shared features and mechanisms suggest common targeted therapeutic approaches,”
Nephrol. Dial. Transplant., 37
(6), 1024
–1032 https://doi.org/10.1093/ndt/gfaa301 NDTREA
(2020).
Google Scholar
C. Chandler et al.,
“The double edge sword of fibrosis in cancer,”
Transl. Res., 209 55
–67 https://doi.org/10.1016/j.trsl.2019.02.006
(2019).
Google Scholar
M. Yamauchi et al.,
“The fibrotic tumor stroma,”
J. Clin. Investig., 128
(1), 16
–25 https://doi.org/10.1172/jci93554
(2018).
Google Scholar
E. P. Paschalis et al.,
“Spectroscopic characterization of collagen cross‐links in bone,”
J. Bone Miner. Res., 16
(10), 1821
–1828 https://doi.org/10.1359/jbmr.2001.16.10.1821 JBMREJ 0884-0431
(2001).
Google Scholar
B. de C. Vidal and M. L. S. Mello,
“Collagen type I amide I band infrared spectroscopy,”
Micron, 42
(3), 283
–289 https://doi.org/10.1016/j.micron.2010.09.010 MICNB2 0047-7206
(2011).
Google Scholar
C. Chadefaux et al.,
“Curve-fitting micro-ATR-FTIR studies of the amide I and II bands of type I collagen in archaeological bone materials,”
E-Preserv. Sci., 6 129
–137
(2009).
Google Scholar
K. J. Payne and A. Veis,
“Fourier transform IR spectroscopy of collagen and gelatin solutions: deconvolution of the amide I band for conformational studies,”
Biopolymers, 27
(11), 1749
–1760 https://doi.org/10.1002/bip.360271105 BIPMAA 0006-3525
(1988).
Google Scholar
T. Riaz et al.,
“FTIR analysis of natural and synthetic collagen,”
Appl. Spectrosc. Rev., 53
(9), 703
–746 https://doi.org/10.1080/05704928.2018.1426595 APSRBB 0570-4928
(2018).
Google Scholar
A. Mieczkowska and G. Mabilleau,
“Validation of Fourier transform infrared microspectroscopy for the evaluation of enzymatic cross-linking of bone collagen,”
Calcif. Tissue Int., 113
(3), 344
–353 https://doi.org/10.1007/s00223-023-01105-z CTINDZ 0171-967X
(2023).
Google Scholar
C. Petibois et al.,
“Analysis of type I and IV collagens by FT-IR spectroscopy and imaging for a molecular investigation of skeletal muscle connective tissue,”
Anal. Bioanal. Chem., 386
(7–8), 1961
–1966 https://doi.org/10.1007/s00216-006-0828-0 ABCNBP 1618-2642
(2006).
Google Scholar
F. N. Schmidt et al.,
“Assessment of collagen quality associated with non-enzymatic cross-links in human bone using Fourier-transform infrared imaging,”
Bone, 97 243
–251 https://doi.org/10.1016/j.bone.2017.01.015
(2017).
Google Scholar
J. K. Rainey and M. C. Goh,
“A statistically derived parameterization for the collagen triple‐helix,”
Protein Sci., 11
(11), 2748
–2754 https://doi.org/10.1110/ps.0218502 PRCIEI 0961-8368
(2002).
Google Scholar
Z. Movasaghi, S. Rehman and Dr. I. ur Rehman,
“Fourier transform infrared (FTIR) spectroscopy of biological tissues,”
Appl. Spectrosc. Rev., 43
(2), 134
–179 https://doi.org/10.1080/05704920701829043 APSRBB 0570-4928
(2008).
Google Scholar
J. Nanchahal and B. Hinz,
“Strategies to overcome the hurdles to treat fibrosis, a major unmet clinical need,”
Proc. Natl. Acad. Sci. U. S. A., 113
(27), 7291
–7293 https://doi.org/10.1073/pnas.1607896113
(2016).
Google Scholar
N. C. Henderson, F. Rieder and T. A. Wynn,
“Fibrosis: from mechanisms to medicines,”
Nature, 587
(7835), 555
–566 https://doi.org/10.1038/s41586-020-2938-9
(2020).
Google Scholar
T. R. Cox and J. T. Erler,
“Fibrosis and cancer: partners in crime or opposing forces?,”
Trends Cancer, 2
(6), 279
–282 https://doi.org/10.1016/j.trecan.2016.05.004
(2016).
Google Scholar
D. S. James et al.,
“Probing ECM remodeling in idiopathic pulmonary fibrosis via second harmonic generation microscopy analysis of macro/supramolecular collagen structure,”
J. Biomed. Opt., 25
(1), 014505 https://doi.org/10.1117/1.jbo.25.1.014505 JBOPFO 1083-3668
(2020).
Google Scholar
Y. Bai et al.,
“Ultrafast chemical imaging by widefield photothermal sensing of infrared absorption,”
Sci. Adv., 5
(7), eaav7127 https://doi.org/10.1126/sciadv.aav7127 STAMCV 1468-6996
(2019).
Google Scholar
D. Zhang et al.,
“Depth-resolved mid-infrared photothermal imaging of living cells and organisms with submicrometer spatial resolution,”
Sci. Adv., 2
(9), e1600521 https://doi.org/10.1126/sciadv.1600521 STAMCV 1468-6996
(2016).
Google Scholar
K. Yeh et al.,
“Infrared spectroscopic laser scanning confocal microscopy for whole-slide chemical imaging,”
Nat. Commun., 14
(1), 5215 https://doi.org/10.1038/s41467-023-40740-w NCAOBW 2041-1723
(2023).
Google Scholar
S. Rosas et al.,
“Metasurface‐enhanced mid‐infrared spectrochemical imaging of tissues,”
Adv. Mater., 35
(28), e2301208 https://doi.org/10.1002/adma.202301208 ADVMEW 0935-9648
(2023).
Google Scholar
|
Collagen
Second harmonic generation
Tissues
Education and training
Mid infrared
Biological samples
Machine learning