Figures
Abstract
Background
Idiopathic pulmonary fibrosis (IPF) is a chronic progressive fibrotic lung disease associated with substantial morbidity and mortality. The objective of this study was to determine whether there is a peripheral blood protein signature in IPF and whether components of this signature may serve as biomarkers for disease presence and progression.
Methods and Findings
We analyzed the concentrations of 49 proteins in the plasma of 74 patients with IPF and in the plasma of 53 control individuals. We identified a combinatorial signature of five proteins—MMP7, MMP1, MMP8, IGFBP1, and TNFRSF1A—that was sufficient to distinguish patients from controls with a sensitivity of 98.6% (95% confidence interval [CI] 92.7%–100%) and specificity of 98.1% (95% CI 89.9%–100%). Increases in MMP1 and MMP7 were also observed in lung tissue and bronchoalveolar lavage fluid obtained from IPF patients. MMP7 and MMP1 plasma concentrations were not increased in patients with chronic obstructive pulmonary disease or sarcoidosis and distinguished IPF compared to subacute/chronic hypersensitivity pneumonitis, a disease that may mimic IPF, with a sensitivity of 96.3% (95% CI 81.0%–100%) and specificity of 87.2% (95% CI 72.6%–95.7%). We verified our results in an independent validation cohort composed of patients with IPF, familial pulmonary fibrosis, subclinical interstitial lung disease (ILD), as well as with control individuals. MMP7 and MMP1 concentrations were significantly higher in IPF patients compared to controls in this cohort. Furthermore, MMP7 concentrations were elevated in patients with subclinical ILD and negatively correlated with percent predicted forced vital capacity (FVC%) and percent predicted carbon monoxide diffusing capacity (DLCO%).
Conclusions
Our experiments provide the first evidence for a peripheral blood protein signature in IPF to our knowledge. The two main components of this signature, MMP7 and MMP1, are overexpressed in the lung microenvironment and distinguish IPF from other chronic lung diseases. Additionally, increased MMP7 concentration may be indicative of asymptomatic ILD and reflect disease progression.
Citation: Rosas IO, Richards TJ, Konishi K, Zhang Y, Gibson K, Lokshin AE, et al. (2008) MMP1 and MMP7 as Potential Peripheral Blood Biomarkers in Idiopathic Pulmonary Fibrosis. PLoS Med 5(4): e93. https://doi.org/10.1371/journal.pmed.0050093
Academic Editor: Peter Barnes, National Heart and Lung Institute, United Kingdom
Received: July 26, 2007; Accepted: March 13, 2008; Published: April 29, 2008
This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
Funding: NK, KG, TJR, YZ, JD, KOL, and MS were supported by National Institute of Health (NIH) grants HL073745, HL0793941, HL0894932, and a generous donation from the Simmons family. IOR, SDM, and BRG were supported by the Division of Intramural Research of the National Heart, Lung, and Blood Institute (NHLBI). MS and AP were supported by Universidad Nacional Autónoma de México Grant SDI.PTID.05.6. The funding institutions have not been involved in study design, data collection, interpretation, or preparation of this manuscript.
Competing interests: NK is a recipient of investigator initiated research grants from Biogen Idec and from Centocor for genomic and proteomic biomarker discovery and validation. Data presented in this paper were not funded by any of these grants. FS, as a consultant, has received less than $10,000 from GlaxoSmithKline (GSK) and Astra-Zeneca. All other authors declared no competing interests.
Abbreviations: BAL, bronchoalveolar lavage; CART, classification and regression tree; CI, confidence interval; COPD, chronic obstructive pulmonary disease; DLCO%, percent predicted carbon monoxide diffusing capacity; FVC%, percent predicted forced vital capacity; HP, hypersensitivity pneumonitis; HRCT, high-resolution computed tomography; ILD, interstitial lung disease; IPF, idiopathic pulmonary fibrosis; MMP, matrix metalloprotease; NSIP, nonspecific interstitial pneumonia; ROC, receiver operating characteristic curve; SAM, significance analysis of microarrays
Editors' Summary
Background.
Idiopathic pulmonary fibrosis (IPF) is a serious disease in which the lungs become progressively scarred or thickened for unknown reasons. In healthy people, air is taken in through the mouth or nose and travels down the windpipe into tubes in the lungs called the airways. Each airway has many small branches that end in alveoli, tiny air sacs with thin walls that are surrounded by small blood vessels called capillaries. When air reaches the alveoli, the oxygen in it passes into the bloodstream and is taken to the organs of the body to keep them working. In IPF, the alveoli and the space around them (the “interstitial” area) gradually become scarred and thickened, which stops oxygen's movement into the bloodstream. When only small areas of the lung are scarred, IPF may cause no symptoms. But, as more of the lung becomes damaged, IPF eventually causes breathlessness, even when resting. There is no effective treatment for IPF, although steroids and drugs that suppress the body's immune system are often tried in an attempt to slow its progression. On average, half of the people with IPF die within three years of diagnosis, often from respiratory or heart failure.
Why Was This Study Done?
It can be difficult to diagnose IPF—there are many lung diseases with similar symptoms, including numerous other interstitial lung diseases—and currently, physicians can only follow the progression of IPF by repeatedly testing their patients' lung function or by doing multiple chest X-rays. If proteins could be identified whose level in blood indicated disease activity (so-called “peripheral blood biomarkers”), it would be easier to diagnose and monitor patients. In addition, the identification of such biomarkers might suggest new drug targets for the treatment of IPF. In this study, the researchers look for peripheral blood biomarkers in IPF by using a “multiplex analysis” system to measure the level of several proteins in patient blood samples simultaneously.
What Did the Researchers Do and Find?
The researchers measured the levels of 49 plasma proteins (plasma is the fluid part of blood) in 74 patients with IPF and 53 healthy people (controls) and used a technique called “recursive partitioning” to define a five-protein signature that distinguished patients from unaffected study participants (controls). Matrix metalloproteinase 7 (MMP7) and MMP1—the two plasma proteins whose levels were most increased in patients with IPF compared to controls—were key components of this signature. Concentrations of MMP7 and MMP1 were higher in bronchoalveolar lavage samples (fluid obtained by washing out the lungs with saline) and in lung tissue samples from patients with IPF than in similar samples taken from healthy individuals. Plasma concentrations of MMP7 and MMP1 were significantly higher in patients with IPF than in patients with hypersensitivity pneumonitis, an interstitial lung disease that mimics IPF, but not increased in patients with chronic obstructive pulmonary disease or sarcoidosis, two other lung diseases. In an independent validation group, patients with IPF and familial pulmonary fibrosis had increased plasma concentrations of MMP7 and MMP1 that correlated with the severity of their disease. In addition, MMP7 concentrations were raised in close relatives of people with familial pulmonary fibrosis who had normal lung function tests but some lung scarring.
What Do These Findings Mean?
These findings provide evidence for a protein signature in the blood for IPF and suggest MMP1 and MMP7 may be useful as biomarkers for IPF. These two matrix metalloproteinases have previously been suggested to be involved in the development of IPF. However, additional work is probably needed to confirm that increased plasma concentrations MMP7 and MMP1 are specific for IPF, since it may be that these markers will not distinguish IPF from other interstitial lung diseases.
Additional Information.
Please access these Web sites via the online version of this summary at http://dx.doi.org/10.1371/journal.pmed.0050093.
- Read a related PLoS Medicine Perspective article
- The MedlinePlus Encyclopedia has a page on idiopathic pulmonary fibrosis (in English and Spanish) and on pulmonary fibrosis
- The US National Heart Lung and Blood Institute and the British Lung Foundation also provide information on IPF for patients and relatives
- Some of the researchers involved in this study provide more details about what might go wrong in IPF in a recent PLoS Medicine article
Introduction
Idiopathic pulmonary fibrosis (IPF), a progressive fibrotic interstitial lung disease (ILD) with median survival of 2.5–3 y, is largely unaffected by currently available medical therapies [1]. The disease is characterized by alveolar epithelial cell injury and activation, fibroblast/myofibroblast foci formation, and exaggerated accumulation of extracellular matrix in the lung parenchyma. Recent studies employing high-throughput genomic technologies to analyze samples from IPF patients or genetically modified animals have highlighted the complexity of the pathways involved in the disease (reviewed in [2–4]). While these studies have improved the understanding of the molecular mechanisms underlying lung fibrosis, they did not translate well into the clinical arena.
Identification of peripheral blood biomarkers may facilitate the diagnosis and follow-up of patients with IPF as well as the implementation of new therapeutic interventions. Currently, establishing a diagnosis of IPF may require surgical lung biopsy in patients with atypical clinical presentations or high-resolution computed tomography (HRCT) scans. Patients with IPF are often evaluated by serial pulmonary physiology measurements and repeated radiographic examinations. These studies provide a general assessment of the extent of disease, but do not provide information about disease activity on a molecular level. Higher serum concentrations of surfactant proteins [5], KL-6 [6], FASL [7], CCL-2 [8], α-defensins [9], and most recently SPP1 [10] have been reported in patients with IPF and other ILDs, but most of these studies were modest in size and assayed only a single or a few protein markers simultaneously.
In this study, we used a multianalyte protein assay system to simultaneously measure concentrations of 49 plasma proteins, including cytokines, chemokines, growth and angiogenic factors, matrix metalloproteases (MMPs), and markers of apoptosis in a derivation cohort comprised of IPF patients and healthy controls. We identified a combinatorial signature of five proteins; of these, we measured concentrations of two metalloproteases, MMP7 and MMP1, in other chronic lung diseases and compared them to the levels observed in IPF patients. Finally, the potential role of MMP7 and MMP1 as IPF peripheral blood biomarkers was tested in an independent validation cohort.
Methods
For detailed description of the methods used in this study, see Text S1.
Initial IPF Derivation Cohort
This study included 74 patients with IPF evaluated at the University of Pittsburgh Medical Center. The diagnosis of IPF was established on the basis of published criteria [11], and surgical lung biopsy when clinically indicated [12] (see Text S1). Clinical data were available through the Simmons Center database. Smoking status was defined as previously described [13]. Fifty-three control individuals were obtained from the pulmonary division sample collection core. Baseline demographic information is detailed in Table 1. The mean percent predicted forced vital capacity (FVC%) of IPF patients was 61.9 ± 20.8; mean percent predicted carbon monoxide diffusing capacity (DLCO%) was 42.1 ± 17.4.
Derivation Cohort Patient Characteristics
Chronic Obstructive Pulmonary Disease
Plasma samples from 73 patients with chronic obstructive pulmonary disease (COPD) evaluated at the University of Pittsburgh were available for this study. Individuals were clinically stable at the time of examination, had tobacco exposure of at least ten pack years, and had no clinical diagnosis of rheumatologic, infectious, or other systemic inflammatory disease. Disease severity was measured using the GOLD classification as previously described [14]. The COPD cohort included 13 patients with GOLD class 0–I, 21 patients with GOLD II, and 39 patients with GOLD III–IV.
Sarcoidosis
Plasma samples from 47 patients with sarcoidosis evaluated at the University of Pittsburgh Medical Center were tested. Patients with lung disease (n = 29) demonstrated an average FVC% of 76.7 ± 22.1, and average DLCO% of 72.9 ± 25.5. The diagnosis and staging of disease was determined according to American Thoracic Society and European Respiratory Society criteria, as previously described [15,16].
Hypersensitivity Pneumonitis
Serum samples from 41 patients with subacute/chronic hypersensitivity pneumonitis (HP) and 34 patients with IPF evaluated at Instituto Nacional de Enfermedades Respiratorias in Mexico were available for this study. Diagnosis of IPF and HP has been previously described for this cohort [17,18]. Briefly, HP patients showed the following features: (a) antecedent bird exposure and positive serum antibodies against avian antigens; (b) clinical and functional features of ILD; (c) HRCT showing diffuse centrilobular poorly defined micronodules, ground glass attenuation, focal air trapping, and mild to moderate fibrotic changes; and (d) greater than 35% lymphocytes in bronchoalveolar lavage (BAL) fluid. Forty-four percent of the patients had a surgical lung biopsy; in all cases lung histology was consistent with the diagnosis of HP. The average FVC% was 60.3 ± 15.3 for HP and 59.1 ± 17.2 for IPF patients.
Independent Validation Cohort
Serum samples from 20 control individuals, eight patients with subclinical idiopathic ILD, 16 patients with familial pulmonary fibrosis, and nine with sporadic IPF, evaluated at the Warren Grant Magnuson Clinical Center of the National Institutes of Health (NIH), were available for this study. Patients with subclinical disease were first-degree relatives of patients with familial pulmonary fibrosis; they were asymptomatic, with normal pulmonary function tests but HRCT findings consistent with early ILD. Familial pulmonary fibrosis was defined as previously described [19]. Normal volunteers were used as controls.
These cohorts have been previously described by us [20,21]. Briefly, the mean FVC% values for patients with sporadic IPF and familial pulmonary fibrosis were 59.4 ± 19.7 and 75.7 ± 16.7, respectively. Eight patients with familial pulmonary fibrosis were diagnosed with early asymptomatic ILD using HRCT [21]; the mean FVC% in this group was 101.3 ± 10.1. Gender, age, ethnic origin, and smoking status for all groups are presented in Table 2.
Validation Cohort Patient Characteristics
Lung tissue samples for microarray analysis were obtained through the University of Pittsburgh Health Sciences Tissue Bank as we previously described [22]. Twenty-three samples were obtained from surgical remnants of biopsies or lungs explanted from patients with IPF who underwent pulmonary transplant and 14 control normal lung tissues obtained from the disease free margins with normal histology of lung cancer resection specimens. The morphologic diagnosis of IPF was based on typical microscopic findings consistent with usual interstitial pneumonia [12,23]. All patients fulfilled the diagnostic criteria for IPF outlined by the American Thoracic Society and European Respiratory Society [11].
All studies were approved by the Institutional Review Board at the University of Pittsburgh, the National Heart, Lung, and Blood Institute, or the National Institute of Respiratory Diseases, Mexico. Informed consent was obtained from all patients.
Blood Samples
Blood (45 ml) was drawn from participants using standardized phlebotomy procedures. Plasma or serum was separated by centrifugation, and all specimens were immediately aliquoted and frozen.
BAL
BAL was performed through flexible fiberoptic bronchoscopy as part of the diagnostic process, as we previously described [18,22,24]. Supernatants were kept at −70 °C until use. BAL samples from 22 IPF patients (age 62.2 ± 7.2 y) and ten normal controls (age 41.5 ± 5 y) were available for this study.
Multiplex Analysis
Assays were performed using Luminex xMAP technology (Luminex Corporation) in 96-well microplate format according to appropriate manufacturers' protocols (Invitrogen and R&D Systems), as previously described [25] and in Text S1.
Bead-Based Immunoassays
A 34-plex assay was performed for IL1A, IL1RA, IL1B, IL2, IL2R, IL4, IL5, IL6, IL7, IL8, IL10, IL12B, IL13, IL15, IL17, TNFA, IFNA, IFNG, GMCSF, EGF, VEGF, GCSF, FGF2, HGF, CXCL9, CXCL10, CCL2, CCL3, CCL4, CCL5, CCL11, TNFRS1A, TNFRS1B, and TRAIL-R2 (Invitrogen). MMP assays included MMP1, MMP2, MMP3, MMP7, MMP8, MMP9, MMP12, and MMP13 (R&D Systems).
Assays for FAS, EGFR, FASL, Cyfra 21–1 (CKRT19 fragment), IGFBP1, and KLK10 were developed in our Pittsburgh Luminex Core Facility. The assays were validated as described [25].
ELISA
Quantitative sandwich enzyme immunoassay for human MMP1, MMP7, and AGER was performed as recommended by the manufacturer (R&D Systems).
Oligonucleotide Microarray Experiments
Detailed information is provided in Text S1. Briefly, total RNA was used as a template for synthesis of cDNA as recommended by the manufacturer of the arrays (Agilent Technologies). The cDNA was used as a template to generate Cy3-labeled cRNA that was used for hybridization on Agilent Whole Human Genome 4X 44K multipack arrays (Agilent Technologies). After hybridization, scanning, and feature extraction, data files were imported into a microarray database and linked with updated gene annotations using SOURCE (http://genome-www5.stanford.edu/cgi-bin/SMD/source/sourceSearch) and then normalized using cyclic LOESS [26]. Differentially expressed genes were identified using significant analysis of microarrays (SAM) [27]. Probes corresponding to the 49 protein markers were identified through their gene symbols. Expression levels for the probes that corresponded to these markers were extracted. In the case of redundant probes, those with the highest expression level and with the lowest Q-value were selected for presentation.
Statistical Analysis
A protein was considered differentially expressed when there was a change of at least 25% in concentration and statistical significance at p < 0.05 corrected for multiple testing. Data are reported as mean ± standard deviation. The Wilcoxon rank-sum test was used to identify potential biomarkers that univariately distinguish IPF samples from controls. For multiple testing the Bonferroni method was used to control the family-wise error rate at 5%. Data were analyzed using the R language for statistical computing (http://www.r-project.org/) [28]. Classification and regression trees (CART) methodology was used to identify potential combinations of peripheral blood biomarkers that could be used to distinguish IPF from controls. CART was performed using the rpart package for recursive partitioning. Classification performance was assessed using the ROCR package (http://rocr.bioinf.mpi-sb.mpg.de/). For oligonucleotide array data analysis, we applied SAM [27]. Data visualization and clustering were performed using Genomica (http://genomica.weizmann.ac.il/index.html) [29] and Spotfire Decision Site 9 (TIBCO).
Results
Plasma Proteins Distinguish IPF Patients from Controls in Derivation Cohort
Of 49 markers analyzed, 48 are detectable in plasma (Figure 1A); univariate analysis identified 12 proteins that are differentially expressed in IPF compared to controls (Table 3). Five MMPs (MMP7, MMP1, MMP3, MMP8, MMP9), two chemokines (CXCL10, CCL11), FAS, IL12B, and the soluble TNF receptors (TNFRSF1A, TNFRSF1B) are significantly overexpressed; AGER is significantly underexpressed in plasma of patients with IPF compared to controls. MMP7 and MMP1, which have previously been shown to play a role in IPF pathogenesis, are the top-ranked proteins in univariate analysis (Table 3). Significant differences persist when age, gender, or smoking status is statistically controlled.
(A) Heatmap of proteins measured in the plasma of IPF and control patients. Columns, individual patients; rows, proteins. Every protein level was divided by the geometric mean of values for the same proteins for all patients and log based 2 transformed. Increasing shades of yellow, increased; increased shades of purple, decreased; gray, unchanged. Proteins were clustered using Genomica. Red vertical line, cluster of proteins increased in IPF; green vertical line, cluster of proteins decreased in IPF.
(B) Classification tree obtained by CART applied to plasma protein concentration data from IPF patients and controls. A blue box identifies a terminal node as control; a red box as IPF. All counts are listed as control/IPF. Concentrations are in ng/ml. In the subgroup with high MMP7 concentration but low MMP1 concentration (14 IPF samples, five control samples), splitting on IGFBP1 and TNFRSF1A improves classification, while in the subgroup with low MMP7, MMP8 improves classification.
(C) ROC curves for using each of five markers, or their combination, to classify samples as IPF or control. Sensitivity, or true positive rate, is plotted on the y-axis, and false positive rate, or 1 − specificity, on the x-axis. The area under each ROC curve is equivalent to the numerator of the Mann-Whitney U-statistic comparing the marker distributions between IPF and control samples. The magenta line labeled “Combined” is for the combinatorial classifier using all five markers. The identity line at 45 ° represents a marker that performed no better than classifying samples as IPF or control by flipping a fair coin.
Plasma Proteins That Distinguish IPF from Controls
To determine whether combinations of these plasma proteins correctly classify IPF patients, we applied recursive partitioning to the entire set of 49 markers and found that plasma protein profiles clearly distinguish IPF patients from normal controls. CART analysis showed that MMP7 and MMP1, in addition to being the two most significant biomarkers, are key components of a combinatorial classifier that also includes MMP8, IGFBP-1, and TNFRS1A (Figure 1B). Sensitivity and specificity of the classifier are 98.6% (95% confidence interval [CI] 92.7%–100%) and 98.1% (95% CI 89.9%–100%), respectively. High concentrations of MMP7 alone (≥1.99 ng/ml) correctly classify 69 of 74 IPF patients (93.2%) but incorrectly classify five normal samples as IPF and five IPF samples as controls, whereas the combination of high plasma concentrations of both MMP7 (≥1.99 ng/ml) and MMP1 (≥2.15 ng/ml) excludes all controls. Thus the combination of high MMP7 and high MMP1 concentrations can distinguish IPF patients from controls. Receiver operating characteristic curves (ROCs) (Figure 1C) confirm that MMP7 is the best univariate classifier, although the combination of five markers performs somewhat better (Figure 1C), as does the combination of MMP7 and MMP1 (unpublished data).
MMP7 and MMP1 Are Increased in the Lung and BAL Fluid of Patients with IPF
To determine whether protein concentration differences in peripheral blood reflect gene expression differences present in the lung, we analyzed gene expression patterns in 23 IPF and 14 control lungs using oligonucleotide microarrays (Figure 2A). Of the five plasma proteins in the CART plasma signature (Figure 1B), only the genes for MMP7 and MMP1 are significantly overexpressed in IPF lungs compared to controls (SAM Q value = 0 for both genes; 7.3- and 15.7-fold increase, respectively). Of the ten other proteins that are significantly different in the plasma of patients with IPF (Table 3), the genes for MMP3, AGER, and IL12B are also significantly differentially expressed in IPF lungs (Figure 2A).
(A) Average gene expression levels (log scale) measured using gene expression microarrays of genes that encode the 49 protein markers in IPF lungs (y-axis) compared to control lungs (x-axis). Colored squares (black or red) are genes that encode proteins that changed significantly in plasma. Red squares are genes that changed significantly (SAM Q value <5%) in gene expression data and that encode proteins measured in peripheral blood. Green oblique lines denote 2-fold change.
(B and C) MMP7 (B) and MMP1 (C) concentrations (ng/ml) are significantly (p < 0.00001 and p = 0.018, respectively) higher in BAL fluid of patients with IPF (n = 22) compared to control individuals (n = 10).
To determine whether MMP7 and MMP1 proteins are secreted into the alveolar microenvironment, we measured their concentrations in BAL obtained from 22 patients with IPF and ten control individuals. MMP7 and MMP1 BAL concentrations are significantly higher in IPF patients when compared to controls (p < 0.00001 and p = 0.018, respectively) (Figure 2B and 2C). Hence, elevated MMP7 and MMP1 levels in the lung microenvironment are the most likely source for their increased concentrations in peripheral blood.
MMP7 and MMP1 Are Not Increased in Patients with COPD or Sarcoidosis
To determine whether concentrations of MMP7 and MMP1 are increased in other common chronic lung diseases, we measured plasma concentrations in patients affected with sarcoidosis or COPD. The 47 sarcoidosis patients were stratified into those with evidence for parenchymal lung disease (stage 2 or greater; n = 29) and those with no lung parenchymal involvement (n = 18). As shown in Figure 3, there are no significant differences in plasma concentrations of MMP7 (p = 0.78) (Figure 3A) or MMP1 (p = 0.27) (Figure 3B) between the sarcoidosis groups with or without lung abnormalities when compared to controls. COPD participants were grouped by GOLD class, into 0–I (n = 13), II (n = 21), and III–IV (n = 39). No significant differences are found in plasma concentrations of MMP7 (p = 0.21) or MMP1 (p = 0.85) between groups of COPD patients stratified by GOLD class (Figure 3A and 3B, respectively).
Concentrations (ng/ml) of MMP7 (A) and MMP1 (B) are significantly higher in patients with IPF (n = 74; p < 0.00001 and p = 0.018, respectively), compared to controls (n = 53), but not sarcoidosis (n = 47; p = 0.78 and p = 0.28, respectively), compared to controls (n = 53) or COPD (n = 73; p = 0.21 and 0.85, respectively, stratified by GOLD class, as 0–I, II, and III–IV).
MMP7 and MMP1 Are Significantly Higher in the Serum of Patients with IPF Compared to Patients with HP
To determine whether peripheral blood concentrations of MMP7 and MMP1 distinguish IPF from other common forms of ILD, we measured their levels in 41 patients with HP and 34 patients with IPF. Univariately, serum concentrations of MMP7 (p = 0.01) and MMP1 (p < 0.001) are significantly higher in IPF compared to HP; fold changes for MMP1 and MMP7 are 2.3 and 1.31, respectively (Figure 4A and 4B).
(A and B) Concentrations (ng/ml) of MMP7 (A) and MMP1 (B) in the blood are significantly higher in patients with IPF (n = 34) than in patients with HP (n = 41).
(C) Average gene expression levels (log scale) in IPF samples (y-axis) compared to HP (x-axis) measured by gene expression microarrays. Gray circles, all genes on the array; red circles, MMP1 and MMP7. Green oblique lines denote 2-fold change.
(D) Combinations of serum MMP7 (y-axis) and MMP1 concentrations (x-axis) in IPF (closed circles) and HP patients (open circles). Corners represent points in which the trade-off between positive predictive value (PPV) and negative predictive value (NPV) are optimal for ruling out IPF (blue) or concluding IPF (red) on the basis of MMP1 and MMP7 concentrations.
(E) ROC curves for using MMP1 or MMP7, or their combination, to classify samples as IPF or HP. Sensitivity, or true positive rate, is plotted on the y-axis, and false positive rate, or 1 − specificity, on the x-axis. The identity line at 45 ° represents a marker that performed no better than classifying samples as IPF or HP by flipping a fair coin.
Similar results are observed in a reanalysis of a previously published DNA microarray dataset comparing gene expression in lung tissue obtained from IPF and HP patients [18]. In this reanalysis, MMP7 and MMP1 levels are significantly higher in IPF compared to HP (false discovery rate [FDR] < 5%), however, as observed in the peripheral blood, the change in MMP7 levels is moderate when compared to the increase in MMP1 (Figure 4C).
Combinations of serum MMP1 and MMP7 concentrations have positive predictive values for determining that a patient has IPF ranging from 91% (MMP7 > 2.6 ng/ml and MMP1 > 8.9 ng/ml) to 66%, and negative predictive value (ruling out IPF) ranging from 96% (MMP7 < 2.9 ng/ml and MMP1 > 3.5ng/ml) to 70% (Figure 4D). Additionally, the combination of high MMP7 and high MMP1 peripheral blood concentrations distinguish IPF from HP with 96.3% sensitivity (95% CI 81.0%–100%) and 87.2% specificity (95% CI 72.6%–95.7%) (Figure 4E), further supporting that MMP1 in combination with MMP7 distinguishes IPF from HP.
MMP7 and MMP1 Are Significantly Higher in the Serum of an Independent Validation Cohort
To verify our findings, we measured serum concentrations of MMP7 and MMP1 in an independent validation cohort comprised of patients affected with IPF, familial pulmonary fibrosis, or subclinical ILD, and control individuals. This cohort has been recently described by us [21]. Even though concentrations were measured in serum and not plasma, significantly higher concentrations of MMP7 and MMP1 are found in patients with pulmonary fibrosis compared to controls (p < 0.001 and p = 0.01, respectively). Notably, serum MMP7 concentrations in patients with subclinical ILD are significantly higher compared to control individuals (p = 0.019) and significantly lower compared to patients with full-blown IPF (p < 0.0001) (Figure 5A), suggesting that MMP7 may serve as a biomarker for disease progression. There is no significant difference in MMP7 concentrations between patients with familial or sporadic IPF, consistent with the findings of Yang et al. [30].
(A) Dark solid lines show median concentrations in each group. The interquartile range (IR) or middle 50% of concentrations is delimited by a box. Data are expressed on a log base 2 scale.
(B) ROC curves for using MMP1 or MMP7, or their combination, to classify samples as IPF (sporadic or familial) or control in validation cohort.
(C and D) Serum MMP7 concentrations moderately correlate with decreases in FVC% (C) and DLCO% (D). Linear regressions and 95% CI inversely relate MMP7 concentration (ng/ml) to FVC% and DLCO%. *p < 0.05, **p < 0.01, and ***p < 0.001.
In this cohort, elevated MMP1 concentrations combined with high concentrations of MMP7 can distinguish IPF from controls with 89.2% sensitivity (95% CI 71.8%–91.7%) and 95.0% specificity (95% CI 75.1%–99.9%), supporting the findings in our derivation cohort (Figure 5B).
MMP7 Concentrations Correlate Moderately with Disease Severity
To determine whether concentrations of MMP7 or MMP1 correlate with disease severity, we compared pulmonary function measurements with serum concentrations of MMP7 and MMP1 in the validation cohort. We found a significant correlation between higher MMP7 concentrations and disease severity as measured by FVC% (Figure 5C) and DLCO% (Figure 5D). Fitted models predict a decline of 4.1% in DLCO% (p = 0.002, r = −0.53) and 4.0% in FVC% (p = 0.002, r = −0.51) for each increment of 1 ng/ml in serum MMP7. We did not find any statistically significant correlation between MMP1 concentrations and pulmonary function measurements (unpublished data).
Discussion
Overall, our study demonstrates the first evidence for a peripheral blood protein signature in IPF patients to our knowledge. MMP7 and MMP1, two matrix metalloproteases previously implicated in the pathogenesis of IPF [31], are significantly increased in plasma, serum, BAL fluid, and lung tissue of IPF patients, suggesting that increased MMP7 and MMP1 levels in the peripheral blood are indicative of the pathologic changes that characterize the IPF alveolar microenvironment. Used in combination, blood levels of MMP1 and MMP7 can distinguish IPF patients from diverse types of chronic lung disease including HP, a common interstitial pneumonia that can sometimes be indistinguishable from IPF [32–34]. Increases in MMP7 blood concentrations are observed in patients with subclinical familial pulmonary fibrosis, and higher levels of MMP7 are associated with disease severity. Taken together our findings support the use of MMP1 and MMP7 as IPF biomarkers and suggest that their role in diagnosis, early detection, and monitoring of disease progression should be further investigated.
Multiple MMPs are among the 12 proteins significantly increased in the blood of IPF patients. The roles of MMPs have been intensively studied and debated in IPF [35]. While multiple and often contrasting roles have been proposed for MMPs in regulating abnormal epithelial response to injury, fibroblast proliferation, extracellular matrix accumulation, and aberrant tissue remodeling, the consensus is that this family of matrix degrading enzymes is involved in disease pathogenesis [31,36–40]. The two top-ranked proteins in this study are MMPs known to be significantly overexpressed in the activated alveolar epithelium in IPF lungs. MMP1, a matrix metalloprotease that primarily degrades fibrillar collagen, is rarely expressed under normal conditions, but is highly overexpressed in reactive alveolar epithelial cells in IPF lungs [39]. MMP7, a matrix metalloprotease with multiple local inflammatory regulatory roles [41,42], is also highly upregulated in alveolar epithelial cells in IPF [39,43]. Furthermore, MMP7 knockout mice are relatively protected from bleomycin-induced fibrosis [39], suggesting that MMP7 may have a profibrotic effect in IPF. Taken in the above context, our results strongly suggest that activated epithelial cells in IPF lungs are the likely source of elevated peripheral blood concentrations of MMP1 and MMP7, thus supporting their use as biomarkers for disease detection and progression.
Our data show that neither patients with COPD, a chronic progressive lung disease, nor patients with sarcoidosis, a chronic granulomatous ILD, express significantly increased peripheral blood concentrations of MMP7 or MMP1. Further, elevated peripheral blood MMP1 concentrations, in the presence of elevated MMP7 concentrations, distinguish IPF from HP. A similar trend in gene expression of MMP7 and MMP1 is found in the lungs of patients with IPF and HP, further supporting the notion that the changes in peripheral blood concentrations of MMP7 and MMP1 are reflective of the lung gene environment and constitute a disease-specific signal. This finding may be very important clinically, because subacute HP is frequently misdiagnosed as idiopathic nonspecific interstitial pneumonia (NSIP), and in its chronic advanced form HP can be undistinguishable from IPF [32–34]. In fact, recent studies have demonstrated that histopathologic and HRCT abnormalities observed in chronic HP often overlap with those of usual interstitial pneumonia (UIP), representing an important challenge to the differential diagnosis of these conditions [33,34,44]. Thus, the elevated peripheral blood concentrations of MMP7 and MMP1 observed in IPF are not due to a systemic stress response to a chronic lung disease and distinguish COPD, sarcoidosis, and HP from IPF. While we do not advocate at this stage relying solely on peripheral blood concentrations of MMP7 and MMP1 in distinguishing IPF from HP, sarcoidosis, or the less difficult differential diagnosis of COPD, it seems likely that knowing these concentrations will impact clinical decision-making.
We did not compare IPF to other idiopathic interstitial pneumonias such as NSIP. There is nothing in our data to suggest that we can distinguish IPF from these diseases using MMP7 and MMP1 peripheral blood concentrations. In fact the finding of elevated MMP7 in patients with subclinical ILD may be indicative that this increase may be present in other idiopathic ILDs. Furthermore, gene expression patterns were found to be extremely similar in IPF and NSIP [30,45], and BAL MMP7 levels were also recently found to be similar in patients with IPF and NSIP [46]. The major limitation in these studies was the small number of cases with NSIP because of the substantial rarity of isolated NSIP. Therefore our results should encourage the establishment of multicenter collections of peripheral blood samples of patients with ILD with sufficient power to determine whether NSIP and IPF differ in their peripheral blood protein expression.
In comparison to other studies, major attributes of our analysis include the relatively large size of our derivation cohort and the large number of proteins assayed in this cohort of patients with IPF, the comparison of peripheral blood biomarker levels with their gene expression levels in the lungs and BAL, the comparison with multiple relatively large control populations with other chronic lung diseases to establish specificity of our findings, and the verification of our initial results in an independent validation cohort. A unique feature of our validation cohort is that it contains patients with subclinical ILD who are asymptomatic first-degree relatives of patients affected with familial IPF. These individuals have HRCT findings of early ILD, but do not have pulmonary function abnormalities, cough, or dyspnea [19,21]. Analysis of samples from this cohort allowed us to demonstrate that MMP7 concentrations are significantly higher in patients with early subclinical lung disease, suggesting that MMP7 may be a marker for early asymptomatic ILD. Peripheral blood concentrations of MMP7 also correlate with pulmonary function tests, which are surrogate measures of disease severity and thus may reflect molecular mechanisms of lung remodeling in IPF [31]. Naturally, the use of different platforms and different sample types limits our ability at this stage to set a disease-specific MMP concentration threshold. However, the reproducibility and concordance of our results across different sample types and in multiple cohorts suggest that such a threshold can and should be determined.
In conclusion, in this study we report for the first time to our knowledge the presence of a peripheral blood protein signature in a disease that is confined to the lung. This signature is composed of MMPs, TNF receptors, and some chemokines. Our data demonstrate that peripheral blood increases in two of these markers (MMP1 and MMP7) are also observed in lung and may be specific to IPF. We provide verification of our observations in an independent validation cohort and show that MMP7 correlates with disease severity and is increased in patients with subclinical ILD. While additional studies will determine the value of this protein signature in clinical practice, our results support a potential value of peripheral blood proteins as biomarkers in an organ-confined disease such as IPF. If validated, these biomarkers have the potential to greatly facilitate the introduction of new therapies in IPF and to profoundly affect the management of these patients.
Supporting Information
Alternative Language Abstract S1. Russian Translation of the Abstract by Anna E. Lokshin
https://doi.org/10.1371/journal.pmed.0050093.sd001
(24 KB DOC)
Alternative Language Abstract S2. Spanish Translation of the Abstract by Moises Selman
https://doi.org/10.1371/journal.pmed.0050093.sd002
(42 KB DOC)
Alternative Language Abstract S3. Japanese Translation of the Abstract by Kazuhisa Konishi
https://doi.org/10.1371/journal.pmed.0050093.sd003
(22 KB DOC)
Alternative Language Abstract S4. Chinese Translation of the Abstract
https://doi.org/10.1371/journal.pmed.0050093.sd004
(38 KB DOC)
Alternative Language Abstract S5. Hebrew Translation of the Abstract
https://doi.org/10.1371/journal.pmed.0050093.sd005
(69 KB DOC)
Accession Numbers
The Entrez Gene IDs (http://www.ncbi.nlm.nih.gov/sites/entrez) of the proteins discussed in this paper are: AGER, 177; CCL11, 6356; CXCL10, 3627; IL12B, 3593; IGFBP1, 3484; MMP1, 4312; MMP3, 4314; MMP7, 4316; MMP8, 4317; MMP9, 5318; TNFRSF1A, 7132; TNFRSF1B, 7133.
Acknowledgments
The authors wish to thank L. Chensny, A. Marrangoni, K. Johnson, and M. Bisceglia for their help in processing the samples and D. Plaskon for maintaining the database. A. Choi, J. Moss, E. Feingold, and J. Lee provided useful discussions and suggestions. This research would not have been possible without the cooperation of the patients of the Simmons Center and the Pulmonary-Critical Care Medicine Branch.
Author Contributions
IOR contributed to recruitment and analysis of the validation cohort, data analysis, conceptualization and design of the study, data analysis, and manuscript preparation. TJR contributed to project conceptualization, data analysis, statistics, and writing of the initial draft of the paper. KK contributed to microarray data generation and analysis, conceptualization of the study, and manuscript generation. YZ contributed to sample collection and protocol development, data analysis, and manuscript generation. KG contributed to participant recruitment, diagnosis, and ascertainment of the sarcoidosis cohort as well as the derivation cohort, protocol development, planning of experiments, and manuscript preparation. AEL contributed to Luminex data generation including custom assays and data analysis. KOL contributed to participant recruitment and diagnosis ascertainment of the derivation cohort, protocol development, and manuscript preparation. JC contributed to participant recruitment and diagnosis ascertainment of the HP cohort, ELISA assays, data analysis, and manuscript preparation. SDM contributed to participant recruitment and diagnosis ascertainment of the validation cohort, protocol development, and manuscript preparation. AP contributed to data analysis and manuscript preparation. FS contributed to participant recruitment and diagnosis ascertainment of the COPD cohort, protocol development, and manuscript preparation. JD contributed to conceptualization of proposed studies, diagnosis ascertainment of the derivation cohort, data analysis, and manuscript preparation. MS contributed to participant recruitment and diagnosis ascertainment of all Mexican patient cohorts, planning of the study, preparation of early drafts, and manuscript development. BRG contributed to design and conceptualization of the research project, participant recruitment and diagnosis ascertainment of the validation cohort, protocol development, and manuscript preparation. NK contributed to design and conceptualization of the research project, analysis of all data including gene expression and protein measurements, paper writing, and manuscript submission.
References
- 1. Kim DS, Collard HR, King TE Jr. (2006) Classification and natural history of the idiopathic interstitial pneumonias. Proc Am Thorac Soc 3: 285–292.
- 2. Kaminski N, Rosas IO (2006) Gene expression profiling as a window into idiopathic pulmonary fibrosis pathogenesis: can we identify the right target genes. Proc Am Thorac Soc 3: 339–344.
- 3. Gibson KF, Kaminski N (2004) The mechanisms of idiopathic pulmonary fibrosis: can we see the elephant. Drug Discov Today Dis Models 1: 117–122.
- 4. Keane MP, Strieter RM, Belperio JA (2005) Mechanisms and mediators of pulmonary fibrosis. Crit Rev Immunol 25: 429–463.
- 5. Greene KE, King TE Jr., Kuroki Y, Bucher-Bartelson B, Hunninghake GW, et al. (2002) Serum surfactant proteins-A and -D as biomarkers in idiopathic pulmonary fibrosis. Eur Respir J 19: 439–446.
- 6. Yokoyama A, Kohno N, Hamada H, Sakatani M, Ueda E, et al. (1998) Circulating KL-6 predicts the outcome of rapidly progressive idiopathic pulmonary fibrosis. Am J Respir Crit Care Med 158: 1680–1684.
- 7. Kuwano K, Maeyama T, Inoshima I, Ninomiya K, Hagimoto N, et al. (2002) Increased circulating levels of soluble Fas ligand are correlated with disease activity in patients with fibrosing lung diseases. Respirology 7: 15–21.
- 8. Suga M, Iyonaga K, Ichiyasu H, Saita N, Yamasaki H, et al. (1999) Clinical significance of MCP-1 levels in BALF and serum in patients with interstitial lung diseases. Eur Respir J 14: 376–382.
- 9. Mukae H, Iiboshi H, Nakazato M, Hiratsuka T, Tokojima M, et al. (2002) Raised plasma concentrations of alpha-defensins in patients with idiopathic pulmonary fibrosis. Thorax 57: 623–628.
- 10. Kadota J, Mizunoe S, Mito K, Mukae H, Yoshioka S, et al. (2005) High plasma concentrations of osteopontin in patients with interstitial pneumonia. Respir Med 99: 111–117.
- 11. American Thoracic Society (2000) Idiopathic pulmonary fibrosis: diagnosis and treatment. International consensus statement. American Thoracic Society (ATS), and the European Respiratory Society (ERS). Am J Respir Crit Care Med 161: 646–664.
- 12. Katzenstein AL, Myers JL (1998) Idiopathic pulmonary fibrosis: clinical relevance of pathologic classification. Am J Respir Crit Care Med 157: 1301–1315.
- 13. King TE Jr., Tooze JA, Schwarz MI, Brown KR, Cherniack RM (2001) Predicting survival in idiopathic pulmonary fibrosis: scoring system and survival model. Am J Respir Crit Care Med 164: 1171–1181.
- 14. Pauwels RA, Buist AS, Calverley PM, Jenkins CR, Hurd SS (2001) Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease. NHLBI/WHO Global Initiative for Chronic Obstructive Lung Disease (GOLD) Workshop summary. Am J Respir Crit Care Med 163: 1256–1276.
- 15. Costabel U, Hunninghake GW (1999) ATS/ERS/WASOG statement on sarcoidosis. Sarcoidosis Statement Committee. American Thoracic Society. European Respiratory Society. World Association for Sarcoidosis and Other Granulomatous Disorders. Eur Respir J 14: 735–737.
- 16. [No authors listed] (1999) Statement on sarcoidosis. Joint Statement of the American Thoracic Society (ATS), the European Respiratory Society (ERS) and the World Association of Sarcoidosis and Other Granulomatous Disorders (WASOG) adopted by the ATS Board of Directors and by the ERS Executive Committee, February 1999. Am J Respir Crit Care Med 160: 736–755.
- 17. Bustos ML, Frias S, Ramos S, Estrada A, Arreola JL, et al. (2007) Local and circulating microchimerism is associated with hypersensitivity pneumonitis. Am J Respir Crit Care Med 176: 90–95.
- 18. Selman M, Pardo A, Barrera L, Estrada A, Watson SR, et al. (2006) Gene expression profiles distinguish idiopathic pulmonary fibrosis from hypersensitivity pneumonitis. Am J Respir Crit Care Med 173: 188–198.
- 19. Steele MP, Speer MC, Loyd JE, Brown KK, Herron A, et al. (2005) Clinical and pathologic features of familial interstitial pneumonia. Am J Respir Crit Care Med 172: 1146–1152.
- 20. Ren P, Rosas IO, Macdonald SD, Wu HP, Billings EM, et al. (2007) Impairment of alveolar macrophage transcription in idiopathic pulmonary fibrosis. Am J Respir Crit Care Med 175: 1151–1157.
- 21. Rosas IO, Ren P, Avila NA, Chow CK, Franks TJ, et al. (2007) Early interstitial lung disease in familial pulmonary fibrosis. Am J Respir Crit Care Med 176: 698–705.
- 22. Pardo A, Gibson KF, Cisneros J, Richards TJ, Yang Y, et al. (2005) Up-regulation and profibrotic role of osteopontin in human idiopathic pulmonary fibrosis. PLoS Med 2: e251.
- 23. American Thoracic Society, European Respiratory Society (2002) American Thoracic Society/European Respiratory Society International Multidisciplinary Consensus Classification of the Idiopathic Interstitial Pneumonias. This joint statement of the American Thoracic Society (ATS), and the European Respiratory Society (ERS) was adopted by the ATS board of directors, June 2001 and by the ERS Executive Committee, June 2001. Am J Respir Crit Care Med 165: 277–304.
- 24. Selman M, Carrillo G, Estrada A, Mejia M, Becerril C, et al. (2007) Accelerated variant of idiopathic pulmonary fibrosis: clinical behavior and gene expression pattern. PLoS ONE 2: e482.
- 25. Gorelik E, Landsittel DP, Marrangoni AM, Modugno F, Velikokhatnaya L, et al. (2005) Multiplexed immunobead-based cytokine profiling for early detection of ovarian cancer. Cancer Epidemiol Biomarkers Prev 14: 981–987.
- 26. Wu W, Dave N, Tseng GC, Richards T, Xing EP, et al. (2005) Comparison of normalization methods for CodeLink Bioarray data. BMC Bioinformatics 6: 309.
- 27. Tusher VG, Tibshirani R, Chu G (2001) Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A 98: 5116–5121.
- 28. Ihaka R, Gentleman R (1996) R: A language for data analysis and graphics. J Comp Graph Stat 5: 299–314.
- 29. Novershtern N, Itzhaki Z, Manor O, Friedman N, Kaminski N (2008) A functional and regulatory map of asthma. Am J Respir Cell Mol Biol 38: 324–336.
- 30. Yang IV, Burch LH, Steele MP, Savov JD, Hollingsworth JW, et al. (2007) Gene expression profiling of familial and sporadic interstitial pneumonia. Am J Respir Crit Care Med 175: 45–54.
- 31. Pardo A, Selman M (2006) Matrix metalloproteases in aberrant fibrotic tissue remodeling. Proc Am Thorac Soc 3: 383–388.
- 32. Perez-Padilla R, Salas J, Chapela R, Sanchez M, Carrillo G, et al. (1993) Mortality in Mexican patients with chronic pigeon breeder's lung compared with those with usual interstitial pneumonia. Am Rev Respir Dis 148: 49–53.
- 33. Ohtani Y, Saiki S, Kitaichi M, Usui Y, Inase N, et al. (2005) Chronic bird fancier's lung: histopathological and clinical correlation. An application of the 2002 ATS/ERS consensus classification of the idiopathic interstitial pneumonias. Thorax 60: 665–671.
- 34. Churg A, Muller NL, Flint J, Wright JL (2006) Chronic hypersensitivity pneumonitis. Am J Surg Pathol 30: 201–208.
- 35. Gadek JE, Kelman JA, Fells G, Weinberger SE, Horwitz AL, et al. (1979) Collagenase in the lower respiratory tract of patients with idiopathic pulmonary fibrosis. N Engl J Med 301: 737–742.
- 36. Suga M, Iyonaga K, Okamoto T, Gushima Y, Miyakawa H, et al. (2000) Characteristic elevation of matrix metalloproteinase activity in idiopathic interstitial pneumonias. Am J Respir Crit Care Med 162: 1949–1956.
- 37. Selman M, Ruiz V, Cabrera S, Segura L, Ramirez R, et al. (2000) TIMP-1, -2, -3, and -4 in idiopathic pulmonary fibrosis. A prevailing nondegradative lung microenvironment. Am J Physiol Lung Cell Mol Physiol 279: L562–L574.
- 38. Fukuda Y, Ishizaki M, Kudoh S, Kitaichi M, Yamanaka N (1998) Localization of matrix metalloproteinases-1, -2, and -9 and tissue inhibitor of metalloproteinase-2 in interstitial lung diseases. Lab Invest 78: 687–698.
- 39. Zuo F, Kaminski N, Eugui E, Allard J, Yakhini Z, et al. (2002) Gene expression analysis reveals matrilysin as a key regulator of pulmonary fibrosis in mice and humans. Proc Natl Acad Sci U S A 99: 6292–6297.
- 40.
Pardo A, Selman M, Kaminski N (2007) Approaching the degradome in idiopathic pulmonary fibrosis. Int J Biochem Cell Biol. Epub ahead of print.
- 41. Li Q, Park PW, Wilson CL, Parks WC (2002) Matrilysin shedding of syndecan-1 regulates chemokine mobilization and transepithelial efflux of neutrophils in acute lung injury. Cell 111: 635–646.
- 42. McGuire JK, Li Q, Parks WC (2003) Matrilysin (matrix metalloproteinase-7) mediates E-cadherin ectodomain shedding in injured lung epithelium. Am J Pathol 162: 1831–1843.
- 43. Cosgrove GP, Schwarz MI, Geraci MW, Brown KK, Worthen GS (2002) Overexpression of matrix metalloproteinase-7 in pulmonary fibrosis. Chest 121: 25S–26S.
- 44. Silva CI, Churg A, Muller NL (2007) Hypersensitivity pneumonitis: spectrum of high-resolution CT and pathologic findings. AJR Am J Roentgenol 188: 334–344.
- 45. Rosas IO, Kaminski N (2007) When it comes to genes–IPF or NSIP, familial or sporadic–they're all the same. Am J Respir Crit Care Med 175: 5–6.
- 46. Vuorinen K, Myllarniemi M, Lammi L, Piirila P, Rytila P, et al. (2007) Elevated matrilysin levels in bronchoalveolar lavage fluid do not distinguish idiopathic pulmonary fibrosis from other interstitial lung diseases. Apmis 115: 969–975.