Comparison of Label-free Methods for Quantifying Human Proteins by Shotgun Proteomics*,S

William M. Old{ddagger}, Karen Meyer-Arendt{ddagger}, Lauren Aveline-Wolf{ddagger}, Kevin G. Pierce{ddagger}, Alex Mendoza{ddagger}§, Joel R. Sevinsky{ddagger}, Katheryn A. Resing{ddagger} and Natalie G. Ahn{ddagger},§,||

From the {ddagger} Department of Chemistry and Biochemistry, § Howard Hughes Medical Institute, University of Colorado, Boulder, Colorado 80309-0215


    ABSTRACT
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Measurements of mass spectral peak intensities and spectral counts are promising methods for quantifying protein abundance changes in shotgun proteomic analyses. We describe Serac, software developed to evaluate the ability of each method to quantify relative changes in protein abundance. Dynamic range and linearity using a three-dimensional ion trap were tested using standard proteins spiked into a complex sample. Linearity and good agreement between observed versus expected protein ratios were obtained after normalization and background subtraction of peak area intensity measurements and correction of spectral counts to eliminate discontinuity in ratio estimates. Peak intensity values useful for protein quantitation ranged from 107 to 1011 counts with no obvious saturation effect, and proteins in replicate samples showed variations of less than 2-fold within the 95% range (±2{varsigma}) when ≥3 peptides/protein were shared between samples. Protein ratios were determined with high confidence from spectral counts when maximum spectral counts were ≥4 spectra/protein, and replicates showed equivalent measurements well within 95% confidence limits. In further tests, complex samples were separated by gel exclusion chromatography, quantifying changes in protein abundance between different fractions. Linear behavior of peak area intensity measurements was obtained for peptides from proteins in different fractions. Protein ratios determined by spectral counting agreed well with those determined from peak area intensity measurements, and both agreed with independent measurements based on gel staining intensities. Overall spectral counting proved to be a more sensitive method for detecting proteins that undergo changes in abundance, whereas peak area intensity measurements yielded more accurate estimates of protein ratios. Finally these methods were used to analyze differential changes in protein expression in human erythroleukemia K562 cells stimulated under conditions that promote cell differentiation by mitogen-activated protein kinase pathway activation. Protein changes identified with p < 0.1 showed good correlations with parallel measurements of changes in mRNA expression.


An important goal in functional proteomics is to globally profile changes in protein abundances in biological systems, providing a snapshot of the protein expression state of a cell in response to biological perturbations such as cell morphogenesis, disease progression, or drug treatment. This requires sensitive and accurate assays for identifying proteins in complex mixtures and quantifying their abundances. Many studies have utilized two-dimensional gel electrophoresis for proteomic profiling, but this method is limited in sensitivity and can be inefficient when analyzing insoluble proteins or those with very high or low mass. "Bottom up" shotgun proteomics is an alternative approach involving digestion of proteins in complex mixtures followed by mass spectrometric analysis of the peptides to identify represented proteins. One strategy uses multidimensional chromatography to partially resolve peptides by strong cation exchange (SCX)1 and reversed-phase (RP) HPLC, a method often referred to as MudPIT (multidimensional protein identification technology) (13). The last chromatographic step is interfaced with a mass spectrometer capable of data-dependent data acquisition of MS/MS fragmentation spectra, which are then searched against a protein database using programs such as Sequest (4, 5) or Mascot (6) to identify the represented proteins. Using this approach, several thousand proteins can be identified in a single sample.

Quantifying changes in protein abundance between samples is a key requirement for profiling changes in cell state at a molecular level. One approach uses isotope or mass tag labeling of peptides where two samples to be compared are covalently modified by isotopically distinguishable (e.g. 1H versus 2H, 12C versus 13C, or 14N versus 15N) but chemically similar adducts, the samples are proteolyzed and mixed, and relative changes in protein abundance are determined from ratios of intensities between the differentially labeled peptides (79). Stable isotope labeling enables relative peptide abundances to be directly compared, bypassing problems due to ion-suppressive effects of coeluting peptides (10). Studies reporting the accuracy and variability of ICAT experiments suggest the technique can reliably detect down to an ~1.5-fold change in protein abundance over a dynamic range from 10- to 100-fold (1114). Disadvantages include cost of isotopic labeling and requirement for pairwise comparisons between samples, which prevent retrospective comparisons and complicate large studies.

Label-free protein quantitation methods are promising alternatives. Three studies using standards have demonstrated that mass spectral peak intensities of peptide ions correlate well with protein abundances in complex samples. Bondarenko and co-workers demonstrated linear responses of peptide ion peak areas between 10 and 1,000 fmol of myoglobin spiked into human plasma with a relative standard deviation <11% (15, 16). Likewise Wang et al. (17) published similar results with protein standards spiked into serum, obtaining a median relative standard deviation of 26% for peak intensity ratios from ~3,400 ions in 25 replicate measurements. Another label-free method, termed spectral counting, compares the number of MS/MS spectra assigned to each protein. Spectral counts of standard proteins added to yeast extracts showed linearity over 2 orders of magnitude with high correlation to the relative protein concentration (18). An advantage of spectral counting is that relative abundances of different proteins can in principle be measured. Thus, significant correlations have been shown between spectral counts and independent estimates of protein copy number in yeast (19).

Each of these studies utilized relatively simple analytical protocols; the application of these methods to mammalian systems, where the dynamic range and the number of proteins is greater, requires methods for dealing with multiple chromatographic separations. Here we investigate the performances of peptide ion intensity measurements and spectral counting in analyzing changes in protein abundances in complex samples from human cells.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Sample Preparation
Details of protocols used to prepare samples were described previously (20). Briefly suspension cultures of K562 erythroleukemia cells (5 x 108) were grown in RPMI 1640 medium + 10% fetal bovine serum. Harvested cells were washed twice with phosphate-buffered saline containing 1 mM EDTA and 1 mM EGTA, and then soluble proteins were extracted in high salt buffer followed by high speed centrifugation. In some experiments, cells were treated with 10 nM phorbol 12-myristate 13-acetate (PMA) (Sigma) in the presence or absence of 20 µM U0126 (Promega) for 24 h prior to extraction. Treatment with PMA induces megakaryocyte cell differentiation, and U0126 inhibits PMA-induced differentiation (21, 22). Soluble proteins were either resolved by size exclusion chromatography (Sephacryl HR300 (26/60), Amersham Biosciences) or analyzed without separation. Proteins in each sample were alkylated with iodoacetamide, desalted into 100 mM NH4HCO3, pH 8.3, and proteolyzed with 3% trypsin, 0.5 mM CaCl2 (Wako). This protocol was optimized to ensure complete Cys alkylation, minimize artifactual loss of HSCH3 (48 Da) from Met, and minimize incomplete proteolysis. After lyophilization three times to remove NH4HCO3, ~0.5–3.5 mg of peptide was fractionated by SCX-HPLC chromatography (polysulfoethyl A, 2.1-mm inner diameter x 200 mm, Poly LC). Reversed-phase separation was performed with 250-µm-inner diameter reversed-phase capillary columns fabricated in-house using a frit made from a C18 Empore disk (3M, St. Paul, MN), inserting a 50-µm-inner diameter, 180-µm-outer diameter piece of tubing as an outlet, and then joining the two pieces of tubing with epoxy glue (EPO-TEK; Epoxy Technology, Billerica, MA). Columns were packed with Jupiter C18 resin (10-µm particle size; Phenomenex, Torrance, CA) using a bomb pressurized with helium gas at 100–400 p.s.i.

Three experimental samples were analyzed in this study. Sample 1 consists of soluble K562 proteins that were proteolyzed and then separated into 20 SCX-HPLC fractions in three replicate fractionations. Two analyses were carried out with these samples: (a) reversed-phase replicate analysis where fractions from one SCX replicate were split and each was analyzed by RP-LC/MS/MS over the full m/z range from 350 to 1,500 Da for each survey scan and (b) SCX replicate comparison where fractions from two of the replicate SCX-HPLC separations were analyzed by RP-LC/MS/MS over the full m/z range from 350 to 1,500 Da for each survey scan. Sample 2 consists of proteins in gel filtration fractions that were proteolyzed, peptides were resolved into 20 fractions by SCX-HPLC, and each SCX fraction was analyzed by RP-LC/MS/MS in one full m/z range run and six gas phase fractionation runs covering 300–678, 670–798, 798–918, 910–1,038, 1,030–1,278, and 1,270–1,750 Da (20). Sample 3 consists of soluble extracts from cells that were (i) unstimulated, (ii) stimulated with 10 nM PMA to induce cell differentiation, or (iii) pretreated with 20 µM U0126 prior to 10 nM PMA to block cell differentiation. Soluble proteins were extracted under each condition and proteolyzed, peptides were resolved in six fractions by SCX-HPLC, and SCX fractions were analyzed by RP-LC/MS/MS in one full m/z range (250–1,500 Da) and three gas phase fractionation analyses covering 350–806, 794–1,131, and 1,119–1,600 Da. Statistics for the three samples are listed in Supplemental Table 1, A and B.

For experiments where protein standards were added to a complex experimental sample, a peptide digest of 3 mg of protein from the soluble extract was separated into 20 SCX fractions, and two SCX fractions containing 13 and 11% of total peptides (calculated from A280 of each fraction normalized to total A280 summed over all fractions) were combined. Tryptic digests of BSA or bovine apotransferrin (ApoT), reduced and alkylated with iodoacetamide, were added to the pool, and 1.9% of the resulting mixture was used in each LC/MS analysis. This corresponds to ~10 µg of peptide from the original K562 extract in each analysis plus varying amounts of standard protein digests maximally added at 20 pmol of BSA + 5 pmol of ApoT (~0.3 µg of total standard protein).

Quantitation of Proteins between Gel Filtration Fractions
Proteins in fractions from sizing gel exclusion were separated by 10% SDS-PAGE and visualized by Coomassie staining. Gel images were scanned (UMAX 1200), and staining intensities were quantified using the public domain NIH Image program (rsb.info.nih.gov/nih-image/), measuring band volumes (pixel intensity integrated over band area) in arbitrary units. The ratio of the band volumes between different lanes was calculated and compared with ratios of protein abundance determined from spectral ion intensities or spectral counting. Proteins in selected bands were identified by in-gel digestion and MS analysis as described previously (23) using an ABI DE-STR MALDI-TOF mass spectrometer for peptide mass fingerprinting and an Agilent XCT ion trap mass spectrometer for LC/MS/MS sequencing.

Microarray Measurements
K562 cells (7 x 105) were cultured and treated with PMA ± U0126 as above, and total RNA was isolated by TRIzol extraction (Invitrogen). Independent experiments were performed in triplicate for each condition. First and second strand cDNA synthesis, in vitro transcription of biotin-labeled cRNA, and fragmentation were carried out following standard protocols from the Affymetrix Expression Analysis Technical Manual (www.affymetrix.com). The samples were hybridized onto U133 2.0 Plus GeneChips (Affymetrix) and processed at the University of Colorado Health Sciences Center Cancer Center Microarray Core facility. Datasets were corrected for background and normalized using robust multiarray average normalization (24). p values for assessing significance of changes were determined by using an empirical Bayes approach and were corrected for multiple testing using the false discovery rate (25, 26).

Mass Spectrometry
Each SCX fraction was analyzed by RP-LC/MS/MS using a ThermoFinnigan LCQ Deca ion trap mass spectrometer. MS/MS was performed by acquiring one full scan mass spectrum and then MS/MS spectra of the three most intense peaks. Automated gain control was set at 8 x 108. The m/z range either covered a full mass window (350–1,500 Da) or was divided into three to six narrow windows for gas phase fractionation over 350–1,750 Da (2729). DTA files were generated from the MS/MS spectra by TurboSequest and searched by Sequest or concatenated into a Mascot Generic File for searching with Mascot; searches used the International Protein Index (IPI) database from European Bioinformatics Institute (version 2.18 for Sample 2; version 3.0, November 2004 for Samples 1 and 3) (30). Sequest and Mascot search results were parsed into an Oracle 9i relational database using in-house parsers and a modified version of DBParser (31). Peptide identifications were filtered by MSPlus, a program for improved peptide identification and validation, which determines confidence based on agreement between Sequest and Mascot identifications as well as chemical properties of the peptides. The high confidence identifications are then processed by Isoform Resolver, which uses a peptide centric database strategy to assemble validated peptide sequences into protein identifications, accounting for groups of isoforms and splice variants that share common peptides (20). Quantitation by spectral ion intensity measurements or spectral counting was performed on the high confidence peptide sequences using the isoform groupings generated by Isoform Resolver.

Serac: New Software for Protein Quantitation
The Serac PeakExtractor module measures peptide ion intensities by calculating peak areas from extracted ion chromatograms (XICs) for each peptide ion in each LC/MS. PeakExtractor was written in Visual Basic and requires the Xcalibur development kit for accessing Finnigan LCQ .RAW data files. Each peptide ID is mapped back to its originating .RAW data file, the file is opened, and an XIC is generated by summing the intensities within a narrow m/z range from the full scan mass spectra for each scan cycle using a lower limit of ((peptide monoisotopic mass – 1) + charge)/charge and upper limit of ((peptide average mass + 2) + charge)/charge. Built-in Xcalibur development kit functions for peak finding, peak smoothing, and peak integration are used to determine the peak area intensity for the peptide with user-specified parameters to control each step. Parameters are determined that optimize peak finding, base-line subtraction, and integration for each experimental dataset. For example, Fig. 1A shows the interface that is used to optimize scanning and peak integration parameters for the ion trap MS to minimize the presence of peak heterogeneity; such parameters must be reoptimized whenever changes are made to the experimental protocol. Once the parameters have been established for each experiment, PeakExtractor analyzes the entire dataset in a batch operation. The running time to analyze a dataset with 91,273 high confidence MS/MS identifications (Fig. 5, Sample 3) on a Pentium 4 processor was 3 h.



View larger version (84K):
[in this window]
[in a new window]
 
FIG. 1. User interface for quantification software. Screen shots from the Serac PeakExtractor software module are shown. A, the program displays an extracted ion chromatogram for each m/z value with arrows indicating where MS/MS spectra were taken. B, the program shows MS and MS/MS spectra corresponding to this peak and displays the parameters that were used to quantify the peak area intensity. In this case, the peak is complex with two peptides coeluting and would be excluded from the quantitation because the coeluting peptides cannot be resolved from one another. C, example of Serac’s database user interface, which accesses information in the Oracle database, specifying peptides, masses, and retention times to quantify and store peak area intensities.

 


View larger version (39K):
[in this window]
[in a new window]
 
FIG. 5. Analysis of soluble K562 proteins separated by gel filtration. A, proteins in the indicated gel filtration fractions were separated by SDS-PAGE and visualized by Coomassie staining. Those identified by in-gel digestion and MALDI-TOF and then confirmed by LC/MS/MS are indicated by arrows. B, comparison of peak area intensities of peptides from cytoplasmic actin observed between six different gel filtration fraction combinations. Only peptides found in common in each comparison are plotted. Correlation coefficients of linear regression lines are greater than 0.9 in all cases. C, comparison of protein ratios measured by peak area intensities versus Coomassie staining for proteins identified in A (RGEL = log2(stained peak volume 1/stained peak volume 2)). Points represent ratios of proteins between indicated fractions where both measurements could be made. D, comparison of protein ratios measured by spectral counting versus Coomassie staining. E, comparison of protein ratios measured by spectral counting versus peak area intensities for those proteins shown in C and D. Higher correlations between RSC and RPA suggest that most of the variability in C and D is due to error in quantifying Coomassie staining intensities. F, comparison of spectral counts for proteins in gel filtration fractions 6 and 8. Significant variation outside the 95% confidence limits were observed as expected. G, comparison of protein ratios measured by spectral counting versus peak area intensities for proteins sharing ≥3 peptides in common between gel filtration fraction 6 versus fraction 8. Closed symbols indicate p ≤ 0.05 as determined from Student’s t test of the log2 protein ratios for significant change with closed triangles showing the subset with p ≤ 0.05 for the spectral count ratio as determined by the G test. HSP, heat shock protein; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; ALDOA, aldolase A; fr#, fraction; GF, gel filtration; NCL, nucleolin.

 
PeakExtractor interacts directly with the Oracle database by inserting peptide intensity measurements into relational tables. Peak areas are measured and stored only when the MS/MS scan of the acquired and identified ion is contained within the peak boundaries to eliminate ions within the m/z range eluting in other regions of the HPLC run. Coeluting ions of similar m/z are more problematic and require manual inspection. When quantifying low resolution ion trap data, overlapping peaks are sometimes observed in the XIC (pseudopeaks) that correspond to different ions in the m/z range of interest. These may be identified by looking for spectral dissimilarity between the MS/MS spectra within the peak and rejecting the peak if heterogeneous. For example, coeluting peptides are shown in Fig. 1A by arrows, indicating the elution times for MS/MS spectra distinguishable in Fig. 1B.

Protein Ratios from Peak Area Intensities—
The Serac PASC module calculates ratios of ion intensities for peptides matched between different experiments and averages the peptide ratios as a measure of protein change as in stable isotope labeling studies (32, 33). After removing potential outliers using Dixon’s Q-test (34), the program calculates mean and standard deviations of log2 intensity ratios of matched peptides found in each protein. A Student’s t test is used to identify proteins that change significantly between datasets, resulting in two-tailed p values for each protein, similar to methods used in microarray analysis for identifying differentially expressed genes (35).

Protein Ratios from Spectral Counts—
PASC also combines spectral counts for each protein using the protein isoform grouping identified by Isoform Resolver and counting the number of MS/MS assigned to each protein. Spectral counts for peptides shared between isoforms were considered separately due to ambiguity in protein assignment. Differences in spectral counts are identified by applying a likelihood ratio test (G test) for independence (36), which is similar to the {chi}2 and Fischer’s exact tests and corrects for variations in total counts between datasets. The G statistic is approximately distributed as {chi}2 with one degree of freedom, allowing p value calculations for each protein that aid in identifying those with differential expression. To quantify changes in spectral counts, we estimated -fold changes as proposed by Beissbarth et al. (37) for serial analysis of gene expression (SAGE) data, which avoids the discontinuity seen in simple count ratios when a protein shows spectral count = 0 in one of the samples.

(Eq. 1)

where, for each protein, RSC is the log2 ratio of abundance between Samples 1 and 2; n1 and n2 are spectral counts for the protein in Samples 1 and 2, respectively; t1 and t2 are total numbers of spectra over all proteins in the two samples; and f is a correction factor set to 0.5 by Beissbarth et al. (37) and varied in this study. This expression has the advantage of correcting for differences in sampling depth between two experiments.


    RESULTS
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Quantifying Proteins from Spectral Ion Intensity Measurements—
We have developed new software, named Serac, for quantifying proteins in shotgun proteomic datasets, which enables high level comparison between experimental datasets. The Serac PeakExtractor module determines ion intensity values from LC/MS datasets and sums intensities for each peptide. The PASC module converts peptide intensity information and spectral counts into protein abundance ratios and identifies significant changes.

For each identified and validated peptide, PeakExtractor calculates a peak intensity area representing the abundance of each charge state in each RP-LC/MS/MS run using XICs of LC/MS spectra (see "Experimental Procedures" for details). Different charge forms of one peptide are treated as separate observations. A Visual Basic interface enables manual inspection of individual XICs (Fig. 1A) and optimization of parameters for XIC summation, background subtraction, peak detection, and detection of possible heterogeneity in MS/MS spectra within single peaks (Fig. 1B). PeakExtractor allows users to define parameters for peak fitting and methods for combining information from different LC/MS analyses. When several peaks appear in the XIC, the program chooses the peak with the correct MS/MS spectra for the peptide of interest. When different peptides with similar m/z values coelute, PeakExtractor accepts only those peaks with one peptide ion uniquely associated with it. PeakExtractor interfaces directly with a relational database such as Oracle or Microsoft Access (Fig. 1C) that is then used by the PASC analysis modules.

Correcting Peptide Peak Area Intensity Ratios for Systematic Bias—
Peak area (PA) intensity measurements of peptide ions from replicate RP-LC/MS/MS runs were compared for 16 SCX fractions from Sample 1. An example of systematic bias in signal is shown in Fig. 2A where variable recovery between replicate analyses was apparent in peptides from certain SCX fractions that deviated from the diagonal. These may be due to variations in sample loading, HPLC, or MS instrument performance. Anderle et al. (38) showed that systematic variations in mass spectrometric ion current ratio measurements could be minimized by normalizing each peak intensity by the sum or median of ion intensities over the entire run (including ions validated or not validated by MS/MS). We found that correcting each peptide intensity by the sum of total intensities centered the distribution of off-diagonal points found in various SCX fractions and reduced the variability around the population mean (Fig. 2, A and B) by reducing the log2 peptide ratio standard deviation from 0.66 to 0.53. In general, inconsistent loading was the most common cause of systematic variation between reversed-phase runs and the normalization procedure corrected for this bias.



View larger version (34K):
[in this window]
[in a new window]
 
FIG. 2. Correction of peak area intensities for systematic bias. A, peptide peak area intensities before normalization. Shown are log2-log2 scatter plots of the PA measurements for all peptides observed in common between replicate RP-LC/MS/MS runs of 16 different SCX fractions from Sample 1, each plotted as a different color. The center line indicates peptide intensity ratio = 1, indicating no change, and the red line shows 2-fold change. The dotted line indicates the 95% range (±2{varsigma}). B, peak area intensities after normalization for total peak area intensity in each RP run. Systematic bias is clearly evident in A, and normalization brings these datasets in line with the other LC/MS analyses. The scale change on both axes is caused by dividing each peak area by the total peak area over each RP run and multiplying by 1 x 105. C, comparison of replicate SCX runs of Sample 1 split into two 3-mg aliquots where each SCX fractionation is followed by RP-LC/MS/MS analysis. Peak area intensities are normalized for total peak area in each RP run. D, log2 protein ratios calculated for each peak area (RPA), comparing datasets in C, determined by calculating average and standard deviations of log2 (peptide intensity ratio). Values are plotted as a function of log10 peak area intensity with different colors distinguishing proteins with 1, 2, 3–5, and ≥6 peptides/protein in common between datasets. Lower variability in protein ratio is observed when three or more peptides are in common. E, density estimates for the log2 ratio distributions for data in D show approximate normal log2 distributions of proteins with ≥3 peptides in common between datasets.

 
Complexity increases when comparing peptide samples separated by SCX chromatography because peptides do not always elute in exactly the same fraction between SCX runs. We tested two methods to match peptide intensities between parallel SCX separations of Sample 1. In the first approach, peptide ion peak areas were quantified and matched only in the SCX fraction with highest intensity. In the second approach, peptide ion peak intensities were summed over each SCX fraction where the peptide was observed by MS/MS. The first method should compensate for ions missing due to incomplete MS/MS sampling, while the second method should compensate for variable separation of peptides into different SCX fractions.

Similar correlations were observed between replicate SCX runs using either method that could be attributed to the peptide elution behavior where ~90% of peptides eluted in only one SCX fraction (data not shown). We therefore chose to use the second approach for further studies, summing the intensity measurements of those peptide ions observed in multiple SCX fractions to compensate for variable SCX separation in more complex samples. Methods are included in the software for maximizing the alignment of SCX fractions when there is large chromatographic variation along with validation methods to easily detect outliers or systematic biases when changes are made to the overall system.

Random noise is introduced into the system at various steps, including SCX and reversed-phase chromatography, and contributes to the spread of the log2 peptide ratios around zero. The standard deviation of log2 peptide ratios for SCX replicates (Fig. 2C) was 0.59 compared with 0.53 for reversed-phase replicates (Fig. 2B), suggesting that the RP-LC/MS/MS process is more important than SCX chromatography in contributing to variability in peptide ratios. Protein ratios are calculated by intensity weighted averaging of peptide ratios, which has the effect of reducing the standard deviation of the protein ratio distributions, depending on the number of peptides for a given protein. The 95% range (±2{varsigma}) for log2 peptide ratios in this dataset was ±1.2, corresponding to the ability to detect a 2.3-fold change in ratio for a single peptide. When ratios of peptide intensities for each protein were averaged, the ratio threshold for significant change was lowered as a function of the number and standard deviation of peptide ratios contributing to the protein average as shown in Fig. 2D. As expected, the majority of false positives were observed in proteins represented by only one or two peptides in common between samples with lower variability in proteins with three or higher peptides in common (Fig. 2, D and E). The log2 ratio distributions for proteins represented by three or more peptides showed less variation (Fig. 2E) and were normally distributed as verified by normal probability plots (data not shown).

Linearity and Sensitivity—
We tested linearity of intensity measurements by spiking varying amounts of two standard proteins into 10 µg of a complex sample (pooled from two SCX fractions in Sample 1). In total, 21 and 43 peptide ions were observed for BSA (added at 0.2–20 pmol) and ApoT (added at 0.05–5 pmol), respectively. Linearity between peak area intensity versus amount of added standard was observed for nearly all peptides. Two examples are shown in Fig. 3A, and linear regression parameters for all peptides are listed in Supplemental Table 2.



View larger version (28K):
[in this window]
[in a new window]
 
FIG. 3. Analysis of peak area intensity measurements for standard proteins. A, peak area intensity versus amount of standard protein added to a complex sample of two pooled SCX fractions. Representative peptides from ApoT and BSA show linearity over a range of peptide intensities. Other peptides identified and parameters of regression lines are given in Supplemental Table 2. B, observed log2 protein ratios (RPA), determined by averaging log2 peak area intensity ratios, are systematically underestimated compared with expected RPA values. Ratios were determined for all possible pairwise combinations of diluted standards. Symbols indicate dilutions used for the denominator values in each ratio calculation. Open squares represent ratios measured from only one peptide in common. C, the systematic bias in B is minimized by subtracting a background value (2 x 107) from each intensity measurement.

 
Fig. 3B shows expected versus observed log2 protein ratios (RPA), the observed values calculated by averaging log2 intensity ratios for peptides in common between dilution pairs. Overall the observed protein ratios were correlated with expected over the range of the spiked amount of BSA and ApoT examined. The range of ratios quantified by peak area intensity measurements was observed up to log2 ~3.5 and was valid down to 0.4 pmol for ApoT and 1.6 pmol for BSA. This was true for dilution pairs with only one or two peptides in common (Fig. 3B, open squares for BSA), although combining the information from multiple peptides for each protein increased the confidence in the estimated ratio in much the same way that ratios from stable isotope analysis are combined to form protein ratios (32, 33). The observed dynamic range of log2 ~3.5 (10-fold) was lower than that of the tested dynamic range due to the fact that peptides detected (assigned to MS/MS) in the lowest concentrations (0.05, 0.2, and 0.3 pmol of ApoT and 0.6 and 1.2 pmol of BSA) were quantifiable but rejected due to poor peak characteristics of the parent ion chromatogram peak.

When observed and expected ratios were compared, a significant deviation in slope from the expected value of 1 was observed (Fig. 3B). Observed protein ratios were systematically underestimated with the largest deviations as high as 2-fold occurring when at least one ion in the pair had low intensity. This is symptomatic of background noise, which was relatively constant between samples of similar complexity analyzed on the same instrument. Removing all ions with peak area <2 x 107 and subtracting this background value from each intensity measurement improved the agreement between expected versus observed ratios (Fig. 3C), confirming that background contributes to protein ratio underestimation. After correction, the observed ratios matched expected ratios to within ~1.4-fold.

Quantifying Proteins by Spectral Counting—
Experiments described above were evaluated by spectral counting, by summing all MS/MS observations for any peptide in a given protein, including spectra redundant for ion charge states. Spectral counting is complicated when tryptic peptides are shared between different database entries containing overlapping sequences, which we refer to as "protein isoforms." Assessment of such occurrences was facilitated by Isoform Resolver (20), which uses a peptide-centric database to report different IPI entries that share peptide sequences.

To develop a quantitative measure for comparing proteins in different samples by spectral counting, we used a count ratio method described for SAGE analysis (37) and applied the G test of independence to assess significance (36). This method determines confidence level primarily from the ratio of total peptides observed in the two samples to be compared and is implemented in the Serac PASC module. The G test is useful for detecting differential expression between datasets with different sample populations and assumes only that sampling is equally random in each dataset.

Variability in spectral counts between samples was evaluated in replicate analyses of Sample 1. Fig. 4A compares spectral counts per protein in parallel SCX runs of the same proteolytic digestion, and Fig. 4B plots the same data on a log scale. In each case, G test calculations were used to determine 95 and 80% critical value thresholds, predicting confidence limits for differential protein expression. For example, at the 95% threshold, when 0, 1, 2, and 3 spectra are observed in one sample, spectral counts of 3, 6, 9, and 10 in the other sample are respectively required to observe changes in abundance of a given protein between two samples to yield p values <0.05. -Fold changes corresponding to these limits were calculated as in Equation 1 ("Experimental Procedures"), using a correction factor f = 0.5, as proposed by Beissbarth et al. (37) to avoid discontinuities for proteins with spectral count = 0 in one of the samples. Fig. 4C shows the minimum ratio that can be determined at 95 and 80% confidence as a function of the lowest spectral count. As spectral counts increase, protein ratios that can be measured with high confidence decrease, indicating that smaller differences in spectral counts are needed to detect significant changes.



View larger version (34K):
[in this window]
[in a new window]
 
FIG. 4. Quantitation by spectral counting. Linear (A) and log2 (B) scatter plots of spectral counts from replicate SCX runs are shown. Lines show 80 and 95% critical value thresholds for determining differential expression from spectral count differences calculated by the G test statistic. C, minimum protein ratios that can be determined at 80 and 95% confidence limits as shown in A and B. Protein ratios were calculated as in Equation 1 with a correction factor of f = 0.5. D, spectral counts measured for varying amounts of ApoT and BSA standards spiked into pooled SCX fractions as described in Fig. 3. ApoT shows higher spectral counts than BSA at each concentration due to the fact that each protein has different numbers of detectable peptides because some peptides are too hydrophilic to bind to the RP column and others have insufficient charge to be within the observed m/z range. Spectral counts for each protein show a saturation effect above ~30 spectra/protein per RP run most likely due to dynamic exclusion. E, observed log2 protein ratios calculated by spectral counts (RSC) using a correction factor of f = 0.5 versus expected values for protein standards. Points are distinguished according to the SC in the denominator, illustrating that the deviation is largest for ratios with SC = 0 (open squares). F, by adjusting the correction factor to f = 1.25, deviations between observed versus expected protein ratios are minimized.

 
Interestingly as spectral counts increase, the observed number of false positives fell below calculated confidence limits (Fig. 4, A and B). Thus, at spectral count (SC) ≥ 4, only 17 of 404 proteins fell outside the 80% line, corresponding to an empirical false positive frequency of 4.2% and suggesting that differences in spectral counts better report more significant changes in abundance than would be predicted by the G test. This may be a consequence of inaccuracy of G test statistics at low spectral counts (36). In any case, the confidence limits seemed to provide a conservative estimate of significance. We conclude that the maximal number of spectral counts between two samples should be four or more to obtain reliable comparisons.

Linearity and sensitivity in spectral count measurements were evaluated in experiments spiking varying amounts of BSA and ApoT into pooled SCX fractions. Triplicate measurements showed good agreement between spectral counts and protein added down to 200 fmol of BSA and 50 fmol of ApoT, near the limit of detection for the LC/MS/MS configuration used in the experiment (Fig. 4D). This validates the use of minimal spectral counts down to values of 1 or 0 when G test statistics are used to properly assess changes. At the other extreme, a saturation effect was observed in which spectral counts were nonlinear at high levels of added standards. Comparison of BSA and ApoT suggested nonlinearity above a spectral count threshold of ~30 that was independent of protein identity or amount added. This effect is likely to be caused by dynamic exclusion, which limits the number of spectra taken per peptide ion and therefore the number of spectral counts as the degree of sampling approaches maximal sequence coverage. The behavior reveals a practical limit in measurements of spectral counts per protein in any one RP-LC/MS/MS run. However, this limit was never reached in experimental samples (Fig. 4, A and B) where the maximal spectral counts in any single RP run never exceeded 22. Spectral counts distributed across different SCX fractions or replicate RP runs can be added without complications from saturation.

Fig. 4E shows various dilution pairs of BSA and ApoT, comparing expected versus observed log2 protein ratios (RSC), the observed estimated by Equation 1 using f = 0.5. Observed protein ratios increased proportionally with expected over the entire range of BSA and ApoT that was added. The range of ratios quantified by peak area intensity measurements was observed up to log2 ~6.6. Thus, spectral counting showed a wider range than peak area intensity measurements because it enabled ratio measurements to be made at the lowest concentrations of standards, where spectral counts were 0.

Observed protein ratios from spectral counting of protein standards showed large deviations from expected ratios in which observed ratios were systematically overestimated (Fig. 4E). This was especially true when the minimal spectral count in a dilution pair was 0 (Fig. 4E, open squares), indicating that the correction factor for denominator values was too small. By varying the magnitude of the correction factor to f = 1.25, better correspondences between observed and expected protein ratios were obtained (Fig. 4F). This reflected large changes in ratio for dilution pairs where minimum spectral counts were low but little effect when minimum spectral counts were high. After correction, the observed ratios matched expected ratios to within ~2.3-fold. Thus, the errors in protein ratios are significantly higher when measured by spectral counting than peak area intensities, particularly when minimum spectral counts in one sample equal 0 or 1.

Comparison of Methods for Protein Quantification—
Published experiments on label-free quantitation have all used standard proteins for validation. We therefore carried out tests to evaluate the behavior of ion intensity measurement versus spectral counting using more complex samples. Soluble proteins were separated by gel filtration (Sample 2) and visualized by Coomassie staining after separation of fractions by SDS-PAGE (Fig. 5A).

Intensities of peptide ions for these proteins were then compared between different gel filtration fractions following solution digestion, SCX fractionation of peptides, and RP-LC/MS/MS. Fig. 5B represents intensities of peptide ions for cytoplasmic actin, showing six pairwise comparisons of different gel filtration fractions. Only peptides found in common between each pair of fractions were plotted in each panel. The plots showed average linear correlation coefficients r = 0.94 ± 0.04. Similar results were observed for other high abundance proteins where at least three peptide ions could be directly compared between fractions.

To provide independent ratio measurements, protein bands were quantified by scanning densitometry, and ratios of protein abundances between different gel filtration fractions were determined from integrated areas under selected bands. Proteins were selected based on their apparent resolution on the gel and identified by in-gel digestion and MS. Proteins were eliminated when the analysis of the in-gel digests indicated a significant amount of a second protein was present in a band or when a protein eluted at different molecular weights on the gel.

RPA protein ratios determined from peak area intensity measurements were then determined for these proteins and compared with the protein ratios estimated from gel staining (Fig. 5C). The gel staining abundance ratio (RGEL) is expressed as a log2 value where 0 indicates no change, and positive and negative values indicate increases and decreases in staining intensity, respectively. Likewise protein ratios from spectral counts were compared with protein ratios from gel staining (Fig. 5D and Supplemental Table 3). Linear regression of the ratios from gel staining versus from peak area intensity measurements or spectral counts showed reasonable agreement, confirming that label-free methods for quantitation are able to report changes in protein abundance in complex samples. Interestingly a higher correlation was observed when we compared protein ratios from intensity measurements versus spectral counts for the same proteins (Fig. 5E). This strongly suggests that most of the variability in Fig. 5, C and D, is due to error in quantifying gel staining intensities and that protein ratios measured from peak intensities agree well with those measured from spectral counts.

We next examined the behavior of all proteins that could be compared between two gel filtration fractions. In contrast to the narrow distribution in spectral counts between replicate analyses of soluble K562 proteins (Fig. 4, A and B), the variation in spectral counts was much higher when comparing proteins between gel filtration fractions 6 and 8 (Fig. 5F). More than 30% of proteins were found to lie outside the 95% confidence limits (Fig. 5F). Significant differences are expected given that many proteins would be expected to change in abundance between two gel filtration fractions. An off-diagonal centering of the critical value thresholds in this figure reflected the difference in total spectral counts between the two gel filtration fractions that is corrected by the G test statistic.

We then compared RPA versus RSC for all proteins that could be compared between gel filtration fractions 6 versus 8. Fig. 5G shows 147 proteins that shared ≥3 peptides in common of 2,306 proteins identified in fraction 6 and 1,834 proteins in fraction 8. A reasonable correlation between RPA versus RSC was observed, although the variability was somewhat higher than data shown in Fig. 5E due to the lower average number of peptides in common over all proteins in Fig. 5G. However, the subset of 62 proteins with RPA values that were significantly different from 0 (p ≤ 0.05, determined from Student’s t test of the log2 protein ratios; Fig. 5G, closed symbols) included 48 (77%) with RSC significantly different from 0 (p ≤ 0.05, determined by the G test; Fig. 5G, closed triangles). The remaining 14 with p > 0.05 for RSC (Fig. 5G, closed circles) were those in which RPA magnitudes were lower. Thus, for proteins sharing ≥3 peptides in common between samples, those with significant changes in abundance were most readily identified by setting 95% confidence limits for both RPA and RSC, illustrating complementarity between these measurements.

Measurements of protein abundance differences were then extended to examine the behavior of proteins that could be compared between pairwise combinations of gel filtration fractions 6–16. An average of 732 proteins were observed in each gel filtration fraction, totaling 5,339 proteins (20). Each protein typically eluted in three or more fractions; thus we expect the majority of proteins to show differences between pairs of fractions with protein ratios varying from small to large. We examined various pairwise comparisons of fractions 6 through 16. In total, 22,218 unique comparisons of protein abundance could be made. Of these, 19% shared at least one peptide in common, necessary for peak area intensity comparisons, whereas 81% lacked common peptides and could only be evaluated by spectral counting (Fig. 6A). Of the former set, 1,233 (29%) showed three or more peptide ions in common between any two fractions, meeting our criteria for peak area intensity measurements (Fig. 6B). Of the latter set, 7,740 (35%) had at least four spectra in one of the paired fractions. Thus, a similar percentage of proteins met our criteria for spectral count measurements. However, the ratio of proteins quantifiable by spectral counts exceeded those quantifiable by peak area intensities by ~3-fold, and no proteins quantifiable by peak area measurements were excluded from spectral count measurements. Plotting the distribution of peptide ratios within each of these subsets showed a wider dispersion of protein ratios by spectral counting than peak area intensity measurements as well as ~3-fold wider magnitude range (Fig. 6D, left panel). Thus, the spectral counting measurements sampled a larger range of abundance ratios than peak area measurements.



View larger version (28K):
[in this window]
[in a new window]
 
FIG. 6. Abundance ratios measured for all pairwise comparisons of proteins in gel filtration fractions of soluble extracts. Venn diagrams indicate the following. A, all comparisons that could be made by spectral counting (requiring at least one spectra in either fraction pair) or peak area intensity measurements (requiring at least one peptide in common between fraction pairs). Fraction pairs examined include 6 versus 7, 7 versus 8, 6 versus 8, 8 versus 9, 8 versus 10, 9 versus 10, 10 versus 11, 10 versus 12, 11 versus 12, 12 versus 13, 12 versus 14, 13 versus 14, 14 versus 15, 14 versus 16, and 15 versus 16. B, comparisons by spectral counting where at least four spectra were observed in either fraction pair or by peak area measurements where at least three peptides were found in common between fraction pairs. Note that all proteins comparable by peak area intensities are also comparable by spectral counting. C, subsets of proteins in B where p ≤ 0.05 for either RPA and/or RSC. Note that some proteins in B show p ≤ 0.05 for spectral counting measurements but not peak area intensity measurements (4,978) and vice versa (172). D, left panel, range of RSC (open bars) where either fraction pair shows four or more spectra and RPA (gray bars) where at least three peptides are in common between fraction pairs. The box represents the interquartile range (25–75%), the center line represents the median, whiskers extend to the most extreme ratio up to 1.5x interquartile range, and data points outside this range are indicated by circles ({circ}). The magnitudes of ratios determined by spectral counting exceeds that determined by peak area intensities by ~4-fold in either direction. Right panel, range of ratios determined by RSC or RPA for subsets of proteins where p ≤ 0.05 for either ratio measure as in C. Examination of protein groups analyzed by SC only, SC + PA, and PA only (D, right), respectively, shows 18.2, 0.3, and 0.6% supported by one peptide (ignoring charge states) and 14, 6.8, and 9.9% supported by two peptides. By applying a false discovery rate of 4.2% for each unique peptide (20), the total false discovery rate for the groups of proteins analyzed by SC only, SC + PA, and PA only were, respectively, 0.85, 0.03, and 0.04%. Removal of proteins supported by one and two peptides did not change the range or shape of the ratio distributions.

 
Protein pairs were then filtered by p values, requiring p ≤ 0.05 for ratios from peak area or spectral count measurements. These subsets contain proteins most likely to report significant differences that should be highly represented in comparisons of gel filtration fractions. Of 7,740 proteins with spectral counts 4 in one fraction, 5,289 (68%) showed p ≤ 0.05. Of 1,233 proteins with ≥3 peptides in common, 172 showed p ≤ 0.05 for peak area measurements only, whereas 311 showed p ≤ 0.05 for both peak area and spectral counts. Thus, the percentage of proteins showing significant changes by peak area measurements was 39% compared with 68% for spectral counts, reflecting the higher sampling capability of spectral counting. When the distribution of peptide ratios between each set was compared, proteins quantified only by spectral counts showed a wider dispersion in ratio magnitude than those proteins that could be quantified by both spectral counts and peak areas. Likewise proteins that could be quantified by both spectral counts and peak areas showed a wider dispersion in ratio magnitude than those that could only be quantified by peak area measurements. This agrees with the intuitive expectation that the ranges of protein abundances monitored by each method are only partially overlapping.

Comparing Changes in Protein Abundance to mRNA Abundance—
Finally protein changes that accompany cell differentiation were compared with corresponding mRNA changes. In response to phorbol ester stimulation, K562 cells undergo morphological changes involving cell attachment and spreading and induction of genes characteristic of the megakaryocyte lineage. These responses are regulated by the MKK/ERK pathway and are therefore repressed by cell-permeable inhibitors of MKK1/2 such as U0126 (21, 22). Parallel experiments were carried out using shotgun proteomics and Affymetrix microarrays to examine cells that were untreated, stimulated with PMA to induce differentiation, or treated with PMA in the presence of U0126 to repress differentiation.

Soluble proteins were extracted, proteolyzed, separated by SCX-HPLC, and analyzed by RP-LC/MS/MS (Sample 3). The number of proteins identified in (a) control, (b) PMA-treated, and (c) PMA + U0126-treated samples were 463, 395, and 449, totaling 703 proteins. Of these, 62 and 67 proteins, respectively, showed three or more peptides in common between pairs a versus b and b versus c with 43 in common between all three conditions, whereas 254 proteins had four or more spectral counts among one of the three experiments. These were considered quantifiable by intensity measurements. RPA and RSC measurements between control versus PMA conditions were each plotted against log2 ratios of mRNA abundance (RmRNA) measured using Affymetrix microarrays (Fig. 7, A and B). The dispersion of ratios was larger with RSC than RPA as were correlations with RmRNA. After filtering log2 protein ratios for p ≤ 0.1, 41 proteins were observed to significantly change as measured by RSC, and these correlated well with changes measured by RmRNA (Fig. 7A, solid symbols), whereas 10 proteins with significant changes were indicated by RPA (p ≤ 0.1; Fig. 7B, solid symbols).



View larger version (18K):
[in this window]
[in a new window]
 
FIG. 7. Analysis of K562 proteins varying in response to PMA stimulation. A, RSC measurements of protein changes in cells stimulated with PMA (ratio = PMA/control, maximum SC ≥ 4) plotted against corresponding measurements of log2 mRNA ratios (RmRNA). Closed symbols indicate RSC where p ≤ 0.1, and triangles indicate RmRNA, p ≤ 0.1. B, RPA versus RmRNA (ratio = PMA/control, common peptides ≥ 3). Closed symbols indicate RPA, p ≤ 0.1, and triangles indicate RmRNA, p ≤ 0.1. C, analysis of myosin heavy chain type A (MYH9) by Western blotting of K562 whole cell lysates. Induction of protein is observed upon treatment with 10 nM PMA for 48 h or transfection for 48 h with wild type (WT) or constitutively active (CA) MKK1.

 
Proteins showing significant changes between control versus PMA conditions as filtered by RSC significance are indicated in Table I along with spectral counts, peak area, and microarray measurements for each of the three conditions. Importantly all proteins that increased or decreased in response to PMA treatment showed changes in the opposite direction when comparing PMA to PMA + U0126 and thus represent gene products responsive to regulation by MKK/ERK signaling. The majority of proteins responsive to MKK/ERK were those with known function in regulating cytoskeletal interactions as expected from the significant change in cell shape and adhesion accompanying MKK/ERK-mediated cell differentiation. All correlated well with induction of mRNA expression, indicating that MKK/ERK regulates differentiation via control of transcription as we showed previously by promoter analyses (22). To validate these findings, we examined myosin heavy chain type A (MYH9), a subunit of nonmuscle myosin, which had not previously been linked to MKK/ERK pathway regulation or megakaryocyte differentiation. Immunoblots of K562 whole cell lysates showed increased protein abundance of MYH9 following PMA stimulation of cells as well as overexpression of constitutively active MKK1, confirming its novel regulation by mitogen-activated protein kinase signaling (Fig. 7C).


View this table:
[in this window]
[in a new window]
 
TABLE I Proteins sharing significant changes between PMA versus control conditions as filtered by RSC significance

 

    DISCUSSION
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
In this study we describe new software tools for extracting quantitative information from shotgun proteomic datasets. The PeakExtractor module of the Serac software calculates peptide ion intensities based on peak areas from extracted ion chromatograms. The PASC module of Serac carries out peak intensity measurements, averaging intensity ratio values for multiple peptides from each protein, and also uses output from Isoform Resolver to calculate protein ratios from spectral counts.

Reproducibility and sensitivity were evaluated to assess the performance of each method. By normalizing ion intensity measurements by the total ion intensity for each RP run, peak area intensity measurements of replicate analyses showed reproducibility to within log2 ratio values of ±1.2, indicating that 2.3-fold changes could be measured with >95% confidence. Typical coefficients of variance were 30% (0.45 x S.D.) for protein ratios determined from three or more peptide ratios, which were distributed normally on a log2 scale. Analysis of standard proteins showed linearity in peak area intensities versus amount added and good correlations between observed and expected protein ratios. Interestingly observed ratios generally underestimated true ratios, but this systematic variation could be corrected for by subtracting background intensities from each measurement. Only 0.5% of peak areas quantified showed intensities less than 2 x 107 counts; thus relatively little information was lost by this correction.

Spectral counting yielded good reproducibility between replicate experiments. False positives were well within the 95% confidence limit and decreased as spectral counts increased. Here we applied the G test of independence to detect significant change in protein abundance with spectral counts. The G test is less accurate at low counts; therefore, calculations at any confidence limit predict that larger differences in spectral counts would be needed to detect significant changes in protein ratio as the minimum spectral count decreases (Fig. 4C). Analysis of standard proteins showed linearity in spectral counts versus amount of protein added. Correlations between observed and expected protein ratios were observed with standards, although observed ratios generally overestimated true ratios by significant amounts. This was primarily due to the discontinuity in log2 ratio that occurred in cases where the number of spectral counts in the sample with lower abundance was close to 0 and could be partially corrected by increasing the value of f used to calculate protein ratios from spectral counts.

Both methods showed good correlations in experiments using standard proteins. In addition, both methods could be validated by independent measurements evaluating protein abundance ratios by gel staining following SDS-PAGE. An important advantage of spectral counting is that peptides in common between datasets are not required for the protein ratio calculations, enabling greater percentages of proteins to be compared, as observed in the experiment comparing gel filtration fractions (Fig. 6C, Sample 2), which was chosen for its high representation of protein changes. By considering a wider range of proteins, where sequence coverage can be very low in one sample, greater sensitivity could be achieved. Such differences were also observed when the number of protein changes was relatively small (as in Sample 3). On the other hand, it was clear from the analysis of standards that the measured protein ratios showed larger error by spectral counting than by peak area intensity measurements. This was particularly true for proteins where one sample had only 0 or 1 spectra and was caused by the discontinuity in ratio measurement mentioned above. Thus, the two methods are complementary with spectral counting measurements yielding greater sensitivity and peak ion intensity measurements yielding greater accuracy in reporting changes in protein abundances. Intuitively ratios measured from spectral counts are most accurate for proteins with large numbers of spectra, and ratios measured from peak area intensities are most accurate for proteins with large numbers of overlapping peptide ions.

One of the limiting factors for quantitation by peak area intensity measurements as presented is the requirement for obtaining assigned MS/MS spectra prior to peak matching. This can reduce the number of comparable proteins if the reacquisition of MS/MS spectra between experiments is low and would be expected to be most problematic for ions with weak intensities. To examine this, we analyzed datasets of replicate RP runs of 16 SCX fractions (Sample 1a). We compared histograms of intensities (not normalized) from peaks whose MS/MS spectra were identified in replicate samples (Fig. 8, blue outline) versus those where replicate MS/MS spectra were not observed (Fig. 8, red outline). All peaks showed signal above background due to the fact that data were prefiltered to only accept well behaved, quantifiable peaks. The results showed that ~ of all peaks identified in any one sample were identified by MS/MS in the other sample. Thus, our estimate of peptides not reacquired due to MS/MS sampling limitations was ~ . Perhaps not surprisingly, the distribution of the unmatched group was skewed toward lower intensities with a mean value ~2-fold less than the matched group. This may limit the sensitivity of the peak area strategy compared with profiling by spectral counting. Nevertheless the criterion for stringency developed in this study, i.e. requiring three or more matched peptide ions per protein, has potentially greater impact on the number of proteins that can be quantified by peak area measurements than the MS/MS reacquisition issue. Furthermore even at this fairly low sampling depth, we can still observe the majority of peptide peaks reproducibly. The percentage of observable peaks should increase with higher sampling depth provided by newer, faster scanning instruments.



View larger version (17K):
[in this window]
[in a new window]
 
FIG. 8. The majority of MS/MS spectra are reacquired in replicate samples. Frequency histograms of MS/MS spectra reacquired or not reacquired between replicate RP-LC/MS/MS runs of 16 different SCX fractions from Sample 1a as a function of peak area intensity are shown. Matched MS/MS spectra identified in replicate samples (blue) are 2-fold more abundant than unmatched MS/MS spectra (red).

 
Peak intensity values useful for protein quantitation ranged from 2 x 107 up to 1011 counts with no obvious saturation effect, indicating that the dynamic range is >1,000-fold. For the Sample 1 dataset, the median peak area intensity was 8 x 108 counts with 95% of peaks above 1 x 108 counts and 90% below 6 x 109 counts (Fig. 2D and data not shown). Thus, with typical shotgun proteomic datasets, protein ratios up to and greater than 60-fold should be observable from peak area intensity measurements.

In comparisons of protein ratios between gel filtration fractions (Fig. 6D, Sample 2), the RPA measurements ranged up to 5.8 (56-fold), whereas RSC measurements ranged up to 7.3 (160-fold) (Fig. 6D). The apparently lower range of ratios detected by peak area measurements reflects the requirement for three or more peptides in common. This selects for proteins with higher coverage that therefore show smaller differences in abundance. In contrast, requiring a maximum of four or more peptides for quantitation by spectral counting selects for proteins with wider dispersion of abundance ratios. This is augmented by examining proteins with RSC or RPA selected for significance with p ≤ 0.05 (Fig. 6, C and D). Log2 protein ratios that were significantly different from 0 as determined by SC but not PA showed wider dispersion and range, whereas ratios determined by PA but not SC showed narrower distributions. This indicates that RSC will sample a larger effective dynamic range than RPA when datasets are filtered by tests of significance.

Performance of both approaches depends strongly on the depth of MS/MS sampling because ratios by spectral counts are most significant for proteins with large numbers of spectra, and ratios by peak area intensity are most significant for proteins with large numbers of overlapping peptide ions. Without high sampling, protein ratios estimated from peak area intensities are limited to abundant proteins with high sequence coverage. Likewise ratios estimated by spectral counting can be significantly suppressed when the minimum spectral count is 0 and the maximum spectral count is limited by sampling due to low protein abundance. The recent availability of more sensitive linear ion traps with higher scan rates promises to increase sampling depth and enhance capabilities for more effective label-free quantification.

In the analysis of Sample 3, we surveyed an average of 435 proteins from each of three conditions in human K562 cells of which 254 were quantifiable by spectral counting (SC ≥ 4) and 62 by peak area intensity measurements (common peptides ≥3) between two conditions. Thus, a relatively low depth of sampling enables ~ of proteins to be compared. Applying spectral counting, we detected 41 (9.4%) proteins changing in abundance between control and PMA treated samples, allowing p ≤ 0.1. By comparison, mRNA datasets revealed ~28% genes showing changes significant with p ≤ 0.1. The correspondence between mRNA and protein changes in our experiments is consistent with our previous findings that MKK/ERK promotes cell differentiation by controlling gene expression (22). Correspondence between mRNA and protein has also been shown to be significant in yeast for proteins within moderate to high abundance expression ranges (11).

In summary, label-free methods for protein quantitation in shotgun datasets offer an alternative approach to stable isotope labeling methods. Peak area intensity and spectral counting methods enable protein ratios significant to ~2.5-fold to be determined with high confidence. This represents lower sensitivity than can be achieved by isotopic labeling where protein ratios significant to ~1.5-fold are typically reported (7, 11, 13, 14). Nevertheless protein changes at or above 2.5-fold are well within the range of greatest interest in experimental comparisons, and the ability to achieve this without stable isotope labeling can be advantageous under conditions where metabolic labeling or chemical derivatization is difficult. To minimize experimental variation, data should be collected in a manner that interleaves samples, although satisfactory results were still obtained with datasets of Samples 2 and 3, each of which were collected over several months and not interleaved. Combining the complementary methods for quantitation by peak area intensities and spectral counting increases the power for detecting protein changes in shotgun experiments.


   FOOTNOTES
 
Received, March 22, 2005, and in revised form, June 22, 2005.

Published, MCP Papers in Press, June 23, 2005, DOI 10.1074/mcp.M500084-MCP200

1 The abbreviations used are: SCX, strong cation exchange; RP, reversed-phase; ApoT, apotransferrin; PMA, phorbol 12-myristate 13-acetate; PA, peak area; SC, spectral count; RPA= log2(peptide or protein ratio) measured from peak area intensities; RSC= log2(protein ratio) measured from spectral counts; XIC, extracted ion chromatograph; ERK, extracellular signal-regulated kinase; MKK, mitogen-activated protein kinase kinase. Back

* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. Back

Present address: Research Triangle Inst., 3040 Cornwallis Rd., Research Triangle Park, NC 27709-2194. Back

|| To whom correspondence should be addressed. Tel.: 303-492-4799; Fax: 303-492-2439; E-mail: natalie.ahn@colorado.edu


    REFERENCES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. McCormack, A. L., Schieltz, D. M., Goode, B., Yang, S., Barnes, G., Drubin, D., and Yates, J. R., III ( 1997) Direct analysis and identification of proteins in mixtures by LC/MS/MS and database searching at the low-femtomole level. Anal. Chem. 69, 767 –776[CrossRef][Medline]

  2. Link, A. J., Eng, J., Schieltz, D. M., Carmack, E., Mize, G. J., Morris, D. R., Garvik, B. M., and Yates, J. R., III ( 1999) Direct analysis of protein complexes using mass spectrometry. Nat. Biotechnol. 17, 676 –682[CrossRef][Medline]

  3. Wolters, D. A., Washburn, M. P., and Yates, J. R., III ( 2001) An automated multidimensional protein identification technology for shotgun proteomics. Anal. Chem. 73, 5683 –5690[CrossRef][Medline]

  4. Yates, J. R., III, Eng, J. K., McCormack, A. L., and Schieltz, D. ( 1995) Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Anal. Chem. 67, 1426 –1436[CrossRef][Medline]

  5. Eng, J. K., McCormack, A. L., and Yates, J. R., III ( 1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5, 976 –989[CrossRef]

  6. Perkins, D. N., Pappin, D. J., Creasy, D. M., and Cottrell, J. S. ( 1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551 –3567[CrossRef][Medline]

  7. Gygi, S. P., Rist, B., Gerber, S. A., Turecek, F., Gelb, M. H., and Aebersold, R. ( 1999) Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat. Biotechnol. 17, 994 –999[CrossRef][Medline]

  8. Zhou, H., Ranish, J. A., Watts, J. D., and Aebersold, R. ( 2002) Quantitative proteome analysis by solid-phase isotope tagging and mass spectrometry. Nat. Biotechnol. 20, 512 –515[CrossRef][Medline]

  9. Yao, X., Freas, A., Ramirez, J., Demirev, P. A., and Fenselau, C. ( 2001) Proteolytic 18O labeling for comparative proteomics: model studies with two serotypes of adenovirus. Anal. Chem. 73, 2836 –2842[CrossRef][Medline]

  10. Tang, K., Page, J. S., and Smith, R. D. ( 2004) Charge competition and the linear dynamic range of detection in electrospray ionization mass spectrometry. J. Am. Soc. Mass Spectrom. 15, 1416 –1423[CrossRef][Medline]

  11. Griffin, T. J., Gygi, S. P., Ideker, T., Rist, B., Eng, J., Hood, L., and Aebersold, R. ( 2002) Complementary profiling of gene expression at the transcriptome and proteome levels in Saccharomyces cerevisiae. Mol. Cell. Proteomics 1, 323 –333[Abstract/Free Full Text]

  12. Molloy, M. P., Donohoe, S., Brzezinski, E. E., Kilby, G. W., Stevenson, T. I., Baker, J. D., Goodlett, D. R., and Gage, D. A. ( 2005) Large-scale evaluation of quantitative reproducibility and proteome coverage using acid cleavable isotope coded affinity tag mass spectrometry for proteomic profiling. Proteomics 5, 1204 –1208[CrossRef][Medline]

  13. Gygi, S. P., Rist, B., Griffin, T. J., Eng, J., and Aebersold, R. ( 2002) Proteome analysis of low-abundance proteins using multidimensional chromatography and isotope-coded affinity tags. J. Proteome Res. 1, 47 –54[CrossRef][Medline]

  14. Griffin, T. J., Lock, C. M., Li, X. J., Patel, A., Chervetsova, I., Lee, H., Wright, M. E., Ranish, J. A., Chen, S. S., and Aebersold, R. ( 2003) Abundance ratio-dependent proteomic analysis by mass spectrometry. Anal. Chem. 75, 867 –874[CrossRef][Medline]

  15. Bondarenko, P. V., Chelius, D., and Shaler, T. A. ( 2002) Identification and relative quantitation of protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid chromatography-tandem mass spectrometry. Anal. Chem. 74, 4741 –4749[CrossRef][Medline]

  16. Chelius, D., and Bondarenko, P. V. ( 2002) Quantitative profiling of proteins in complex mixtures using liquid chromatography and mass spectrometry. J. Proteome Res. 1, 317 –323[CrossRef][Medline]

  17. Wang, W., Zhou, H., Lin, H., Roy, S., Shaler, T., Hill, L., Norton, S., Kumar, P., Anderle, M., and Becker, C. ( 2003) Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. Anal. Chem. 75, 4818 –4826[CrossRef][Medline]

  18. Liu, H., Sadygov, R. G., and Yates, J. R., III ( 2004) A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal. Chem. 76, 4193 –4201[CrossRef][Medline]

  19. Ghaemmaghami, S., Huh, W. K., Bower, K., Howson, R. W., Belle, A., Dephoure, N., O’Shea, E. K., and Weissman, J. S. ( 2003) Global analysis of protein expression in yeast. Nature 425, 737 –741[CrossRef][Medline]

  20. Resing, K. A., Meyer-Arendt, K., Mendoza, A. M., Aveline-Wolf, L. D., Jonscher, K. R., Pierce, K. G., Old, W. M., Cheung, H. T., Russell, S., Wattawa, J. L., Goehle, G. R., Knight, R. D., and Ahn, N. G. ( 2004) Improving reproducibility and sensitivity in identifying human proteins by shotgun proteomics. Anal. Chem. 76, 3556 –3568[CrossRef][Medline]

  21. Whalen, A. M., Galasinski, S. C., Shapiro, P. S., Nahreini, T. S., and Ahn, N. G. ( 1997) Megakaryocytic differentiation induced by constitutive activation of mitogen-activated protein kinase kinase. Mol. Cell. Biol. 17, 1947 –1958[Abstract]

  22. Sevinsky, J. R., Whalen, A. M., and Ahn, N. G. ( 2004) Extracellular signal-regulated kinase induces the megakaryocyte GPIIb/CD41 gene through MafB/Kreisler. Mol. Cell. Biol. 24, 4534 –4545[Abstract/Free Full Text]

  23. Lewis, T. S., Hunt, J. B., Aveline, L. D., Jonscher, K. R., Louie, D. F., Yeh, J. M., Nahreini, T. S., Resing, K. A., and Ahn, N. G. ( 2000) Identification of novel MAP kinase pathway signaling targets by functional proteomics and mass spectrometry. Mol. Cell 6, 1343 –1354[CrossRef][Medline]

  24. Irizarry, R. A., Hobbs, B., Collin, F., Beazer-Barclay, Y. D., Antonellis, K. J., Scherf, U., and Speed, T. P. ( 2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249 –264[Abstract/Free Full Text]

  25. Gentleman, R. C., Carey, V. J., Bates, D. M., Bolstad, B., Dettling, M., Dudoit, S., Ellis, B., Gautier, L., Ge, Y., Gentry, J., Hornik, K., Hothorn, T., Huber, W., Iacus, S., Irizarry, R., Leisch, F., Li, C., Maechler, M., Rossini, A. J., Sawitzki, G., Smith, C., Smyth, G., Tierney, L., Yang, J. Y., and Zhang, J. ( 2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80.1 –R80.16

  26. Wettenhall, J. M., and Smyth, G. K. ( 2004) limmaGUI: a graphical user interface for linear modeling of microarray data. Bioinformatics 20, 3705 –3706[Abstract/Free Full Text]

  27. Yi, E. C., Marelli, M., Lee, H., Purvine, S. O., Aebersold, R., Aitchison, J. D., and Goodlett, D. R. ( 2002) Approaching complete peroxisome characterization by gas-phase fractionation. Electrophoresis 23, 3205 –3216[CrossRef][Medline]

  28. Spahr, C. S., Davis, M. T., McGinley, M. D., Robinson, J. H., Bures, E. J., Beierle, J., Mort, J., Courchesne, P. L., Chen, K., Wahl, R. C., Yu, W., Luethy, R., and Patterson, S. D. ( 2001) Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry. I. Profiling an unfractionated tryptic digest. Proteomics 1, 93 –107[CrossRef][Medline]

  29. Davis, M. T., Spahr, C. S., McGinley, M. D., Robinson, J. H., Bures, E. J., Beierle, J., Mort, J., Yu, W., Luethy, R., and Patterson, S. D. ( 2001) Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry. II. Limitations of complex mixture analyses. Proteomics 1, 108 –117[CrossRef][Medline]

  30. Kersey, P. J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and Apweiler, R. ( 2004) The International Protein Index: an integrated database for proteomics experiments. Proteomics 4, 1985 –1988[CrossRef][Medline]

  31. Yang, X., Dondeti, V., Dezube, R., Maynard, D. M., Geer, L. Y., Epstein, J., Chen, X., Markey, S. P., and Kowalak, J. A. ( 2004) DBParser: web-based software for shotgun proteomic data analyses. J. Proteome Res. 3, 1002 –1008[CrossRef][Medline]

  32. Li, X. J., Zhang, H., Ranish, J. A., and Aebersold, R. ( 2003) Automated statistical analysis of protein abundance ratios from data generated by stable-isotope dilution and tandem mass spectrometry. Anal. Chem. 75, 6648 –6657[CrossRef][Medline]

  33. MacCoss, M. J., Wu, C. C., Liu, H., Sadygov, R., and Yates, J. R., III ( 2003) A correlation algorithm for the automated quantitative analysis of shotgun proteomics data. Anal. Chem. 75, 6912 –6921[CrossRef][Medline]

  34. Rorabacher, D. B. ( 1991) Statistical treatment for rejection of deviant values: critical values of Dixon’s "Q" parameter and related subrange ratios at the 95% confidence level. Anal. Chem. 63, 139 –146[CrossRef]

  35. Tsai, C. A., Chen, Y. J., and Chen, J. J. ( 2003) Testing for differentially expressed genes with microarray data. Nucleic Acids Res. 31, e52[Abstract/Free Full Text]

  36. Sokal, R. R., and Rohlf, F. J. ( 1995) Biometry: the Principles and Practice of Statistics in Biological Research, pp.729 –731, W. H. Freeman, New York

  37. Beissbarth, T., Hyde, L., Smyth, G. K., Job, C., Boon, W.-M., Tan, S.-S., Scott, H. S., and Speed, T. P. ( 2004) Statistical modeling of sequencing errors in SAGE libraries. Bioinformatics 20, i31 –i39[Abstract/Free Full Text]

  38. Anderle, M., Roy, S., Lin, H., Becker, C., and Joho, K. ( 2004) Quantifying reproducibility for differential proteomics: noise analysis for protein liquid chromatography-mass spectrometry of human serum. Bioinformatics 20, 3575 –3582[Abstract/Free Full Text]