A Comparative Analysis of an Orthologous Proteomic Environment in the Yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe*,S

Assen Roguev{ddagger},§, Anna Shevchenko§, Daniel Schaft{ddagger}, Henrik Thomas, A. Francis Stewart{ddagger},|| and Andrej Shevchenko,||

From {ddagger} Genomics, Technische Universität Dresden, c/o Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; and Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany


    ABSTRACT
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS AND DISCUSSION
 CONCLUSION AND PERSPECTIVES
 REFERENCES
 
The sequential application of protein tagging, affinity purification, and mass spectrometry enables highly accurate charting of proteomic environments by the characterization of stable protein assemblies and the identification of subunits that are shared between two or more protein complexes, termed here "proteomic hyperlinks." We have charted the proteomic environments surrounding the histone methyltransferase, Set1, in both yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe. Although the composition of these nonessential Set1 complexes is remarkably conserved, they differ with respect to their hyperlinks to their proteomic environments. We speculate that conservation of the core components of protein assemblies and variability of hyperlinks represents a general principle in the molecular organization of eukaryotic proteomes.


Advances in genomic sequencing, gene manipulation technology, and mass spectrometry have stimulated efforts to decipher the functional organization of eukaryotic proteomes by systematic isolation and characterization of native protein complexes on a genomic scale (1, 2). Proteins can be tagged and purified, along with their interaction partners, under native conditions by immunoaffinity chromatography followed by their identification and characterization of post-translational modifications by mass spectrometry (reviewed in Ref. 3).

Documented protein-protein interactions and the composition of native protein complexes are very valuable resources, although biological interpretation of this knowledge is not straightforward. So far the concordance of the results obtained by two independent (although similar) protein tagging approaches has been rather poor (1, 2, 4). Furthermore, these data are also in poor concordance with a dataset obtained by two-hybrid screening (5) or inferred via various bioinformatic approaches (6, 7). Although the availability of complementary data is always a positive factor, it seems rather unlikely that observed discrepancies and excessive complexity of protein assemblies could be attributed solely to errors in analytical methods.

Bruce Alberts pictured the cell as a "collection of protein machines" (8). Comparison of the composition and linkages of similar machines in phylogenetically diverged organisms could provide insight into their molecular architecture, regulation, and involvement in various intracellular processes. The fission yeast Schizosaccharomyces pombe is an appropriate organism to validate and extend our understanding of the functional organization of the proteome of Saccharomyces cerevisiae. The tandem affinity purification (TAP)1 procedure (911), which has been successfully employed in purifying protein complexes from the budding yeast, also works in the fission yeast (12, 13). In many (although not in all) cases, bioinformatics can be used either to identify orthologous proteins in the two organisms by homology searches and close inspection of aligned full-length sequences, or to limit the selection to a small number of plausible candidates whose sequences share reasonable percentage of identity and/or display similar functional domains. At the same time, the two organisms are quite distant phylogenetically and have remarkably different physiology (14, 15). The completely sequenced genome of S. pombe (16) provides a valuable resource for facile mass spectrometric identification of isolated proteins.

A combination of the TAP method and "shotgun" mass spectrometric sequencing (17) was applied for comparative characterization of orthologous complexes involved in splicing (18) and cell cycle regulation (19). A remarkable conservation of the composition of orthologous complexes was reported, and a few novel interactors were discovered. However, the shotgun approach lacked aspects of quantification to determine if novel proteins were present in the stoichiometric or substoichiometric amounts, compared with the conserved core subunits. Furthermore, it is not uncommon that proteins are shared between individual protein complexes (4, 11) (we termed such proteins "proteomic hyperlinks"). If a hyperlink protein was inadvertently used as a bait, a mixture of subunits from two or more protein complexes might be isolated. Therefore, phylogenetic interpretation of the differences in identified proteomic environments could be ambiguous. It would be difficult to distinguish if the core of the complex was altered by adding/removing another subunit, or a new interactor was identified, or a novel association represents a yet unknown hyperlink to another individual complex.

In recent years, we and others successfully applied sequential epitope tagging immunoaffinity chromatography and mass spectrometry (SEAM) to characterize protein complexes and segments of protein interaction networks in the budding yeast (4, 11, 20). Although laborious, the approach enabled us to make clear distinction between individual complexes and to identify relevant proteomics hyperlinks. Using the TAP method, we previously isolated and characterized the Sc_Set1C complex (termed after the set1 gene, whose sequence possesses a characteristic SET domain (21)) that methylates lysine 4 in histone H3 and is implicated in epigenetic regulation (22, 23).

Sc_Set1C is comprised of eight subunits, seven of which pulled down the same eight proteins with similar relative stoichiometry upon TAP tagging and immunoaffinity isolation (22). However, the eighth protein, Swd2, was the notable exception. The pool of proteins co-isolated with Swd2 included the members of Sc_Set1C and nine members of another yeast complex termed CPF for cleavage and polyadenylation factor.

Subsequently, two other groups independently confirmed that Swd2 is a bona fide member of the budding yeast CPF, although its association with Sc_Set1C was not reported (1,24). Taken together, these data suggest that Swd2 protein is a subunit of two independent complexes, Sc_Set1C and Sc_CPF, and hyperlinks the histone methylation and polyadenylation machinery in the budding yeast. Although both complexes act at the site of active transcription, the significance of this hyperlink remains elusive.

Here we report the application of a comparative proteomic analysis of the proteomic environments of the orthologous Set1 methyltransferases in the budding and fission yeasts.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS AND DISCUSSION
 CONCLUSION AND PERSPECTIVES
 REFERENCES
 
Epitope Tagging of Genes and Isolation of Protein Complexes—
The strains used in the study were generated using standard techniques and were isogenic to DB325-P41 (h-, ura4-D18, leu1–32). Cells were grown in rich medium supplied when necessary with 150 mg/liter G418. Proteins of interest were tagged by in-frame fusion of the 3'-end of their open reading frames with a PCR-generated targeting cassette encoding for the TAP-tag and a selectable marker flanked by long (usually 80 nt) tracts of homology (9, 10, 25). Gene deletions and TAP-cassette integrations were performed by PCR-mediated gene targeting (25). Correct cassette integrations were confirmed by PCR and Western blot analysis.

The procedure for the purification of protein complexes in S. pombe was essentially the same as employed previously in S. cerevisiae. The breaking and extraction of yeast cells was performed as described by Logie and Peterson (26). TAP purification was performed according to Rigaut et al. (9), with the following modifications: 10 ml supernatant collected after 43,000 rpm centrifugation were allowed to bind to 200 µl IgG Sepharose (Amersham Pharmacia Biotech, Piscataway, NJ), equilibrated in buffer E (27) for 2 h at 4 °C using a disposable chromatography column (Bio-Rad, Hercules, CA). Two to three column volumes (the equivalent of 4–6 liters of yeast culture at optical density at 600 nm between 2 and 3) were used per purification. The IgG Sepharose column was washed with 35 ml of buffer E without proteinase inhibitors, followed by 10 ml of the tobacco etch virus (TEV) cleavage buffer. Cleavage with TEV was performed using 10 µl (100 U) rTEV (Life Technologies, Inc., Grand Island, NY) in 1 ml cleavage buffer for 2 h at 16 °C. Calmodulin Sepharose (Stratagene, La Jolla, CA) purification was performed as described (11). Purified proteins were concentrated according to Wessel and Fluge (27).

Analysis H3-K4 Methylation in Sp_Swd2.1 and Sp_Swd2.2 Deletion Mutants—
Crude cell extracts from exponentially growing cells were prepared by glass bead lysis. Proteins were separated on 15% SDS polyacrylamide gel and blotted onto nitrocellulose membrane following the manufacturer’s instructions. For detecting lysine 4 (K4) methylated histone H3, an antibody recognizing di- and trimethylated forms of K4 was used (Abcam, Cambridge, UK). Secondary antibody was anti-rabbit IgG-HRP conjugate (Amersham Biosciences, Piscataway, NJ). The signals were visualized using the enhanced chemiluminescence system (Amersham Biosciences).

Identification of Proteins by Mass Spectrometry—
Proteins were separated by electrophoresis using gradient (6–18%) one-dimensional polyacrylamide gels and visualized by staining with Coomassie. Protein bands were excised and in-gel digested with trypsin (Roche Diagnostics, Indianapolis, IN) as described previously (28). Proteins were identified by a combination of matrix-assisted laser desorption/ionization mass spectrometry (MALDI MS) peptide mapping and nanoelectrospray tandem mass spectrometry (nanoES MS/MS) sequencing as described (29). Briefly, 1-µl aliquots were withdrawn from the in-gel digests and analyzed on a REFLEX IV mass spectrometer (Bruker Daltonics, Billerica, MA) on AnchorChip targets (Bruker Daltonics) as described (30). If no conclusive identification was achieved, gel pieces were extracted with 5% formic acid and acetonitrile. Unseparated mixtures of recovered tryptic peptides were sequenced by nanoES MS/MS on a QSTAR Pulsar i quadrupole time-of-flight mass spectrometer (MDS Sciex, Concord, Canada). Database searching with MALDI time-of-flight peptide mass maps and with uninterpreted tandem mass spectra was performed against a database of S. pombe proteins using Mascot software (Matrix Science Ltd., London, UK) installed on a local server. Hits with the MOWSE score exceeding 51 (the threshold score suggested by Mascot) were considered significant, but were accepted only upon manual inspection. Borderline hits were additionally verified by nanoES MS/MS.

The list of proteins identified in each pulldown experiment with relevant identification details is presented in the supplemental material.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS AND DISCUSSION
 CONCLUSION AND PERSPECTIVES
 REFERENCES
 
Protein Tagging, Affinity Purification, and Mass Spectrometric Analysis of Complexes in S. pombe—
In total, in the course of this and other projects performed in our laboratories, we attempted to tag 23 genes from the fission yeast S. pombe. The molecular mass of bait proteins was in a broad range from 12 to 129 kDa and pI from 4 to 11. Their codon adaptation index, which represents a relative level of protein expression, was from 0.177 to 0.324, hence suggesting that the selected proteins are lowly abundant. Two out of the 23 genes encoding for tagged proteins are essential. The TAP tag was successfully fused to the C terminus of 22 proteins, and 20 tagged proteins (86%) were subsequently detected by Western blot. Two bait proteins from these 20 did not pull down any detectable interaction partners, although baits were visualized as intense Coomassie stainable bands. Each of the other 18 baits pulled down 2–12 interaction partners. Here we only present the identification of proteins interacting with 12 out of 18 baits that encompass the genes related to S. pombe Set1 and CPF complexes (see supplemental material). Taken together, data from this and other projects suggest that the success rate of protein tagging and isolation of complexes in S. pombe and in S. cerevisiae (4) is similar.

In the budding yeast, TAP of protein complexes was variably accompanied by a co-isolation of a common set of highly abundant background proteins, including housekeeping proteins, metabolic enzymes, and ribosomal proteins (4). We found that persistent background proteins in S. pombe were different from S. cerevisiae, although, once again, highly abundant proteins, including housekeeping proteins and components of the protein synthesis machinery, were mostly observed (Table I).


View this table:
[in this window]
[in a new window]
 
TABLE I Background proteins in the TAP method in S. pombe

 
Sequential rounds of eptitope tagging and immunoaffinity isolation were applied to characterize protein complexes from both S. cerevisiae (22) and S. pombe (13). Basically, the bait protein was TAP tagged, integration of the tag and expression of the tagged protein was checked by PCR and Western blot, and the complex was purified from the whole-cell lysate by the two-step affinity chromatography. Then, proteins were separated by gradient SDS-PAGE and visualized by Coomassie staining. All bands detected in the gel (without using mock patterns as controls) were excised and identified by mass spectrometry (Fig. 1). Proteins repeatedly detected in different purifications are listed in Table I and were disregarded as background. The identified proteins that were considered as plausible members of the protein complex were in turn tagged and the purification was repeated. Again, all detectable bands were analyzed by mass spectrometry. No identification has been based on the similarity of gel patterns.



View larger version (22K):
[in this window]
[in a new window]
 
FIG. 1. TAP purification of tagged fission yeast homologues of Sc_Swd2. A, Purification of TAP-Sp_Swd2.1; B, TAP-Sp_Swd2.2. All Coomassie detectable bands (52 bands in A (indicated by dots) and 58 bands in B) were identified by mass spectrometry. Sp_Swd2.1 pulled down six members of Sp_Set1 complex, whereas 10 members of CPF complex were identified in association with Sp_Swd2.2 (designated at the panels). A complete list of identified proteins is provided in the supplemental material.

 
The sequential approach was rather laborious: altogether, for three protein complexes characterized in S. pombe (Sp_Set1C, Sp_Lid2C, and Sp_CPF), 12 baits were TAP tagged and mass spectrometric identification of 681 bands of co-isolated proteins was performed (Table II), so that each complex was independently purified several times using different subunits as baits (see supplemental material).


View this table:
[in this window]
[in a new window]
 
TABLE II Composition of protein complexes from S. cerevisiae and S. pombe

 
Although shotgun methods (17) provide higher throughput compared with band-per-band identification of gel-separated proteins, we see two major arguments in favor of using gels. First, visual inspection of Coomassie stained gel-separated proteins accompanied by densitometry scanning provides semiquantitative estimations of the stoichiometry between the subunits and indicates those proteins whose interaction with the core complex may be only transient. Second, gel patterns and spectra of individual bands could be compared directly, thus confidently stipulating the presence and, equally important, the absence of a particular protein in the preparation.

Orthologous Complexes Sc_Set1C and Sp_Set1C—
The genome of S. pombe encodes for the orthologous protein Sp_Set1, which shares 29% of the full-length sequence identity with its S. cerevisiae homologue. We further characterized the organization of the Set1-anchored proteomic environment in S. pombe (Fig. 2, Table II, and supplemental material). Similar to Sc_Set1C, Sp_Set1C also comprises eight subunits that (with the exception of Sp_Shg1) were the most close homologues of the corresponding members of Sc_Set1C. Sp_Set1C includes the trxG (trithorax group) homologue Sp_Ash2. Unlike its S. cerevisiae homologue Sc_Bre2, Sp_Ash2 possesses a PHD-finger domain, which is a hallmark motif of proteins implicated in chromatin regulation (31). Also, Sc_Shg1 is only distantly similar to Sp_Shg1 (10). Both proteins share a short region of similarity at their N termini and have similar elements of the predicted secondary structure (data not shown). Sequence similarity, although weak, was recognizable by conventional and back-BLAST searches using relaxed substitution matrices (such as, PAM30 and PAM70) as well as by Smith-Waterman searches. Having characterized Sp_Set1C, we established the highly conserved nature of this complex in S. cerevisiae and S. pombe, which could not be deduced by simple extrapolation from the known composition of Sc_Set1C. In particular, the biochemically identified Sp_Set1C subunits Shg1, Ash2, Swd2.1, and Swd3 could not be confidently predicted as Sp_Set1C members by reliance on bioinformatics.



View larger version (32K):
[in this window]
[in a new window]
 
FIG. 2. Proteomic environment of Set1 proteins in S. cerevisiae (A) and S. pombe (B). Individual complexes are circled. Successfully TAP-purified proteins are indicated with asterisks. Sc_Set1C comprises eight subunits, including Sc_Swd2 identified as a hyperlink to Sc_CPF. Sp_Set1C also comprises eight subunits. Sp_Ash2 and Sp_Sdc1 are hyperlinks between Sp_Set1C and a new protein complex termed Sp_Lid2C.

 
Characterization of Sp_Set1C revealed a novel hyperlink, which was not observed in S. cerevisiae (Fig. 2). Along with the members of Sp_Set1C, both Sp_Ash2 and Sp_Sdc1 pulled down yet another set of interacting proteins consisting of Sp_Jmj3 and two PHD-finger proteins, Sp_Snt2 and Sp_Lid2 (Table II). The latter protein also belongs to trxG protein family (32). None of these three proteins were co-isolated when the other Sp_Set1C subunits were TAP tagged. Another round of protein tagging and immunoaffinity isolation was performed using Sp_Lid2 and Sp_Snt2 as baits and established the presence of a new complex termed Sp_Lid2C (Fig. 2).

A Hyperlink Between Set1C and CPF Complexes in S. cerevisiae and S. pombe—
In S. cerevisiae, Swd2 was identified as a hyperlink between Sc_Set1C and Sc_CPF (Fig. 2). The genome of S. pombe encodes for two relatively distant homologues of Sc_Swd2, namely SPBC18H10.06c (now termed Sp_Swd2.1) and SPAC824.04 (now termed Sp_Swd2.2), sharing 34 and 32% of sequence identity with Sc_Swd2, respectively and sharing 30% of sequence identity between themselves. By tagging Sp_Set1C members, we established that Sp_Swd2.1 is a member of Sp_Set1C (Fig. 2). However, tagging Sp_Swd2.1 (Fig. 1A) or subunits of Sp_CPF (see supplemental materials) did not provide any evidence that it interacts with the Sp_CPF complex.

We therefore asked if the fission yeast CPF complex includes the other Swd2 homologue, Sp_Swd2.2, and if it might represent a yet undetected hyperlink between Sp_Set1C and Sp_CPF. To elucidate the proteomic environment of Sp_Swd2.2, the protein was tagged and used as a bait in the immunoaffinity purification. It pulled down the Sp_CPF complex, and, importantly, no members of Sp_Set1C complex were detected (Fig. 3). To confirm that Sp_Swd2.2 is a core subunit of the Sp_CPF complex, we further tagged its conserved member Sp_Ysh1. Sp_Ysh1 was chosen because it is a key component of Sc_CPF, and its sequence is remarkably conserved between S. cerevisiae and S. pombe with no other clear homologues shared between these two genomes. Using Sp_Ysh1-TAP as a bait, we pulled down the same subunits of CPF, including Sp_Swd2.2, but no Sp_Swd2.1 was observed in a detectable amount. Taken together, these data suggested Sp_Swd2.2, but not Sp_Swd2.1, is a genuine member of Sp_CPF (Fig. 3).



View larger version (27K):
[in this window]
[in a new window]
 
FIG. 3. Interaction partners of Swd2 in S. cerevisiae (A) and its paralogues in S. pombe (B). Sc_Swd2 hyperlinks Sc_SetC1 and Sc_CPF complexes. In S. pombe, the function of the Swd2 homologues is fully diverged, with Sp_Swd2.1 becoming a member of Sp_Set1C and Sp_Swd2.2 becoming a member of Sp_CPF.

 
Thus we demonstrated that although the composition of Set1C is conserved between S. cerevisiae and S. pombe, the hyperlinks of the Set1C complex are different in two ways. First, the fission yeast Sp_Set1C is hyperlinked to another complex, Sp_Lid2C, which is not present in the budding yeast (see below). Second, Set1C and CPF are no longer hyperlinked via Swd2-homologous proteins in S. pombe.

The Function of Swd2 Paralogues Is Completely Diverged in S. pombe—
We further investigated if Sp_Swd2.1 and Sp_Swd2.2 can substitute each other in Sp_Set1C and Sp_CPF complexes. To this end, we constructed a strain in which the Sp_Swd2.2 gene was deleted and Sp_Swd2.1 was tagged. The mutant strain grew slower compared with the wild-type strain and other strains with TAP-tagged proteins. Although much less protein material was purified, in a pull down experiment with Sp_Swd2.1 we were able to identify all subunits of Sp_Set1C except Sp_Sdc1, which produces only one peptide upon its digestion with trypsin and was undetectable in a heavy mixture with low molecular mass background proteins. At the same time, no subunits of Sp_CPF were detected (Fig. 4), suggesting that Sp_Swd2.1 cannot substitute for Sp_Swd2.2 in Sp_CPF.



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 4. A, Purification of tagged Sp_Swd2.1 from {Delta}Sp_Swd2.2 strain. Only members of Sp_Set1C were identified. No members of Sp_CPF were detectable. B, Analysis of protein extract from {Delta}Sp_Swd2.2 strain with antibodies recognizing trimethylated K4 in histone H3. Extracts from wild type and {Delta}Sp_Swd2.1 were used as positive and negative controls.

 
As we demonstrated previously (13), Sp_Swd2.1 is essential for methylation of the K4 residue of histone H3. To test if Sp_Swd2.2 could also be involved in the H3K4 methylation, we created {Delta}Sp_Swd2.1 and {Delta}Sp_Swd2.2 deletion strains. Protein extracts from the deletion strains and from wild type were probed by immunoblotting with an antibody recognizing trimethylated K4 of histone H3. As expected, no H3K4 methylation occurred in {Delta}Sp_Swd2.1. However, no noticeable reduction of H3K4 methylation was observed in {Delta}Sp_Swd2.2 compared with wild type (Fig. 4).

We therefore concluded that in the fission yeast, duplicated swd2 genes are functionally specialized (33, 34), with Sp_Swd2.1 protein being a member of Sp_Set1C and Sp_Swd2.2 a member of Sp_CPF (Fig. 3).

A Protein Complex in S. cerevisiae that Is Orthologous to Sp_Lid2C—
As was demonstrated above, Sp_Ash2 and Sp_Sdc1 hyperlink Sp_Set1C and Sp_Lid2C complexes (Fig. 2). However, no such link to a complex similar to Sp_Lid2C was observed in S. cerevisiae, although both S. pombe proteins share significant sequence identity with their closest homologues in S. cerevisiae (Table II). We therefore attempted to isolate a protein complex orthologous to Sp_Lid2C from the budding yeast and to determine if it is hyperlinked to Sc_Set1C.

The genome of S. cerevisiae encodes for proteins YJR119c and Snt2 (YGL131c), which share 23 and 22% of full-length sequence identity with Sp_Lid2 and Sp_Snt2, respectively, and display a very similar composition of functional domains. We tagged Sc_Snt2 and YJR119C and isolated their interaction partners, but no proteins homologous to members of Sp_Lid2C were identified (data not shown).

Taken together, the data suggested that no protein complex orthologous to Sp_Lid2C exists in S. cerevisiae, despite the presence of a few reasonable sequence homologues in its genome. We speculate that Sp_Lid2C is possibly involved in H3K9 methylation, which does not occur in S. cerevisiae but is found in S. pombe and in humans (35, 36).

Although genes encoding for subunits of Sc_Set1C complex (other than Swd2) are nonessential, and certain members of Sc_Set1C and Sp_Set1C display only marginal sequence similarity (Table II), the overall composition of these complexes is well conserved. However, orthologous Sp_Set1C and Sc_Set1C complexes function in a differently "wired" proteomic network that comprises conserved (Sc_CPF and Sp_CPF) and nonconserved (Sp_Lid2C and Sc_Snt2C) protein assemblies.

Conserved Composition and Variable Hyperlinks of Orthologous Complexes—
Although Set1 complexes in S. cerevisiae and S. pombe are highly conserved, overall their proteomic environment differs substantially. Similar observations were previously made for many orthologous protein complexes characterized in different species (reviewed in Ref. 37).

Our data underscore the value and importance of maximal possible characterizations of the compositions of protein complexes, especially when considering a phylogenetic perspective. Using two entry points, Sc_Swd1 and Sc_Swd3 proteins, Gavin et al. (1) identified a complex (termed complex #108) with a very similar composition to Sc_Set1C. The complex #108 missed two subunits (Swd2 and Shg1) and, consequently, a hyperlink to Sc_CPF complex via Sc_Swd2. At the same time, complex #108 comprised three other proteins (yeast.cellzome.com), whose relation to Sc_Set1C was not independently confirmed. Gavin et al. also tagged seven out of 20 known subunits of Sc_CPF and detected Sc_ Swd2 in all affinity purifications (1). However, Sc_Swd2 itself was not tagged, and its relation to Sc_Set1C was not established. Missed interactions or artificially merged individual protein complexes hampered further comparison of Set1 proteomic environments and reasonable projection of their organization and function to mammalians.

The human genome encodes for two Set1-related genes: KIAA0339 and KIAA1076, sharing 35 and 37% identity to Sc_Set1 and 55 and 45% identity to Sp_Set1. KIAA0339 is engaged in a partially characterized complex comprising at least hAsh2 and WDR5 (a human homologue of Sc_Swd3) (38). This putative Hs_Set1C is also involved in H3K4 methylation (38). The entry point to Hs_Set1C, protein HCF-1, has no apparent homology to any of the core members of Sp_Set1C or Sc_Set1C but is associated with human Sin3 histone deacetylase (HDAC). We therefore speculate that the partial purification of Hs_Set1C was achieved via a hyperlink protein.

Taken together, our data and other published evidence strongly suggest that although orthologous protein complexes may be remarkably conserved, their proteomic environment and hyperlinks to other complexes are not. Furthermore, we propose that the conservation of the core and variability of links represents a common phenomenon in the molecular organization of eukaryotic proteomes.


    CONCLUSION AND PERSPECTIVES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS AND DISCUSSION
 CONCLUSION AND PERSPECTIVES
 REFERENCES
 
Understanding the phylogentic variability of proteomic hyperlinks is important for satisfactory extrapolations of global protein-protein interaction patterns, determined in model low-eukaryotic organisms, to higher eukaryotes including humans. To this end, boundaries of individual complexes and their hyperlinks should be accurately determined. The quality of data, the reliability and reproducibility of the discovered protein-protein interactions, as well as the scale and the character of the analysis are all factors important to the merits of proteomics extrapolations.

The comparative analysis of proteomic environments in a multiorganismal perspective offers an intriguing opportunity to extend and complement our understanding of how the evolution of genomes guides the evolution of protein machines. Comparative studies may reach far beyond simple cataloguing of observed differences. Rather, together with advanced bioinformatic approaches, correlations of concerted alterations in sequences of orthologous subunits could highlight functional specializations.

The multiorganismal approach in functional proteomics will likely require biochemical isolation of complexes and identification of cognate proteins beyond the boundaries of known genomes. Although lacking exact protein sequences in a database hampers the identification of proteins, a substantial coverage of yet unknown proteomes might be achieved by sequence-similarity searches (reviewed in Ref. 37). The bottleneck (and the likely focus of further efforts) is in the development of a generic approach for isolating protein complexes from cells and tissues of vertebrate organisms, which might be overcome by advanced genetic engineering methods in the future (39).


    FOOTNOTES
 
Received, August 21, 2003, and in revised form, November 6, 2003.

Published, MCP Papers in Press, November 15, 2003, DOI 10.1074/mcp.M300081-MCP200

This paper is available on line at http://www.mcponline.org

1 The abbreviations used are: TAP, tandem affinity purification; MALDI MS, matrix-assisted laser desorption/ionization mass spectrometry; nanoES MS/MS, nanoelectrospray tandem mass spectrometry; SEAM, sequential rounds of epitope tagging, immunoaffinity chromatography, and mass spectrometry; TEV, tobacco etch virus; Sc_XXX and Sp_XXX, protein XXX from S. cerevisiae and S. pombe, respectively; Sc_XXXC and Sp_XXXC, protein complex XXX from S. cerevisiae and S. pombe, respectively; CPF, cleavage and polyadenylation factor. Back

* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

S The on-line version of this article (available at http://www.mcponline.org/) contains supplemental material. Back

§ A. R. and A. S. contributed equally to this work. Back

|| To whom correspondence should be addressed. E-mail: shevchenko{at}mpi-cbg.de and stewart{at}mpi-cbg.de


    REFERENCES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS AND DISCUSSION
 CONCLUSION AND PERSPECTIVES
 REFERENCES
 

  1. Gavin, A. C., Bosche, M., Krause, R., Grandi, P., Marzioch, M., Bauer, A., Schultz, J., Rick, J. M., Michon, A. M., Cruciat, C. M., Remor, M., Hofert, C., Schelder, M., Brajenovic, M., Ruffner, H., et al. (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141 –147[CrossRef][Medline]

  2. Ho, Y., Gruhler, A., Heilbut, A., Bader, G. D., Moore, L., Adams, S. L., Millar, A., Taylor, P., Bennett, K., Boutilier, K., Yang, L., Wolting, C., Donaldson, I., Schandorff, S., Shewnarane, J., et al. (2002) Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 415, 180 –183[CrossRef][Medline]

  3. Seraphin, B. (2002) Identification of transiently interacting proteins and of stable protein complexes. Adv. Protein Chem. 61, 99 –117[Medline]

  4. Shevchenko, A., Schaft, D., Roguev, A., Pijnappel, W. W. M. P., Stewart, A. F., and Shevchenko, A. (2002) Deciphering protein complexes and protein interaction networks by tandem affinity purification and mass spectrometry: Analytical perspective. Mol. Cell. Proteomics 1, 204 –212[Abstract/Free Full Text]

  5. Ito, T., Ota, K., Kubota, H., Yamaguchi, Y., Chiba, T., Sakuraba, K., and Yoshida, M. (2002) Roles for the two-hybrid system in exploration of the yeast protein interactome. Mol. Cell. Proteomics 1, 561 –566[Abstract/Free Full Text]

  6. Von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S. G., Fields, S., and Bork, P. (2002) Comparative assessment of large-scale data sets of protein protein interactions. Nature 417, 399 –403[CrossRef][Medline]

  7. Kemmeren, P., van Berkum, N. L., Vilo, J., Bijma, T., Donders, R., Brazma, A., and Holstege, F. C. (2002) Protein interaction verification and functional annotation by integrated analysis of genome-scale data. Mol. Cell 9, 1133 –1143[Medline]

  8. Alberts, B. (1998) The cell as a collection of protein machines: Preparing the next generation of molecular biologists. Cell 92, 291 –294[Medline]

  9. Rigaut, G., Shevchenko, A., Rutz, B., Wilm, M., Mann, M., and Seraphin, B. (1999) A generic protein purification method for protein complex characterization and proteome exploration. Nat. Biotechnol. 17, 1030 –1032[CrossRef][Medline]

  10. Puig, O., Caspary, F., Rigaut, G., Rutz, B., Bouveret, E., Bragado-Nilsson, E., Wilm, M., and Seraphin, B. (2001) The tandem affinity purification (TAP) method: A general procedure of protein complex purification. Methods 24, 218 –229[CrossRef][Medline]

  11. Pijnappel, W. W., Schaft, D., Roguev, A., Shevchenko, A., Tekotte, H., Wilm, M., Rigaut, G., Seraphin, B., Aasland, R., and Stewart, A. F. (2001) The S. cerevisiae SET3 complex includes two histone deacetylases, Hos2 and Hst1, and is a meiotic-specific repressor of the sporulation gene program. Genes Dev. 15, 2991 –3004[Abstract/Free Full Text]

  12. Tasto, J. J., Carnahan, R. H., McDonald, W. H., and Gould, K. L. (2001) Vectors and gene targeting modules for tandem affinity purification in Schizosaccharomyces pombe. Yeast 18, 657 –662[CrossRef][Medline]

  13. Roguev, A., Schaft, D., Shevchenko, A., Aasland, R., and Stewart, A. F. (2003) High conservation of the Set1/Rad6 axis of histone 3 lysine 4 methylation in budding and fission yeasts. J. Biol. Chem. 278, 8487 –8493[Abstract/Free Full Text]

  14. Sipiczki, M. (2000) Where does fission yeast sit on the tree of life? Genome Biol. 1, REVIEWS1011

  15. Eisen, J. A. (2002) Brouhaha over the other yeast. Nature 415, 845 –848[CrossRef][Medline]

  16. Wood, V., Gwilliam, R., Rajandream, M. A., Lyne, M., Lyne, R., Stewart, A., Sgouros, J., Peat, N., Hayles, J., Baker, S., Basham, D., Bowman, S., Brooks, K., Brown, D., Brown, S., et al. (2002) The genome sequence of Schizosaccharomyces pombe. Nature 415, 871 –880[CrossRef][Medline]

  17. Link, A. J., Eng, J., Schieltz, D. M., Carmack, E., Mize, G. J., Morris, D. R., Garvik, B. B., and Yates, J. R. 3rd. (1999) Direct analysis of protein complexes using mass spectrometry. Nat. Biotechnol. 17, 676 –682[CrossRef][Medline]

  18. Ohi, M. D., Link, A. J., Ren, L., Jennings, J. L., McDonald, W. H., and Gould, K. L. (2002) Proteomics analysis reveals stable multiprotein complexes in both fission and budding yeasts containing Myb-related Cdc5p/Cef1p, novel pre-mRNA splicing factors, and snRNAs. Mol. Cell Biol. 22, 2011 –2024[Abstract/Free Full Text]

  19. Yoon, H. J., Feoktistova, A., Wolfe, B. A., Jennings, J. L., Link, A. J., and Gould, K. L. (2002) Proteomics analysis identifies new components of the fission and budding yeast anaphase-promoting complexes. Curr. Biol. 12, 2048 –2054[CrossRef][Medline]

  20. Seol, J. H., Shevchenko, A., Shevchenko, A., and Deshaies, R. J. (2001) Skp1 forms multiple protein complexes, including RAVE, a regulator of V-ATPase assembly. Nat. Cell Biol. 3, 384 –391[CrossRef][Medline]

  21. Nislow, C., Ray, E., and Pillus, L. (1997) SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes. Mol. Biol. Cell 8, 2421 –2436[Abstract/Free Full Text]

  22. Roguev, A., Schaft, D., Shevchenko, A., Pijnappel, W. W. M., Wilm, M., Aasland, R., and Stewart, A. F. (2001) The S. cerevisiae Set1 complex includes an Ash2 homolog and methylates histone 3 lysine 4. EMBO J. 20, 7137 –7148[Abstract/Free Full Text]

  23. Krogan, N. J., Dover, J., Khorrami, S., Greenblatt, J. F., Schneider, J., Johnston, M., and Shilatifard, A. (2002) COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression. J. Biol. Chem. 277, 10753 –10755[Abstract/Free Full Text]

  24. Dichtl, B., Blank, D., Ohnacker, M., Friedlein, A., Roeder, D., Langen, H., and Keller, W. (2002) A role for SSU72 in balancing RNA polymerase II transcription elongation and termination. Mol. Cell 10, 1139 –1150[Medline]

  25. Krawchuk, M. D., and Wahls, W. P. (1999) High-efficiency gene targeting in Schizosaccharomyces pombe using a modular, PCR-based approach with long tracts of flanking homology. Yeast 15, 1419 –1427[CrossRef][Medline]

  26. Logie, C., and Peterson, C. L. (1999) Purification and biochemical properties of yeast SWI/SNF complex. Methods Enzymol. 304, 726 –741[Medline]

  27. Wessel, D., and Flugge, U. I. (1984) A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 138, 141 –143[Medline]

  28. Shevchenko, A., Wilm, M., Vorm, O., and Mann, M. (1996) Mass spectrometric sequencing of proteins from silver-stained polyacrylamide gels. Anal. Chem. 68, 850 –858[CrossRef][Medline]

  29. Shevchenko, A., Jensen, O. N., Podtelejnikov, A. V., Sagliocco, F., Wilm, M., Vorm, O., Mortensen, P., Shevchenko, A., Boucherie, H., and Mann, M. (1996) Linking genome and proteome by mass spectrometry: Large-scale identification of yeast proteins from two dimensional gels. Proc. Natl. Acad. Sci. U. S. A. 93, 14440 –14445[Abstract/Free Full Text]

  30. Havlis, J., Thomas, H., Sebela, M., and Shevchenko, A. (2003) Fast response proteomics by accelerated in-gel digestion of proteins. Anal. Chem. 75, 1300 –1306[CrossRef][Medline]

  31. Aasland, R., Gibson, T. J., and Stewart, A. F. (1995) The PHD finger: Implications for chromatin-mediated transcriptional regulation. Trends Biochem. Sci. 20, 56 –59[CrossRef][Medline]

  32. Gildea, J. J., Lopez, R., and Shearn, A. (2000) A screen for new trithorax group genes identified little imaginal discs, the Drosophila melanogaster homologue of human retinoblastoma binding protein 2. Genetics 156, 645 –663[Abstract/Free Full Text]

  33. Hughes, A. L. (1994) The evolution of functionally novel proteins after gene duplication. Proc. R Soc. Lond. B Biol. Sci. 256, 119 –124[Medline]

  34. Lynch, M., and Conery, J. S. (2000) The evolutionary fate and consequences of duplicate genes. Science 290, 1151 –1155[Abstract/Free Full Text]

  35. Briggs, S. D., Bryk, M., Strahl, B. D., Cheung, W. L., Davie, J. K., Dent, S. Y., Winston, F., and Allis, C. D. (2001) Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae. Genes Dev. 15, 3286 –3295[Abstract/Free Full Text]

  36. Noma, K., Allis, C. D., and Grewal, S. I. (2001) Transitions in distinct histone H3 methylation patterns at the heterochromatin domain boundaries. Science 293, 1150 –1155[Abstract/Free Full Text]

  37. Liska, A., and Shevchenko, A. (2002) Expanding the organismal scope of proteomics: cross-species protein identification by mass spectrometry and its implications. Proteomics 3, 19 –28[CrossRef]

  38. Wysocka, J., Myers, M. P., Laherty, C. D., Eisenman, R. N., and Herr, W. (2003) Human Sin3 deacetylase and trithorax-related Set1/Ash2 histone H3-K4 methyltransferase are tethered together selectively by the cell-proliferation factor HCF-1. Genes Dev. 17, 896 –911[Abstract/Free Full Text]

  39. Testa, G., Zhang, Y., Vintersten, K., Benes, V., Pijnappel, W. W., Chambers, I., Smith, A. J., Smith, A. G., and Steward, A. F. (2003) Engineering the mouse genome with bacterial artificial chromosomes to create multipurpose alleles. Nat. Biotechnol. 21, 443 –447[CrossRef][Medline]