1Tsukuba Research Institute, Novartis Pharma KK, Ohkubo 8, Tsukuba 300-2611 and 2Department of Functional Materials Science, Saitama University, Saitama 338-8570, Japan 3Present address: Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, 243 Aomi, Koto-ku, Tokyo 135-0064, Japan
4 To whom correspondence should be addressed. E-mail: yasuhiko.shibanaka{at}pharma.novartis.com
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Keywords: additivity principles/directed evolution/prolyl endopeptidase/thermostability
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Wells proposed that mutational effects on the properties of biopolymers are almost independent and additive (Wells, 1990). Dill called this concept the additivity principle (Dill, 1997
). Many studies, including our own (Aita et al., 2000
, 2001
, 2002
), support these principles (Takeda et al., 1989
; Houghten et al., 1991
; Lowman and Wells, 1993
). Therefore, it is tempting to design mutants that incorporate all known advantageous mutations. However, certain mutations (which are advantageous if they are the only mutation present in a protein) diminish each other's physiochemical effects (the epistatic effect; Kauffman and Macready, 1995
; Aita et al., 2002
). Therefore, it is desirable that many component mutations are assembled but a few combinations of the mutations are removed to avoid their epistatic effect. Here, we report an original directed evolution strategy that biases the creation of mutant libraries to favor the presence of multiple advantageous mutations within a mutant. This strategy, biased mutation-assembling, uses the technical framework of mutation scrambling (Uchiyama, 1999
) and the theoretical one developed previously (Aita and Husimi, 2000
).
A schematic of biased mutation-assembling is depicted in Figure 1. The method consists of the following steps:
|
![]() | (1) |
Prolyl endopeptidase (PEP) (prolyl oligopeptidase, EC 3.4.21.26) is the only endopeptidase known to show substrate specificity for proline and therefore it is expected to be applicable as a catalyst for pharmaceutical production, e.g. coupling reactions of biologically active peptides, such as C-terminal amidation of luteinizing hormone-releasing hormone (LH-RH) (Togame et al., 1994). However, PEP is susceptible to inhibition and inactivation under conditions for coupling reactions, such as the presence of organic solvents and high temperatures (Kreig and Wolf, 1995
). Uchiyama et al. (2000)
suggested that the thermostabilization (prolongation of activity half-life at high temperature) had a beneficial effect on the stability in the coupling reaction. Here, we describe the biased mutation-assembling protocol and the application of the strategy to the prolongation of the activity half-life of PEP from Flavobacterium meningosepticum at high temperature (thermostabilization of PEP) as a case study. We developed a basis set of 14 mutations that individually improved the thermostability of PEP and used these mutations to prepare assembling libraries with r values of 0.3, 0.5, 0.7, 0.8, 0.9 and 1.0. We found that the r value of a library directly correlates with the number of thermostable mutants in the library. After assaying only 2000 mutants, we found one, in the library with an r value of 0.9, that has a 1200-fold longer activity half-life than the wild-type enzyme at 60°C. This mutant contains 12 mutations and lacks an epistatic pair.
![]() |
Materials and methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
We previously constructed the vector pUK-FPEPb to express PEP in Escherichia coli (Uchiyama et al., 2000). This vector contains the PEP gene, which is inserted between EcoRI and PstI restriction sites (Figure 2). The gene was amplified using error-prone polymerase chain reaction (PCR) as described previously (Uchiyama et al., 2000
), with experimental conditions optimized so that only one or at most a few nucleotide substitutions were introduced (Leung et al., 1989
). Amplified DNAs were digested with EcoRI and PstI and then ligated into pUK-FPEPb vector, which was digested with the same enzymes. The vectors were transformed into E.coli JM109 and randomly chosen transformants were screened for PEP thermostability using an active-staining method (Uchiyama et al., 2000
). Mutations that were responsible for the thermostabilization were identified with DNA sequencing.
|
We found 14 single-point mutations, S19T, E67Q, F70L, S110F, N387K, A388V, S475G, I493V, E496K, N542I, K615R, G652V, S653A and Q656R, which individually improved PEP thermostability at 57.5°C. These mutations form the basis set for assembling libraries reported here. Since E67QF70L, N387KA388V, I493VE496K and G652VS653AQ656R are each close in sequence, we constructed libraries by modifying the protocol shown in Figure 1. The PEP gene was divided into six blocks, labeled IVI, and sets of corresponding DNA fragments were labeled in the same manner (Figure 2). Blocks IV (and therefore the corresponding sets of DNA fragments) were designed so that each block had closely located mutations on the overlapping regions and any of the mutations except for those near the midpoint. Figure 2 shows the detailed design of blocks. Incorporating proximal mutations into the primers for amplifying DNA fragments allowed us to control the number of mutations present in each DNA fragment, and also reconstituted genes. For example, fragment I was amplified from both the wild-type and the S19T mutant gene with the forward primer designated EcoRI-396-f and four reverse primers designated 67Q/70L-r, 67Q/70F-r, 67E/70L-r and 67E/70F-r, each of which coded one of four combinations of mutations at residues 67 and 70. This made fragment I form a set of eight (23) DNA fragments individually prepared; one wild-type fragment, three fragments with a single mutation, three fragments with two mutations and one fragment with three mutations, as shown in Table I. Figure 2 also shows the detailed information on the primers, in which the primers are named according to the mutations encoded and the direction, sense or anti-sense; for example, primer 67Q/70L-r is the reverse primer coding both E67Q and F70L mutations.
|
Fluorescent measurement using a Pico Green double-stranded DNA quantitation kit (Molecular Probes) was employed for quantification of DNA fragments.
Preparation of periplasmic lysates containing PEP mutants
In our expression system of PEP using pUK-FPEPb and E.coli JM109, PEP is found not only in the cytoplasmic compartment of E.coli, but also in the periplasmic compartment (Uchiyama et al., 2000). As shown by Beacham (1979)
and Swamy and Goldberg (1982)
, the periplasmic space of E.coli has a limited number of proteases, only seven out of the 25 known cellular proteases, and comprises only 48% of the total cell protein. Also, we have reported the isolation of thermostable PEP mutants using whole cell lysate of E.coli JM109 (Uchiyama et al., 2000
). In addition, we could not find any remarkable decrease in the PEP activity in the periplasmic lysate over 15 min even at 50°C. For these reasons, we considered that periplasmic lysates could be used for the evaluation of PEP thermostability with less proteolytic degradation effect. We prepared periplasmic lysates using a modified method reported by French et al. (1996)
as follows: transformants, which formed colonies on LB agar, were randomly selected and then cultured in 200 µl of LB liquid medium containing 100 µg/ml ampicillin in 96 deep-well culture plates at 37°C for 14 h with shaking at 200 r.p.m. The transformants were harvested by centrifugation at 10 000 g for 5 min and washed with 200 µl of 20 mM potassium phosphate buffer (pH 7.0). The cells were centrifuged again, resuspended in 75 µl of osmotic shock buffer [100 mM TrisHCl (pH 8.0), 0.5 M sucrose, 5 mM EDTA, 0.16 mg/ml lysozyme] and incubated on ice for 2 min. Next, 75 µl of cold distilled water were added and the incubation was continued on ice for 30 min. After centrifugation at 10 000 g for 5 min, the supernatant was used as the periplasmic lysates for evaluation of PEP activity half-life.
Evaluation of PEP mutant thermostability
Volumes of 50 µl of each periplasmic lysate were kept at 57.5 or 60°C for 5, 10 or 15 min, then PEP activity was assayed and, assuming first-order kinetics, a PEP activity half-life (t1/2) was calculated. The screening of mutants was conducted at 57.5°C, at which nearly 90% of the wild-type activity was lost during a 15 min heat treatment. Since highly thermostable mutants showed the great thermostability and showed very long half-lives with a large measurement error at 57.5°C, we evaluated such mutants by adopting higher temperature, 60°C, for precise measurement of their half-lives. For the PEP activity assay, 10 µl of a periplasmic lysate were added to 140 µl of PEP assay buffer [100 mM potassium phosphate (pH 7.0), 100 µg/ml bovine serum albumin, 1 mM dithiothreitol, 0.2 mM Z-Gly-Pro-pNA; Bachem] and the absorbance at 410 nm was monitored at 30°C. The PEP activity half-life is independent of total protein concentration of lysate and the initial lysate PEP activity (data not shown); consequently, it was not necessary to standardize or correct for protein concentration in the assay mixtures.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Initially, using an error-prone PCR technique and active-staining method (Uchiyama et al., 2000), we found 14 mutations that individually improved PEP thermostability, namely S19T, E67Q, F70L, S110F, N387K, A388V, S475G, I493V, E496K, N542I, K615R, G652V, S653A and Q656R. The PEP activity half-lives at 57.5°C for mutants which have one of these mutations are longer than that of the wild-type enzyme (Aita et al., 2002
). These 14 mutations form the basis set for the biased mutation-assembling libraries. As shown in Figure 3, the mutations were mapped to the corresponding positions of the three-dimensional porcine muscle PEP structure (Fülöp et al., 1998
). With the exception of S19T, for which a homologous site does not exist, many of these mutations are spatially located near the N- and C-termini. Mutations near the termini might decrease thermal motion, thereby stabilizing the catalytic domain.
|
Approximately 150300 transformants were randomly picked from each assembling library prepared with individual r values of 0.3, 0.5, 0.7, 0.8 and 0.9. The PEP activity half-lives at 57.5°C for the periplasmic lysates of these transformants were determined. A proportion of transformants that showed markedly low PEP activity ranged from 10 to 20%, with these transformants eliminated from the evaluation. The distribution of the PEP activity half-lives for each library is presented as scatter plots in Figure 4a, in which each individual plot in a column represents a mutant in each assembling library. Figure 4b shows the maximum activity half-lives for the mutants in each library. The half-lives of wild-type PEP and the mutant with all 14 mutations are included in Figure 4a and b. Clearly, a large r value correlates directly with increased production of mutants with greatly improved thermostability. The most thermostable mutant among the 264 samples in the library prepared with r = 0.9 shows an 1600-fold longer activity half-life at 57.5°C than wild-type PEP. However, when r = 1.0, the mutant with all 14 mutations is less thermostable than the most thermostable one and shows at most a 38-fold longer half-life than wild-type PEP. That the mutant containing all 14 mutations does not have a greatly increased thermostability is probably a consequence of the epistatic mutation pair. As the r value increases, the activity half-lives of the majority of mutants converge on approximately that of the mutant with 14 mutations. These results suggest that the mutational effect among the 14 mutations is almost additive, but that there are a few significantly epistatic pairs that cause a diminishing return in PEP thermostability.
|
Figure 4 indicates that a biased library with a large r value should reduce the number of samples that must be assayed when the fittest mutants in binary n-dimensional sequence space are to be found. Since assembling libraries have a binomial distribution of mutant genes, we can estimate the minimum number of assays required to find the mutant with the best mutation combination. The probability, P, that a particular mutant with d point mutations exists among N samples picked randomly from an assembling library with a given r value is
![]() | (2) |
|
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
For biased mutation-assembling, the incorporation frequency for each mutation is stochastically controlled by the mixing fraction (which is determined by the r value) of the mutant DNA fragments present during recombination. Our previously reported mutation scrambling corresponds to the case with an r value of 0.5, not importing the additivity principles. Therefore, the library contains every conceivable combination of advantageous mutations at a stochastically even frequency of occurrence (Uchiyama, 1999). In this study, the library of PEP mutants, for which all mixing fractions were calculated using an r value of 0.9, contained many thermostable PEP mutants and consequently the search for such mutants was significantly reduced. For other methods, such as DNA shuffling, assuming that all mutations are introduced independently in a binary manner, the r value is the reciprocal of the number of mutations, n: r = 1/n. If we had used such a method to find thermostable PEP mutants, the r value would have been, at most, 0.07 (1/14) and the screening size 4.5 x 1014. However, when DNA fragments are prepared by DNase digestion, mutations, especially close in sequence, usually remain linked and recombine together; therefore, the r value would almost certainly have been smaller and the number of assays required larger. Synthetic shuffling (Ness et al., 2002
), which is one of modified DNA shuffling methods, can also produce a biased library, but each mutation was introduced with an approximately equal probability of incorporation. It will be interesting to integrate additivity principles into this method. Additionally, for biased mutation-assembling, the incorporation probability of an individual mutation can be controlled by adjusting the value of the mixing fraction for each fragment containing a given mutation, although, in this study, the same r value was used for all 14 mutations. For example, if a mutation has a large (or a small) beneficial effect on a desired property, the r value in the equation mixing fraction = ri(1 r)mi should be set near 1.0 (or near 0.5). We have used this strategy to improve the substrate specificity of glucose dehydrogenase-B (GDHB) (EC 1.1.5.2) from Acinetobacter calcoaceticus for glucose. This enzyme shows broad substrate specificity that, especially the activity against maltose, greatly diminishes its utility. A set of mutations was identified that decreased the activity against maltose. Some of these mutations also decreased that against glucose and some of the mutations did not. Using an r value of 0.5 for the first subset of mutations and an 0.9 for the second subset, we were able to decrease specifically the activity against maltose by 4% of wild-type GDHB without significantly affecting the activity against glucose (retaining 40% of wild-type GDHB). Consequently, we found what is probably the best combination of mutations resulting in improved specificity for glucose (manuscript in preparation).
Meanwhile, biased mutation-assembling requires a lot of primers and PCR reaction for the preparation of fragments when a lot of mutations are closely located as described here, requiring a lot of work compared with other methods, such as DNA digestion. However, in cases where component mutations are scattered enough to design individual blocks to have only one mutation as shown in Figure 1, biased mutation-assembling is a very simple method like overlap extension PCR.
A biased library is also produced if the wild-type backcross strategy, for which a mutant with all mutations is recombined with wild-type DNA, is used (Zhao and Arnold, 1997). However, as discussed above, mutations, especially close in sequence, are usually linked during recombination, which makes the control of the incorporation probability of individual mutations virtually impossible. Additionally, this strategy requires the advanced preparation of a gene containing all desired mutations. For these reasons, the wild-type backcross strategy is a less efficient way to prepare a biased library than is biased mutation-assembling.
A crucial element for the successful design of biased mutation-assembling libraries is the composition of the mutation basis set. We found the basis set for the PEP assembling library using error-prone PCR. However, when error-prone PCR is the chosen method, codon degeneracy limits the extent to which the amino acid substitution occurs and, therefore, all desirable mutations might not be found. This limitation can be overcome if other methods are used to produce point mutations. For example, Murakami et al. developed a method that introduces mutations into a gene without the limitations imposed by codon degeneracy (Murakami et al., 2002). This powerful method exhaustively identifies single-point mutations that improve the physiochemical properties of a protein. Combining this method with biased mutation-assembling may make directed evolution experiments more efficient.
In our study, an r value of 0.9 most effectively improved the thermostability of PEP; however, in general, the optimal r value depends on how rugged the fitness landscape is for a particular protein (Aita et al., 2000). Previously, we roughly estimated that the optimal r value is between 0.8 and 0.9 for most proteins (Aita and Husimi, 2000
; Aita et al., 2000
, 2001
, 2002
). Additional characterization of protein fitness landscapes should improve the estimate of the statistically optimal r value.
In summary, we have shown that biased mutation-assembling is an efficient optimization technique for improving protein properties, through applying this strategy to the improvement of the thermostability of PEP as a case study. To show that biased mutation-assembling is a generally applicable method, we have produced libraries for several other enzymes and found mutants with improved thermostability and substrate specificity. Additional studies of other proteins will be needed to validate this method further. The statistical and theoretical nature of protein fitness landscapes is the foundation for the concept of biased mutation-assembling. We believe that if the knowledge gained from protein fitness landscape calculations is incorporated into new, rapid protein evolution methods, such methods will be robust and reliable.
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Aita,T., Uchiyama,H., Inaoka,T., Nakajima,M., Kokubo,T. and Husimi,Y. (2000) Biopolymers, 54, 6479.[CrossRef][ISI][Medline]
Aita,T., Iwakura,M. and Husimi,Y. (2001) Protein Eng., 14, 633638.[CrossRef][ISI][Medline]
Aita,T., Hamamatsu,N., Nomiya,Y., Uchiyama,H., Shibanaka,Y. and Husimi,Y. (2002) Biopolymers, 64, 95105.[CrossRef][ISI][Medline]
Beacham,I.R. (1979) Int. J. Biochem., 10, 877883.[CrossRef][ISI][Medline]
Dill,K.A. (1997) J. Biol. Chem., 272, 701704.
French,C., Keshavarz-Moore,E. and Ward,J.M. (1996) Enzyme Microb. Technol., 19, 332338.[CrossRef][ISI]
Fülöp,V., Böcskei,Z. and Polgar,L. (1998) Cell, 94, 161170.[CrossRef][ISI][Medline]
Houghten,R.A., Phinilla,C., Blondelle,S.E., Appel,J.R., Dooley,C.T. and Cuervo,J.H. (1991) Nature, 354, 8486.[CrossRef][ISI][Medline]
Kauffman,S.A. and Macready,W.G. (1995) J. Theor. Biol., 173, 427440.[CrossRef][ISI][Medline]
Kikuchi,M., Ohnishi,K. and Harayama, S. (2000) Gene, 243, 133137.[CrossRef][ISI][Medline]
Kreig,F. and Wolf,N. (1995) Appl. Microbiol. Biotechnol., 42, 844852.[CrossRef][ISI][Medline]
Leung,D.W., Chen,E. and Goeddel,D.V. (1989) Technique J. Methods Cell Mol. Biol., 1, 1115.
Lowman,H.B. and Wells,J.A. (1993) J. Mol. Biol., 234, 564578.[CrossRef][ISI][Medline]
Murakami,H., Hohsaka,T. and Sisido,M. (2002) Nat. Biotechnol., 20, 7681.[CrossRef][ISI][Medline]
Ness,J.E., Kim,S., Gottman,A., Pak,R., Krebber,A., Borchert,T.V., Govindarajan,S., Mundorff,E.C. and Minshull,J. (2002) Nat. Biotechnol., 20, 12511255.[CrossRef][ISI][Medline]
Neylon,C. (2004) Nucleic Acids Res., 32, 14481459.
Pogulis,R.J., Vallejo,A.N. and Pease,L.R. (1996) Methods Mol. Biol., 57, 167176.[Medline]
Stemmer,W.P.C. (1994a) Nature, 370, 389391.[CrossRef][ISI][Medline]
Stemmer,W.P.C. (1994b) Proc. Natl Acad. Sci. USA, 91, 1074710751.
Swamy,K.H.S. and Goldberg,A.L. (1982) J. Bacteriol., 149, 10271033.[ISI][Medline]
Takeda,Y., Sarai,A. and Rivera,V.M. (1989) Proc. Natl Acad. Sci. USA, 86, 439443.
Togame,H., Inaoka,T. and Kokubo,T. (1994) J. Chem. Soc., Chem. Commun., 11071108.
Uchiyama,H. (1999) Bio Industry, 16, 2735 (in Japanese)
Uchiyama,H., Inaoka,T., Ohkuma-Soyejima,T., Togame,H., Shibanaka,Y., Yoshimoto,T. and Kokubo,T. (2000) J. Biochem., 128, 441447.[Abstract]
Wells,J.A. (1990) Biochemistry, 29, 85098517.[CrossRef][ISI][Medline]
Zha,D., Eipper,A. and Reetz,M.T. (2003) Chembiochem, 4, 3439.[CrossRef][ISI][Medline]
Zhao,H. and Arnold,F.H. (1997) Proc. Natl Acad. Sci. USA, 94, 79978000.
Received October 22, 2004; revised March 30, 2005; accepted April 11, 2005.
Edited by Alan Fersht
|