Xem mẫu

Eur. J. Biochem. 271, 1623–1629 (2004) Ó FEBS 2004 doi:10.1111/j.1432-1033.2004.04076.x MINIREVIEW Phage-display as a tool for quantifying protein stability determinants Joanne D. Kotz1, Christopher J. Bond2 and Andrea G. Cochran1 1Department of Protein Engineering and 2Medicinal Chemistry, Genentech, Inc., South San Francisco, CA, USA To address questions of protein stability, researchers have increasinglyturnedtocombinatorialapproachesthatpermit the rapid analysis of libraries of protein variants. Phage-displayhasprovedtobeapowerfultoolforanalyzingprotein stabilityduetothelargelibrarysizeandtherobustnessofthe phageparticletoavarietyofdenaturingconditions.Withthe B1 domain of protein G (GB1) and a camelid heavy chain antibody as model systems, we are using phage-display lib-rariestoexperimentallyaddressquestionsthathavegenerally been addressed in silico, either through computational stud-ies or statistical analysis of known protein structures. One efforthasfocusedonidentifyingnovelsolutionstorepacking the hydrophobic core of GB1, while maintaining stability comparable to the wild type protein. In a second study, a small set of substitutions in complimentarity-determining Introduction Understanding determinants of protein stability is critical both for predicting the tertiary structure of a protein from an amino acid sequence, as well as for protein design. Rather than characterizing individual proteins with single mutations, or defined combinations of mutations, research-ers have increasingly been using selection and screening methods to investigate protein stability. In comparison to the labor-intensive process of generating and characterizing individual mutant proteins, these combinatorial approaches offer the important advantage of simultaneously generating libraries of protein variants, thus allowing a much larger number of mutations to be investigated. However, inter-preting the results fromcombinatorial experiments is not as straightforward as characterizing individual proteins. Con-sequently, results must be carefully assessed in light of the librarydesignandselectionpressureapplied.Eachscreening or selectionmethod, anumber ofwhichare discussedinthis Correspondence to A. G. Cochran, Department of Protein Engineering, Genentech, Inc., 1 DNA Way, South San Francisco, CA, 94080, USA. Fax: + 1 650 225 3734, Tel.: + 1 650 225 5943, E-mail: andrea@gene.com Abbreviations: CDR3, complimentarity-determining region 3; GB1, B1 domain of protein G; scFv, single chain variable fragment; VH, variable heavy chain. (Received 5 January 2004, revised 18 February 2004, accepted 5 March 2004) region3wasfoundtostabilizetheframeworkofthecamelid antibody. Another major focus has been to obtain quanti-tative data on b-sheet stability determinants. We have suc-cessfully adapted a phage-display method for quantitating affinities of protein variants (shotgun alanine scanning) to analysis of GB1 stability. Using this method, we have ana-lyzed the energetic contributions of cross-strand side chain– side chain interactions. Finally, we discuss parameters to consider in using phage-display to discriminate subtle sta-bility differences among fully folded variants. Overall, this method provides a fast approach for quantitatively addres-sing biophysical questions. Keywords: beta sheet; hydrophobic core; phage-display; protein G; protein stability. review series, will have inherent advantages and limitations that should be considered in addressing specific questions of protein structure. Phage-display is one selection technique that has been successfullyappliedtoinvestigatingproteinstability[1,2].In adapting phage-display from the more common selection for binding affinity, investigators have focused on mutating residues affecting protein stability, but not directly involved in ligand binding (Fig. 1). Proteins are selected that retain binding capacity, with the implicit assumption that a properly folded protein is required for an intact binding interface [3,4]. As a protein mutagenesis strategy, phage-display offers a number of important advantages. The technology for generating large libraries (1010 members) has been well developed [5], permitting the simultaneous characterization of a relatively large number of mutants. In addition, the high in vitro stability of the phage particle [6] permits the use of a wide range of selection conditions. For example, investigators have used high temperature [7–9] and denat-urants [8,9] to increase selective pressure. Varying the stringency of selection conditions by these methods allows greater flexibility in experimental design and is particularly relevant to questions of protein stability. One limitation of the above approach is the requirement for a known binding partner with a binding interface that is unaffected by the mutations introduced. A number of researchers have developed strategies for circumventing this coupling of protein stability and function, relying on the greater susceptibility to proteolysisofunfoldedproteins.An 1624 J. D. Kotz et al. (Eur. J. Biochem. 271) Ó FEBS 2004 Fig. 1. Phage-display as a method for selecting for protein stability. Folded proteins are retained, based on the formation of the three-dimensional structure necessary to form afunctional bindinginterface. Unfolded proteins cannot bind and therefore are not selected. accompanying review by Bai & Feng will discuss the significant progress that has been made in developing these methods [10]. In this review, we will focus on studies in our laboratory investigating the effect of mutations on the stability of the B1 domain of Streptococcal protein G (GB1). One aspect of GB1 stability that we have addressed is the tolerance for mutations in the core of the protein. These studies provide a platformfor comparing the results of combinatorial experimental studies with a system that has been well characterized computationally [11–14]. This work relies upon the binding of properly folded GB1 to the immuno-globulin Fcfragmenttoseparatethefewfunctionalvariants froma large number of unfolded proteins. A similar strategy is used by Bond et al. to characterize residues necessary for protein stability in single chain camelid antibodies [15]. By comparing the results of these studies using different proteins but a similar experimental design, we discuss the extent to which the most stable proteins are selected. Additionally, our laboratory has extended phage-based stability selections to include quantification of the relative contributions of amino acid substitutions to protein stabil-ity. Using the b-sheet of GB1 as a model system, we have asked whether a number of variants, all of which are folded under the conditions of the selection, can be recovered differentially based on varying stabilities. Remarkably, we have found that, at least in some circumstances, a quanti-tative correlation to biophysical data can be obtained from a statistical analysis of selected phage populations [16]. We alsodiscussexperimentsaddressingthephysicalbasisofthis selection and the range of stabilities that can be differen-tiated. Selecting the most stable protein variants Two studies, one of GB1 and one of a camelid antibody, have employed phage-display to identify the most stable clones froma large pool of unfolded proteins. In both cases stable clones are identified fromthe selection and stability is confirmed when the individual proteins are characterized. However, it is not clear whether the globally most stable proteinsencodedinthelibrariesareidentified.Thesuccesses and limitations of this strategy, and ideas for increasing the selective pressure, are discussed below. Fig. 2. Mutation of the GB1 core. Core residues (red) and boundary residues(blue)wererandomizedinonelibrary.Othercoreresidues[10] are shown in gray. This figure and Fig. 5 were generated fromthe NMR structure (PDB code 2GB1) [33] using INSIGHT II (Accelrys, San Diego, CA). Repacking the GB1 core Wehavebeguntoinvestigatebyphage-displaythetolerance tosubstitutioninthehydrophobiccoreofGB1.Onegoalof this study was to compare phage-based strategies to computational methods. Based on the definitions of Mayo andcoworkers,ourlibraryfocusedoncoreresidues5,7and 30, having less than 10% of the side-chain surface exposed to solvent [11], and boundary residues 16, 18 and 33, those which lie at the interface between the buried core and surfaceresidues[12].Theseresiduesarespatiallyclosetoone another and potentially in contact (Fig. 2). However, this library does not exactly duplicate the computational library because in the experimental system, residues near the Fc binding interface cannot be varied (for instance, core residues 3 and 39, and boundary residues 25 and 29 that were all changed in a hyperstable GB1 variant [12]). Based on the computational studies, core positions 5 and 30 were expected to be intolerant to substitution. In contrast, the coreposition7andboundarypositions16 and18 havebeen shown to tolerate substitutions, with some variants even showing increased stability over the wild type protein [11,12,14]. Following threerounds ofselection atroomtemperature, a consensus sequence began to emerge (Table 1). In agreement with previous work, the wild type residues were predominantly observed at positions 5 and 30, whereas alternative residues appeared to be tolerated, or even preferred, at all other library positions. For instance, arginine and tryptophan were frequently observed at positions 16 and 18, and tryptophan was preferred at position 33. Following two additional rounds of selection, a few particular sequences began to dominate the library. Three individual GB1 variants were expressed and purified for biophysical characterization; one of these differed only at boundary residues and was based on the consensus sequence fromthe third round of sorting, while two represented the most dominant clones from the fifth round of sorting. All three proteins underwent two-state thermal Ó FEBS 2004 Phage-display for quantifying protein stability (Eur. J. Biochem. 271) 1625 Table 1. Experimental repacking of the GB1 core. A representative sample of clones obtained after three rounds of selection for GB1 binding is shown at top. Proteins that were individually purified and characterized are shown in bold. Position Round of % of 5 7 16 18 30 L L R R F L L R W F L F R W F L I R W L L S T L F L L T W F L L R W F L S I K F L Y P V F L L T T F 33 sorting clones Tm (°C) W 3 1 W 3 1 F 3 1 W 3 1 W 3 1 F 3 1 W R3 consensus 81 L 5 30 59 M 5 5 62 Y WT 81 unfolding transitions. For the mutant based on the third round consensus sequence, the melting temperature (Tm) was equal to that of wild type GB1 (81 °C). The Tm was somewhat reduced from wild type for the two dominant fifth round clones (59 °C and 62 °C; Table 1). In each case the proteins were fully folded at roomtemperature and retained IgG binding activity (J. D. Kotz & A. G. Cochran, unpublished results). To reduce the time required for computational analysis, only hydrophobic residues were allowed at core positions and only 16 residues were allowed at boundary positions (with cysteine, methionine, glycine and proline excluded in the in silico experiment) [11,12,14]. In contrast, in our experimentalsystemall20aminoacidswereencodedateach position. Surprisingly, in the two clones that dominated the library, amino acids disallowed in the computational study were shown to result in stable, folded proteins. At the core position 7, serine was observed in one of the frequently observed clones. At boundary position 16, a proline occurred in the second clone investigated (Table 1). These resultshighlightthestabilityoftheGB1coretosubstitutions that may seem energetically unlikely, or even irrational, but that can be rapidly explored using a phage selection. Design of a heavy chain antibody scaffold A conceptually similar approach was employed by Bond et al. in the design of a camelid heavy chain antibody scaffold for use in constructing naıve antibody libraries [15]. Here, the association of the variable heavy chain (VH) with protein A was used as a surrogate for direct stability measurements. The VH domains in camelid heavy chain antibodies are most similar to the classical VH3 family and as such bind protein A with micromolar affinity. Further-more, the protein A binding site is distal to the former light chain interface and involves residues within the b-sheet structure (Fig. 3). As in the GB1-Fc system, the protein A– antibody interaction requires a correctly folded molecule, and therefore binding can be used as a direct readout for antibody stability. To adapt to the loss of the light chain, these heavy chain scaffolds rely on portions of complimentarity-determining Fig. 3. StructureoftheProtein A–Fabcomplex(PDBcode1DEE)[34]. The heavy chain is colored blue and the light chain green. Protein A (orange)interacts with the heavy chain on the side opposite to the light chain interface and is represented using the program SWISSPDBVIEWER. region 3 (CDR3) to maintain structural integrity. This additional role of CDR3 complicated the design of heavy chain libraries for antigen binding selections by requiring a scaffold in which the structural residues of CDR3 were fixed. To identify these structurally important residues, potential heavy chain CDR3 scaffolds were evaluated by sorting a 17-residue CDR3 library against protein A. Following three rounds of sorting, 335 clones were isolated and sequenced. When purified proteins were individually characterized, the four most frequently observed clones had thermostabilitiesofapproximately60 °C,similartothoseof other camelid heavy chain antibodies [15]. A crystal structure of one clone revealed that the residues selected at both ends of the CDR3 loop are ordered and interact with the former light chain interface, supporting the idea that these residues are structurally important (Fig. 4). Con-versely, the remainder of the loop is disordered, consistent with the observed tolerance to substitution at these loop positions (C. J. Bond, J. C. Marsters & S. S. Sidhu, unpublished results; [15]). To what extent are the most stable sequences selected? In both of the above studies, the dominant clones obtained after selection were shown to be stable and well-folded proteins. However, in the GB1 study we failed to identify new variants with significantly increased stability compared to the starting protein. Furthermore, the dominant round five clones were not as stable as the wild type sequence and therefore were not the most stable proteins encoded in the library. This observation high-lights an important caveat when using phage-display. The sequences selected at each round are influenced by a variety of factors including codon usage, expression levels, 1626 J. D. Kotz et al. (Eur. J. Biochem. 271) Ó FEBS 2004 improve the stability of single chain variable fragment (scFv) by phage-display [8]. In their study, the phage-displayingscFvvariantsweresubjectedtohightemperature, denaturant or no stress prior to selection. High-temperature stress resulted in the most sequence convergence and the most stable clone analyzed. The authors concluded from this observation that, at least in their system, temperature stress was much more stringent than incubation with denaturant. However, this may not be general; instead, it may be a consequence of the irreversibility of thermal denaturation and the reversibility of chemical denaturation for scFV [8]. In the case of GB1, with known hyperstable variants, we intend to compare the results of increasing the selectivepressurethrougheachofthesemethods.Hopefully, as additional systems are investigated, a general under-standing of the pressure needed for a given stringency of selection will emerge. Quantifying protein stability A major focus of our laboratory has been extending the use of phage-display to allow ranking of the stabilities of individual proteins in a pool of folded variants. In addition, a rapid method for quantitatively, and simultaneously, characterizing a large number of mutants would greatly aid in understanding the effects of complex interactions on stability, for instance, the effect of long range tertiary contacts on b-sheet formation. We discuss recent advances intheuseofphage-displaytoprobetheenergeticsofb-sheet formation, as well as progress in understanding important experimental variables of the method. Fig. 4. Structure of the evolved heavy chain scaffold [17]. The frame-work regions are colored grey while CDR3 is colored red. Residues criticaltoscaffoldstablility,bothattheformerlightchaininterfaceand in CDR3, are shown in yellow. The im age was generated using SWISSPDBVIEWER. export to the surface of the phage and affinity for the target protein. Thus, although a folded protein is a minimum requirement, phage-display selection may not depend solely on protein stability. Toaddressthislimitation,theselectioncanbemademore dependent on protein stability by destabilizing the host. For example, to test the range of turn sequences permitted in stable proteins [7], DeGrado and coworkers generated randomturn sequences in GB1 host proteins of different stabilities; these libraries were screened for IgG competent bindersateitherroomtemperatureorelevatedtemperature. They observed no sequence preference for turns in the wild typehost,whereasclear sequencepreferences were observed in destabilized hosts. As the stringency of the screen increased, the functional solutions increasingly resembled the wild type sequence and turns that are most commonly found in proteins. Highlighting the effectiveness of the screen, they confirmed biochemically that IgG binders obtained under the most stringent conditions were signifi-cantly more stable than nonbinders [7]. In a different approach, Pluckthun and coworkers directly compared the use of temperature or denaturant as a selective pressure to Analyzing stability determinants in the b-sheet of GB1 The potential for obtaining quantitative biophysical infor-mation from phage-display was suggested by a new method called alanine shotgun scanning, which analyzes the ener-getic contribution of residues at a binding interface. Sidhu, Weiss and coworkers [18–20] have treated the observed frequency ratios of residues at a given position (i.e. wild type/alanine ratio) as an equilibriumconstant, which is then used to calculate the relative free energies of binding for different protein variants. Relative energies calculated from this distribution data have been shown to correspond directly with data obtained for individual purified mutants [18]. Thus, shotgun scanning provides a rapid method for using phage-display to quantify changes in affinity. In order to apply this method to the ranking of protein stabilities, the well-established b-sheet model system GB1 seemed an ideal initial target [13]. A library was constructed varying two cross-strand residues, 44 and 53 (Fig. 5). These positions were guest sites in published mutagenesis studies fromwhich b-sheet propensity scales have been developed [21–24]. In both protein mutagenesis and phage studies, the host protein included the I6A mutation, resulting in the destabilization of the protein by 2.5 kcalÆmol)1 relative to wild type GB1. Following a binding selection at room temperature, individual clones were sequenced. From the observed residue distributions at position 53, a phage-based stabilityscalewascalculated.Strikingly,thisscalecorrelated quantitatively with the published propensity scale derived from thermal stability measurements [16]. Ó FEBS 2004 Phage-display for quantifying protein stability (Eur. J. Biochem. 271) 1627 stabilities, we will probably need to achieve a balance between a very stable host, whose folded population would bepredictedtobeinsensitivetostabilitychanges,andavery unstable host that would not allow characterization of destabilizingmutations.Inourcurrentstudiesofpositions6 and 15 of the GB1 b-sheet, we have used hosts of two different stabilities (wild type and a mutant destabilized by 2 kcalÆmol)1). By comparing results from the two host proteins, we intend to characterize the host stability range necessary for obtaining quantitative data. Surprisingly, we have found that variants that should be fully folded at the temperature of selection can nevertheless be discriminated, raising questions about the basis for selection [16]. Fig. 5. Quantifying b-sheet stability. The hydrogen-bonded pair (red) and the nonhydrogen-bonded pair (blue) were varied in separate phage-displaylibraries.Surroundingresiduesfromthesolventexposed face of the b-sheet are shown in gray. The use of an unusually trivial library, in which just two surface positions were varied to all amino acids, permitted the rapid analysis of the energetic contribution of side chain–side chain interactions at a single surface site. This analysis could be compared to earlier studies of residue pairing in diverse b-sheets fromthe Protein Data Bank [25–27]. These earlier studies indicated many statistically significant deviations fromrandompairing, the nature of whichdependedtosomeextentontheexactmethodofdata normalization. In contrast, the data obtained by the phage-display method (that do not require normalization) sugges-tedonlyminor energeticcontributions from mostside chain interactions [16]. This rapid analysis of many amino acid pairs demonstrates the power of using combinatorial approaches to address questions of protein stability, where a large number of interactions must be characterized to understand the underlying trends. Current work in our laboratory is directed toward extending the studies of b-propensities and side chain–side chain interactions in b-sheets. We are now creating a library at positions 6 and 15 of the GB1 b-sheet. These two side chains forma nonhydrogen bonded pair, unlike residues 44 and53whosebackboneamidesarehydrogenbondedtoone another. Nonhydrogen bonded pairs have different Ca and Cb distancesthanhydrogenbondedpairsandthereforemay have different residue preferences and potential for side chain–side chain interactions. Initial results suggest that the selected residue distributions correlate well with a conven-tional stability scale for position 6, and we arein the process of analyzing the relationship between the two positions (J. D. Kotz & A. G. Cochran, unpublished results). Importantly, these results suggest that the phage-display method will be generally applicable to quantitative compar-isons of protein stabilities. Importance of host stability for selection The investigation of turn stability fromDeGrado and coworkers (discussed above) emphasized the importance of hoststabilityintuningthestringencyofaselectionorscreen [7]. In our effort to rank protein variants of very similar Physical basis for ordering stabilities As described above, a binding selection relies on the requirementthataproteinbefoldedinordertofunctionally interact with a binding partner. This is a powerful method fortheisolationofrarefoldedvariantsfromalargerpoolof unfolded molecules [4]. However, in distinguishing a more stable protein frommany other stable proteins, the physical basis for the selection is not as clear. The most straightfor-ward idea is that each protein variant is in equilibrium between the folded and unfolded state with only the folded state being competent for binding [2]. That is, folding is thermodynamically coupled to binding, resulting in an apparent affinity change as the fraction of folded molecules changes [28]. A second possibility is that many proteins in the pool are fully folded but that enhanced protein stability leads somehow to higher target affinity, increasing the likelihoodofrecoveryinthebindingselection.Finally,more stable proteins may be displayed at a higher level on the surface of the phage, but then once displayed, retained with equal probability during the interaction with the binding partner. To distinguish changes in display level from selections requiring interaction with the specific binding partner (stability- or affinity-based), one could divide a phage library in half and sort one half against a binding partner and the other half against an expression tag. A comparison of the sequences obtained from these two different selections should reveal any display bias. Alter-natively, a Western blot could be used to directly probe the display levels of selected protein variants. To distinguish affinity-based selections fromthose based solely on protein stability, the affinities of purified mutant proteins for target can be measured by standard methods (immunoassay or surface plasmon resonance) at the temperature of the phage selection (or at temperatures at which the variants are fully folded); in studies with GB1, it does not appear that sufficient affinity differences exist among the folded variants to explain their differential selection (J. D. Kotz & A. G. Cochran, unpublished results) [16,24]. Analyzing the less stable GB1 variants Another potential limitation in quantifying stability by phage-display results fromthe inherently larger number of sequencesobservedforthemorestableclones.Forexample, in the analysis of side chain interactions at positions 44 and 53 in GB1, the hydrophobic amino acids are more stabilizing, and thus they occur much more frequently than ... - tailieumienphi.vn
nguon tai.lieu . vn