PRO_2187_sm_SuppInfo

advertisement
Supporting Methods
Input structure energy minimization
Many protein X-ray crystal structures have minor clashes of atomic radii, improper
orientation of asparagine, glutamine, or histidine side-chains, or backbone dihedral
angles outside of acceptable ranges.1 All of these problems can adversely affect the
Rosetta score function. Thus every input X-ray crystal structure was repacked and
minimized with Rosetta. This process has three steps. First, the side-chain χ angles are
minimized to optimize local contacts, next a full rotamer packing step is done to relieve
any clashes that minimization could not solve. Finally, the side-chains, backbone and
rigid body orientation is minimized to obtain the structure used for further analysis.
Most minimized output structures had an RMSD of less than 1.5 Å to the native crystal
structure. The designed models were also run through this protocol to allow comparison
between the designs and native structures. An example command line for this protocol
is given below:
./min_pack_min.<exe>
-database <rosetta_database>
-l start_structs.list
-pack_first false
-no_rbmin false
-min_all_jumps true
-nstruct 50
-score12prime true
-out::pdb_gz true
-ndruns 5
-run::min_type dfpmin_armijo_nonmonotone
-use_input_sc true -ex1 -ex2 -extrachi_cutoff 1
-no_his_his_pairE true
-ignore_unrecognized_res
-no_optH false
Interface analysis protocol
The protein-protein interfaces were analyzed using the InterfaceAnalyzerMover 2. This
protocol takes a protein-protein complex as an input structure and then creates an
unbound structure. The interface energy, SASA, and other metrics are calculated based
on the differences between the bound and unbound structures.
./rosetta_scripts.<exe>
-l minimized_structs.list
-parser:protocol interface_analysis.xml
-score12prime true
-ignore_unrecognized_res
-no_his_his_pairE
-out:file:score_only IA.score.sc
-no_optH false -ex1 -ex2 -use_input_sc
-run::min_type dfpmin_armijo_nonmonotone
-extrachi_cutoff 1
-linmem_ig 10
-ignore_unrecognized_res
-atomic_burial_cutoff 0.01
-sasa_calculator_probe_radius 1.2
Where interface_analysis.xml is the following script:
<ROSETTASCRIPTS>
<SCOREFXNS>
<s12_prime weights="score12prime"/>
</SCOREFXNS>
<TASKOPERATIONS>
</TASKOPERATIONS>
<MOVERS>
<InterfaceAnalyzerMover name=fullanalyze scorefxn=s12_prime
packstat=1 pack_input=0 pack_separated=1 jump=1 tracer=0
use_jobname=1 resfile=0/>
</MOVERS>
<PROTOCOLS>
<Add mover_name=fullanalyze/>
</PROTOCOLS>
</ROSETTASCRIPTS>
Interface hot-spot calculation
Hotspot residues at designed and natural interfaces were found by mutating all
interface residues, except for glycine, to alanine and calculating Gbind with Rosetta. A
residue was judged to be a hotspot if the calculated was greater than +2.0 REU. Only
natural and designed heterodimers were used. The following command line was used to
run the protocol:
rosetta_scripts.default.macosclangrelease -l
heterodimer_pdbs.list -parser:protocol interface_hotspots.xml
The script interface_hotspots.xml is adapted from3, it also makes use of a score function
trained to recapitulate experimental Gbind4.
<ROSETTASCRIPTS>
<SCOREFXNS>
<interface_score weights=interface/>
</SCOREFXNS>
<TASKOPERATIONS>
<RestrictToInterfaceVector name=interface_task
jump=1/>
</TASKOPERATIONS>
<FILTERS>
<TaskAwareAlaScan name=ala_scan
task_operations=interface_task jump=1 repeats=3
scorefxn=interface_score repack=1 report_diffs=1
exempt_identities=GLY write2pdb=1 />
</FILTERS>
<MOVERS>
</MOVERS>
<PROTOCOLS>
<Add filter_name=ala_scan/>
</PROTOCOLS>
</ROSETTASCRIPTS>
Polar burial definition
Rosetta calculates solvent accessible surface area (SASA) using the Le Grand and Merz
method5 and keeps the SASA up to date as described by Leaver-Fay et al.6 The SASA for
a polar atom is sum of the SASA for that atom, plus the SASA for any bound hydrogens.
A polar atom is defined as buried if the total SASA for that atom is less than 0.1 Å 2. If a
buried polar atom does not have a hydrogen-bonding partner, as defined as having a
hydrogen-bond energy of less than 0.0 REUs, then that atom is considered buried and
unsatisfied. A hydrogen bond is defined as buried if the SASA for the two involved polar
atoms is less than 3.0 Å2. Based on distances observed from low B-factor waters to
protein atoms7 we chose to use atomic radii from Reduce8 and a water probe radius of
1.2 Å to find buried polar atoms and hydrogen bonds.
References
1. Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW,
Arendall WB, Snoeyink J, Richardson JS, et al. (2007) MolProbity: all-atom contacts
and structure validation for proteins and nucleic acids. Nucleic acids research 35:W375–
83.
2. Lewis SM, Kuhlman BA (2011) Anchored design of protein-protein interfaces. PloS
one 6:e20872.
3. Meenan NAG, Sharma A, Fleishman SJ, Macdonald CJ, Morel B, Boetzel R, Moore
GR, Baker D, Kleanthous C (2010) The structural and energetic basis for high selectivity
in a high-affinity protein-protein interaction. Proceedings of the National Academy of
Sciences of the United States of America 107:10080–5.
4. Kortemme T, Baker D (2002) A simple physical model for binding energy hot spots in
protein-protein complexes. Proceedings of the National Academy of Sciences of the
United States of America 99:14116–21.
5. Le Grand SM, Merz KM (1993) Rapid Approximation to Molecular Surface Area via
the Use of Boolean Logic and Look-Up Tables. Journal Of Computational Chemistry
14:349–352.
6. Leaver-Fay A, Butterfoss G, Snoeyink J, Kuhlman B (2007) Maintaining solvent
accessible surface area under rotamer substitution for protein design. Journal Of
Computational Chemistry 28:1336–1341.
7. Matsuoka D, Nakasako M (2009) Probability distributions of hydration water
molecules around polar protein atoms obtained by a database analysis. The journal of
physical chemistry. B 113:11274–92.
8. Word JM, Lovell SC, Richardson JS, Richardson DC (1999) Asparagine and
Glutamine: Using Hydrogen Atom Contacts in the Choice of Side-chain Amide
Orientation. Journal Of Molecular Biology 285:1735–1747.
9. Lawrence MC, Colman PM (1993) Shape complementarity at protein/protein
interfaces. Journal of molecular biology 234:946–50.
10. Fleishman SJ, Whitehead TA, Ekiert DC, Dreyfus C, Corn JE, Strauch E-MM, Wilson
IA, Baker D (2011) Computational design of proteins targeting the conserved stem
region of influenza hemagglutinin. Science 332:816–821.
11. Karanicolas J, Corn JE, Chen I, Joachimiak LA, Dym O, Peck SH, Albeck S, Unger T,
Hu W, Liu G, et al. (2011) A de novo protein binding pair by computational design and
directed evolution. Molecular cell 42:250–60.
Supporting Figures
0.4
0.8
Frequency
0.3
designed
heterodimers
homodimers
0.2
0.1
0.4
24
00
22
00
20
00
18
00
16
00
14
00
12
00
0.2
10
00
80
0
0.0
60
0
Frequency
0.6
DSASA (Å2)
DSASA (Å2)
50
00
45
00
40
00
35
00
30
00
25
00
20
00
15
00
10
00
50
0
0.0
>
Figure S1: Size of designed and natural interfaces. The change in SASA upon binding
for natives and designs is shown. Inset: Designs only on a smaller scale. Successful
designs are highlighted by colored lines (red = GLhelix-4, blue = HB36, purple = HB80,
green = MID1, orange = βdimer1). Solid lines represent computational designs, dashed
lines represent the crystal structure of the design.
DGbind / DSASA
0.00
-0.01
-0.02
-0.03
-0.04
0
1000
2000
3000
DSASA (Å2)
Figure S2: Comparison of interface energy density (∆Gbind/∆SASA) designed interfaces
and crystal structures of successful designs. Successful designs are highlighted by
colored points (red = GLhelix-4, blue = HB36, purple = HB80, green = MID1, orange =
βdimer1). Solid squares represent the computational model, ×’s represent the crystal
structure of the design. Least squares fit is described in main text Figure 2.
A
0.4
Frequency
0.3
designed
native
native X-tal
0.2
0.1
0.
90
0.
85
0.
80
0.
75
0.
70
0.
65
0.
60
0.
55
0.
50
0.0
Shape complementarity
B
0.4
Frequency
0.3
designed
native
native X-tal
0.2
0.1
0.
85
0.
80
0.
75
0.
70
0.
65
0.
60
0.
55
0.
50
0.
45
0.0
RosettaHoles score
Figure S3: Packing quality measure of the design models, Rosetta minimized natural
interfaces and crystal structures of native proteins. Two independent measures of
packing quality are shown; (A) the shape complementarity score9 for interface the
interface and (B) the RosettaHoles score for residues at the interface. For each metric a
value of 1.0 represents perfect packing, while lower values represent packing defects.
Dashed lines represent the crystal structures of the design models (red = GLhelix-4, blue
= HB36, HB80 = purple, green = MID1, orange = βdimer1).
0.4
Frequency
0.3
natural heterodimers
designed heterodimers
0.2
0.1
0.0
0
1
2
3
4
5
6
Hotspots / 1000 Å2
Figure S4: Number of hotspot residues at natural and designed interfaces per 1,000 Å2
(SASA) of interface area. A hotspot residue is defined as one such that mutation of that
residue to alanine gives a Gbind > +2.0 REU.
0.25
0.20
Frequencey
Fleishman set
Our lab set
0.15
0.10
0.05
0.
58
0.
55
0.
52
0.
49
0.
46
0.
43
0.
40
0.
37
0.
34
0.
31
0.
28
0.
25
0.00
DSASA(Polar)/DSASA
Figure S5: Fraction of polar interface area of two sets of designed protein interfaces.
The Fleishman et al. (Nature 2011) set represents proteins designed to bind to influenza
HA. Our lab set represents the designs included in Supporting Information Table II and.
A
0.4
Frequency
0.3
designed
native
0.2
0.1
0.0
0
1
2
3
4
5
6
2
No. Unsatisfied / 1000 Å
B
0.5
Frequency
0.4
designed
native
0.3
0.2
0.1
0.
28
0.
24
0.
20
0.
16
0.
12
0.
08
0.
04
0.
00
0.0
Buried side-chain EH-bond / DGbind
Figure S6: Buried polar atoms and buried hydrogen bonds at interfaces of design
models. Values for successful design models are shown in solid lines (red = GLhelix-4,
blue = HB36, HB80 = purple, green = MID1, orange = βdimer1). A) The number of
buried polar atoms without a hydrogen-bonding partner per 1,000 Å2 of interface area.
B) The total energy of a buried, side-chain involved hydrogen bond at the interface as a
fraction of total binding energy (∆Gbind). HB36 and HB80 acquired an additional buried
hydrogen bond due to mutations introduced by affinity maturation.
Supporting Information Table I
PROJECT
-strand
homodimer
experimental
result
success
definition
failure
failure
failure
failure
failure
failure
betadimer2
betadimer3
betadimer4
oligomer
oligomer
oligomer
oligomer
monomer
monomer
dimer KD = 1
M
dimer low
expression
monomer
monomer
1JL1_anti_des1
1JL1_parl_des1
1sc0_des1
no binding
no binding
dimer
failure
failure
failure
1c1y_tf
1c1y_v
1c1y_wy
no binding
no binding
no binding
failure
failure
failure
30 M
nonspecific
failure
>50μM
>50μM
failure
failure
2d4x_0028
2d4x_0320
2fz4_design
2onu_design
20 μM
40 μM
> 100 μM
aggregates
weak
failure
failure
failure
1g2r_334
oligomer
no
expression
dimer
failure
Design model
1cc8_3742
1cc8_0966
1cc8_1894
1cc8_4097
1cc8_LeuZipper
1cc8_4579
betadimer1
RalA
binder
design
Rac1
binder
design
Ubiquitin
binder
design
Znmediated
heterodime
r
Znmediated
homodimer
1JJV.ubq.11_0005
1Y8C.ubq.123_000
2
2ODV.ubq.9_0003
1rz4_436
1yzm_329
strong
PDB ID
3zy7
weak
failure
failure
failure
strong
3v1c
2qov_414
KD=30 nM
oligomer
oligomer
monomer/dime
r - poor
solubility
always
monomer
monomer w/o
zinc, tet
with zinc
ubch7_10266
helical_peptide
no binding
no binding
2il5_335
1he9_180
2a90_308
2d4x_557
UbcH7 and
Ankyrin
designs
Ubc12 Zn
Ankyrin
designs
PAK1
binder
design
-pix
anchor
design
G-tail Cterm helix
0097_D108A_D99G
aggregates
with zinc
aggregates
with zinc
1i2t_233
spider_roll
1i2t_1212
1i2t_3533
s032
s037
mbp_17
mbp_42
1mn8_17567
1mn8_4957
no binding
KD = 100 μM
330 μM
not soluble
not soluble
160 μM
no binding
no binding
not soluble
not soluble
bpix1
bpix2
bpix3
100 μM
155 μM
148 μM
non-specific
binding
non-specific
binding
non-specific
binding
non-specific
binding
0032_I141A
gtail1
gtail2
gtail3
gtail4
3v1e
failure
failure
failure
failure
failure
failure
failure
failure
failure
no
weak
failure
failure
failure
failure
failure
failure
failure
failure
failure weakened
failure
failure
failure
failure
failure
failure
GoLoco
extension
1255
1680
4091
aa_0273
aa_0951
aa_0971
aa_0976
Successful designs from other
Pdar/Prb
Design_11
HA binder
HB36
HB80
no binding
no binding
no binding
no binding
KD = 800 nM
no binding
no binding
labs:
KD < 10 nM
wrong
orientation
Evolved to
KD = 4 nM
Evolved to
KD = 1 nM
failure
failure
failure
failure
strong
failure
failure
2xns
weak
3q9n
strong
3r2x
strong
4eef
The design models for the designed binders to HA can be found in the supplemental
information in Fleishman et al.10 Models for Pdar/Prb from Karanicolas et al.11 are
available on Proteipedia:
http://www.proteopedia.org/wiki/index.php/User:John_Karanicolas/de_novo_protein_binding_pairs_
by_computational_design
Supporting Information Table II
Heterodimer (Chains)
1ACB (E,I)
1AVA (A,C)
1AVW (A,B)
1AXI (B,A)
1AY7 (A,B)
1B27 (A,D)
1BLX (A,B)
1BND (A,B)
1BPL (B,A)
1BRB (E,I)
1BRL (A,B)
1BVN (P,T)
1C1Y (A,B)
1CGI (E,I)
1CSE (E,I)
1CT4 (E,I)
1CXZ (A,B)
1D4V (B,A)
1D4X (A,G)
1D6R (A,I)
1DFJ (E,I)
1DHK (A,B)
1DS6 (A,B)
1DTD (A,B)
1DZB (A,X)
1E44 (B,A)
1E96 (B,A)
1EAI (A,C)
1EAY (A,C)
1EFV (A,B)
1EM8 (A,B)
1EUC (B,A)
1EUV (A,B)
1F2T (A,B)
1F34 (A,B)
1F5Q (A,B)
1F5R (A,I)
1F60 (A,B)
1FCD (A,C)
1FFG (B,A)
1FIN (A,B)
Homodimer (Chains)
1A4I (A,B)
1A4U (A,B)
1AA7 (A,B)
1AD1 (A,B)
1ADE (A,B)
1AFW (A,B)
1ALK (A,B)
1AOR (A,B)
1AQ6 (A,B)
1AUO (A,B)
1BBH (A,B)
1BD0 (A,B)
1BH5 (A,B)
1BJW (A,B)
1BMD (A,B)
1BXG (A,B)
1C6X (A,B)
1CBK (A,B)
1CDC (A,B)
1CHM (A,B)
1CNZ (A,B)
1COZ (A,B)
1CQS (A,B)
1D1G (A,B)
1DOR (A,B)
1DPG (A,B)
1DQP (A,B)
1DQT (A,B)
1DVJ (A,B)
1EAJ (A,B)
1EBL (A,B)
1EHI (A,B)
1EKP (A,B)
1EN5 (A,B)
1EN7 (A,B)
1EOG (A,B)
1EV7 (A,B)
1EWZ (A,C)
1EXQ (A,B)
1EYV (A,B)
1EZ2 (A,B)
1FR2
1FT1
1FYH
1G4U
1G4Y
1GL4
1GPW
1GUA
1H1S
1H2A
1H2S
1H31
1HE1
1HL6
1HX1
1I1R
1IAR
1IBR
1ITB
1J7D
1JIW
1JKG
1JLT
1JQL
1JTD
1JTP
1JTT
1JW9
1K9O
1KA9
1KI1
1KSH
1KTZ
1KU6
1KXP
1KXV
1L4Z
1LFD
1LUJ
1LW6
1M4U
1M9E
1MA9
1MEE
1MG9
(B,A)
(B,A)
(A,B)
(S,R)
(R,B)
(A,B)
(A,B)
(A,B)
(A,B)
(L,S)
(A,B)
(A,B)
(C,A)
(A,B)
(A,B)
(A,B)
(B,A)
(B,A)
(B,A)
(B,A)
(P,I)
(B,A)
(B,A)
(A,B)
(A,B)
(A,L)
(A,L)
(B,D)
(E,I)
(F,H)
(B,A)
(A,B)
(B,A)
(A,B)
(D,A)
(A,C)
(A,B)
(B,A)
(A,B)
(E,I)
(A,L)
(A,D)
(A,B)
(A,I)
(B,A)
1F13
1F17
1F4Q
1F6D
1F89
1FC5
1FJH
1FL1
1FP3
1FUX
1FWL
1FYD
1G0S
1G1A
1G1M
1G64
1G8T
1GD7
1GGQ
1H8X
1HDY
1HJ3
1HJR
1HQO
1HSJ
1HSS
1I0R
1I2W
1I4S
1I8T
1IPI
1IRI
1J30
1JD0
1JMV
1JOG
1JP3
1JR8
1JV3
1JYS
1K3S
1K6Z
1K75
1KGN
1KIY
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,C)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
1N0L
1NF3
1NLV
1NPE
1NQI
1NRJ
1NW9
1O6S
1OHZ
1OKK
1ONQ
1OO0
1OP9
1OPH
1OR0
1OXB
1P2J
1P2M
1P5V
1PDK
1PPF
1PQZ
1PVH
1Q40
1QAV
1R0R
1R8S
1RE0
1RJ9
1RKE
1S0W
1S1Q
1SCJ
1SGD
1SHW
1SLU
1SMP
1SPB
1STF
1SVX
1TA3
1TE1
1TMQ
1TX4
1UBK
(A,B)
(A,C)
(A,G)
(A,B)
(B,A)
(B,A)
(B,A)
(A,B)
(A,B)
(D,A)
(A,B)
(A,B)
(B,A)
(B,A)
(B,A)
(A,B)
(A,I)
(A,B)
(A,B)
(A,B)
(E,I)
(A,B)
(A,B)
(B,A)
(B,A)
(E,I)
(E,A)
(B,A)
(A,B)
(A,B)
(A,C)
(A,B)
(A,B)
(E,I)
(B,A)
(B,A)
(A,I)
(S,P)
(E,I)
(B,A)
(B,A)
(B,A)
(A,B)
(A,B)
(L,S)
1KSO
1L5X
1LBQ
1LHP
1LHZ
1LNW
1LQ9
1M0W
1M3E
1M4I
1M6P
1M7H
1M98
1M9K
1MI3
1MJH
1MKB
1MNA
1N1B
1N2A
1N2O
1N80
1NA8
1NFZ
1NU6
1NW1
1NWW
1NY5
1O4U
1OAC
1ON2
1OR4
1ORO
1OTV
1OX8
1P3W
1P43
1P6O
1PE0
1PJQ
1PN0
1PN2
1PP2
1PT5
1Q8R
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,C)
(A,B)
(R,L)
(A,B)
(A,B)
1UGH
1UJZ
1US7
1USU
1UUZ
1UZX
1V74
1VG0
1WQ1
1YCS
1YVN
2HBE
2KIN
2NGR
2SIC
2SNI
2TEC
3FAP
3YGS
4SGB
(E,I)
(B,A)
(B,A)
(A,B)
(D,A)
(A,B)
(A,B)
(A,B)
(G,R)
(B,A)
(A,G)
(B,A)
(A,B)
(B,A)
(E,I)
(E,I)
(E,I)
(A,B)
(P,C)
(E,I)
1QFH
1QHI
1QMJ
1QR2
1QXR
1QYA
1R5P
1R7A
1R8J
1R9C
1REG
1RN5
1RQL
1RVE
1RYA
1S2Q
1S44
1SCF
1SMT
1SO2
1SOX
1TLU
1TRK
1UC8
1V26
2DAB
2GSA
2HHM
2NAC
2SQC
3LYN
3SDH
7AAT
8PRK
9WGA
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(X,Y)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
(A,B)
Download