Supplementary Material Characterization of cleavage intermediate

advertisement
Supplementary Material
Characterization of cleavage intermediate and star sites of RM.Tth111II
Zhenyu Zhu*, Shengxi Guan, Derek Robinson, Hanna El Fezzazi, Aine Quimby, and
Shuang-yong Xu*
New England Biolabs, Inc., 240 County Road, Ipswich, MA 01938, USA
Supplementary Figure 1.
Color-coded functional domains and secondary structure prediction and PROMALS3D
alignment (computer server: http://prodata.swmed.edu/promals3d/promals3d.php).
Tth111II and TthHB27I amino acid sequences were used in the alignment. The exact
boundaries of the functional domains remain to be determined by structure analysis and
experimentation.
Blue=endonuclease catalytic domain (rough boundary: aa 1-146)
Green=alpha helical domain (rough boundary: aa 147-374)
Orange=N6-adenine methyltransferase  group (rough boundary 375-726,
MTase motifs X (FYTP), I (GSGT), VIII (VFEGAS), underlined aa blocks)
Dark blue=specificity domain (rough boundary 727-1106)
Secondary structure (ss) prediction: h,  helix; e,  sheet. 9=high
probability of the secondary structure prediction (PROMALS3D)
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
1
1
71
56
141
126
211
196
281
266
351
336
421
406
491
476
561
546
9999999999999999999999999999999999999999999999999999999999999999999999
MLSLLTGGVFRRVKLMNWIDLYTHLKQEVPWFFNSVRLAASQAHNEAEFESRINNAIERLAQKLGVQLLF
---------------MNWIDLYTHLKQEVPWFFNSVRLAASQAHNEAEFESRINNAIERLAQKLGVQLLF
...............MNWIDLYTHLKQEVPWFFNSVRLAASQAHNEAEFESRINNAIERLAQKLGVQLLF
hhh hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
hhhhhhhhhhhhhhhh
eeee
9999999999999999999999999999999999999999999999999999999999999999999999
REQYTLATGRADAVYNRLVIEYEPPGSLRPNLKHSHTQHAVRQVMNYIEELSRAERHDRDRLLGVVFDGH
REQYTLATGRADAVYNRLVIEYEPPGSLRPNLKHSHTQHAVRQVMNYIEELSRAERHDRDRLLGVVFDGH
REQYTLATGRADAVYNRLVIEYEPPGSLRPNLKHSHTQHAVRQVMNYIEELSRAERHDRDRLLGVVFDGH
eeeeee
hhhhhhhhhhhhhhhhh
hhhh hhhh
9999999999999999999999999999999999999999999999999999999999999999999999
YFIFVRYHEGHWIVEEPLEVNPASCERFLRSLFSLSSGRALIPENLVEDFGSQNDLSRQATRALYHALQG
YFIFVRYHEGHWIVEEPLEVNPASCERFLRSLFSLSSGRALIPENLVEDFGSQNDLSRQATRALYHALQG
YFIFVRYHEGHWIVEEPLEVNPASCERFLRSLFSLSSGRALIPENLVEDFGSQNDLSRQATRALYHALQG
eeeeeee
eee
hhhhhhhhhhhhh
hhhhhhhhhhhh hhhhhhhhhhhhhhh
9999999999999999999999999999999999999999999999999999999999999999999999
HTSDLTARLFVQWQIFFGETAGADAAGGELKHKSELLAFARGMGLRGSRIDMPRFLFALHTYFSFLVKNI
HTSDLTARLFVQWQIFFGETAGADAAGGELKHKSELLAFARGMGLRGSRIDMPRFLFALHTYFSFLVKNI
HTSDLTARLFVQWQIFFGETAGADAAGGELKHKSELLAFARGMGLRGSRIDMPRFLFALHTYFSFLVKNI
hhhhhhhhhhhhh
hhh hhhhhhhhhhhh
hhhhhhhhhhhhhhhhhhh
9999999999999999999999999999999999999999999999999999999999999999999999
ARLVLQAYAGGGLGTTPLTTIANLEGEALRRELQNLESGGLFRTLGLKNLLEGDFFAWYLDAWNPEVEEA
ARLVLQAYAGGGLGTTPLTTIANLEGEALRRELQNLESGGLFRTLGLKNLLEGDFFAWYLDAWNPEVEEA
ARLVLQAYAGGGLGTTPLTTIANLEGEALRRELQNLESGGLFRTLGLKNLLEGDFFAWYLDAWNPEVEEA
hhhhhh
hhh
hhhhhhhhhhhh
hhhh
hhhhhhhhhhh hhhhhh
9999999999999999999999999999999999999999999999999999999999999999999999
LRQVLARLAEYNPATVQDDPHSARDLLKKLYHYLLPRDIRHDLGEFYTPDWLAERLLNQLGEPWFIMPPG
LRQVLARLAEYNPATVQDDPHSARDLLKKLYHYLLPRDIRHDLGEFYTPDWLAERLLNQLGEPWFIMPPG
LRQVLARLAEYNPATVQDDPHSARDLLKKLYHYLLPRDIRHDLGEFYTPDWLAERLLNQLGEPWFIMPPG
hhhhhhhhhhhhh
hhhhhhhhhhhhhhhh
hhhhhhhhhhh hhhhhh
9999999999999999999999 99999999999999999999999999999999999999999999999
NHPPRGLPDKRLLDPACGSGTFLVLAIRALKVNCFLAGFSEADTLEVILNSVVGIDLNPLAVTAARVNYL
NHPPRGLPDKRLLDPACGSGTFPVLAIRALKVNCFLAGFSEADTLEVILNSVVGIDLNPLAVTAARVNYL
NHPPRGLPDKRLLDPACGSGTF.VLAIRALKVNCFLAGFSEADTLEVILNSVVGIDLNPLAVTAARVNYL
eee
hhhhhhhhhhhhhhhh
hhhhhhhhh eeeee hhhhhhhhhhhh
9999999999999999999999999999999999999999999999999999999999999999999999
LAIADLLPYRRREVEIPVYLADSILTPARGEGLFAQNRRILETAVGPLPVPEVINSRAKMERLTDLLEEY
LAIADLLPYRRREVEIPVYLADSILTPARGEGLFAQNRRILETAVGPLPVPEVINSRAKMERLTDLLEEY
LAIADLLPYRRREVEIPVYLADSILTPARGEGLFAQNRRILETAVGPLPVPEVINSRAKMERLTDLLEEY
hhh
hhhhhhhhhhhhhh
999999999999999999999999999999 9 9999999999999999999999999999999999999
VRGDFSTEAFLARAKKEIPDLADALHADEVLTELYERLRDLHRQGLDGIWARVLKNAFMPLFLEPFDYVV
VRGDFSTEAFLARAKKEIPDLADALHADEVITGLYERLRDLHRQGLDGIWARVLKNAFMPLFLEPFDYVV
70
55
140
125
210
195
280
265
350
335
420
405
490
475
560
545
630
615
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
Conservation:
TthHB27I_CAARCA
Tth111II_CAARCA
Consensus_aa:
Consensus_ss:
VRGDFSTEAFLARAKKEIPDLADALHADEVlT.LYERLRDLHRQGLDGIWARVLKNAFMPLFLEPFDYVV
h
hhhhhhhhhhhhhhhh
hhhhhhhhhhhhhhhhhhhhhhhhhhh
eeeee
631
616
701
686
771
756
841
826
911
896
981
966
9999999999999999999999999999999999999999999999999999999999999999999999
GNPPWINWESLPQAYREQTAELWTCYGLFVHSGMDTILGKGKKDASTLMTYAVADRFLKEGGKLGFLITQ
GNPPWINWESLPQAYREQTAELWTCYGLFVHSGMDTILGKGKKDASTLMTYAVADRFLKEGGKLGFLITQ
GNPPWINWESLPQAYREQTAELWTCYGLFVHSGMDTILGKGKKDASTLMTYAVADRFLKEGGKLGFLITQ
hhhhhhhhhhhhhhhhh
hhhhhhhhhhhhhhh
eeeeee
999999999999999999999999999999999999999999999999999999 999999999999999
SVWKTGAGQGFRRFRIGENGPHLRVLHVDDLSSLQVFEGASTRTSAFVLQKGRPTRYPVPYTYWKKTTKG
SVWKTGAGQGFRRFRIGENGPHLRVLHVDDLSSLQVFEGASTRTSAFVLQKGRPPRYPVPYTYWKKTTKG
SVWKTGAGQGFRRFRIGENGPHLRVLHVDDLSSLQVFEGASTRTSAFVLQKGRPsRYPVPYTYWKKTTKG
hhh
hhhhhhhhhh
eeeeeee
eeeeeeeee
eeeee
99999999999999999999999999999999999999999999 9999999999999999999999999
EGLDYDSTLGEVMEQTKRLRFHAVPVDPDDLTSPWLTARRRALYAVRKVLGTSEYRAYEGANSGGANGIY
EGLDYDSTLGEVMEQTKRLRFHAVPVDPDDLTSPWLTARRRALYSVRKVLGTSEYRAYEGANSGGANGIY
EGLDYDSTLGEVMEQTKRLRFHAVPVDPDDLTSPWLTARRRALYtVRKVLGTSEYRAYEGANSGGANGIY
hhhhhhh hhhhhhh
hhhhhh
ee
ee
9999999999999999999999999999999999999999999999999999999999999999999999
WLEILAERPDGLVVVRNVTEGAKREVEGITTELEPDLLYPLLRGRDVRRWYAQPSLHILMVQDPKTRRGI
WLEILAERPDGLVVVRNVTEGAKREVEGITTELEPDLLYPLLRGRDVRRWYAQPSLHILMVQDPKTRRGI
WLEILAERPDGLVVVRNVTEGAKREVEGITTELEPDLLYPLLRGRDVRRWYAQPSLHILMVQDPKTRRGI
ee
hhhhhhhhhh
hhhhhhhh
eee
9999999999999999999999999999999999999999999999999999999999999999999999
DEQVLQKRYPKTWAYLKRFEAVLRERSGFRRYFTRKDRNGRMVETGPFYSMFNVGDYTFAPWKVVWRYVA
DEQVLQKRYPKTWAYLKRFEAVLRERSGFRRYFTRKDRNGRMVETGPFYSMFNVGDYTFAPWKVVWRYVA
DEQVLQKRYPKTWAYLKRFEAVLRERSGFRRYFTRKDRNGRMVETGPFYSMFNVGDYTFAPWKVVWRYVA
hhhhhhhhhhhhhhhhhhhhhhhh
eeeee
700
685
770
755
840
825
910
895
980
965
9999999999999999999999999999999999999999999999999999999999999999999999
SDFIVAVVGPASDEKPVVPNEKLMLVPVEDDNEAFYLCGVLNSSPIRFAVQSFFVQTQIAPHVLQKLCIP 1050
SDFIVAVVGPASDEKPVVPNEKLMLVPVEDDNEAFYLCGVLNSSPIRFAVQSFFVQTQIAPHVLQKLCIP 1035
SDFIVAVVGPASDEKPVVPNEKLMLVPVEDDNEAFYLCGVLNSSPIRFAVQSFFVQTQIAPHVLQKLCIP
eeeee
eeee eeeee
hhhhhhhhhhh hhhhhhhhhh
ee
hh
Conservation:
TthHB27I_CAARCA 1051
Tth111II_CAARCA 1036
Consensus_aa:
Consensus_ss:
9999999999999999999999999999999999999999999999999999999999999999999999
RYEPNTDHQNRIAHLSRRAHELAPAAYNGDKAARAELRRVEEEIDRAAAQLWGLTEEELAEIRRSLEELR 1120
RYEPNTDHQNRIAHLSRRAHELAPAAYNGDKAARAELRRVEEEIDRAAAQLWGLTEEELAEIRRSLEELR 1105
RYEPNTDHQNRIAHLSRRAHELAPAAYNGDKAARAELRRVEEEIDRAAAQLWGLTEEELAEIRRSLEELR
hhhhhhhhhhhhhhhhhhhh
hhhhhhhhhhhhhhhhhhhhhh
hhhhhhhhhhhhhh
Conservation:
TthHB27I_CAARCA 1121
Tth111II_CAARCA 1106
Consensus_aa:
Consensus_ss:
9
G 1121
G 1106
G
Download