RNA binding protein Caprin-2 is a pivotal regulator of the central

advertisement
1
Supplementary File 1
RAT CAPRIN 2
(a) cDNA cloned from rat brain and verified by sequencing.
Exons marked black and blue were predicted on the basis of GT and AG flanking
regions of introns
10
20
30
40
>atgaagtcagccaagtcccaagtgaaccacactcagcaaggggaaaaccagcgggctctgagccccctgcagt
ctactctcagttctgctgcatctccttcccaggcatacgaaacctatattgataatggacttatatgccttaaacacaaaatt
aggaacatcgagaagaagaagctcaaactggaagattacaaagatcgcctgaaaaatggagagcagcttaaccc
agaccagttggaagcagtggaaaagtatgaagaagtacttcataatttggaatttgccaaggagcttcagaaaaccttt
tctgcactgagccaagatctcctgaaagcgcagaaaaaggcccagagaagggagcacatgctaaaacttgaggc
cgagaagaaaaagcttcgaactatacttcaaattcagtatgtattacagaacttgacacaagaacatgtacagaaag
acttcaaagggggcttgaatggtgcaatgtatttgccttcaaaagaacttgactacctcattaaattctcaaaactgacct
gccctgaaagaaatgaaagtttgagtgttgaagaccagatggagcagtatccttgtacttttgggaccttttggaaggta
gtgagaaagcagtggtaggaacaacatacaaacatgtgaaagacctgctgtccaaattgctgcactcaggttattttg
aaagtgtcccagttctcaggaattctaaggaaaaaacagaagaaatgttaatgcagtcagaaaagagaaagcagtt
actgaagactgagtctatcaaagagtcagaatctctgaaggaacttgtacagccagagatacagccgcaggagtttct
taacagacgctatatgacagaagtaaatttttcaagaaaacaagaaaatgaagaacaatcctgggaagcagattat
gctaggaaaccaggtctcctcaaatgctggaatacacttccagaaccagatggtcaggagaagaagaaggagtcct
tggagtcgtgggagtcttctcttaagtctcaggaggtatccaagcctgtggtgtctttcgaacaggagaagctcaggcca
acattacaggaagagcagaagcagcagatttccatggcacctgtcagtcaatggaagccagaaagccctaagtcc
aaagtgggcagccctcaagaagagcagaatgtacaggagacgccaaagccgtgggtggttcagccacagaaag
aacaagatccaaagaagctacctcctggatcctgggcagtatctgtgcagagtgaacagagtggcagcagatcctg
gaccactcctgtgtgcagagaacaggcttcagtgcagcctgggactccagtatcctgggagaacaatgctgagaacc
agaaacactccttagtaccacaatcacagatctctctgaagtcctggggagcagcttcagcaggcctcttaccaaatg
acaaggtccctcccaggaagttaaatgtagagcccaaagatgtgcctaagcccatgcctcagcctatagactcttcct
ctccctttccaaaggatccagcattgaggaaagaaaaactgcaggacctcatgacccagattcaaggaacttgtaac
tttatgcaagagtctgttctagatgtcgacacaccctcaagtgcaattccatcttctcagccgccttcagcttcgccagtctc
tacagtatctgcagaacaaaacttgtccaaccaaagtgattttcttcaagagccatcaaaggcttcttctccagttacttgt
agctcgaatgcttgcttggttactactgatcaggcttctctgggatctgaaacagagtttatgacctcagagacccctgag
atggtggctcccccctgcaagccagcatctgcacttgcttctccaaatcctccactgtcgaagggcttccagttacctcct
gcaagtgggagctcggcagccattagcacagcaccctttcaggccatgcagacagtatttaatgttaatgcacctctgc
ctccacggaaagaacaagcaatgaaagaatctccttattcatctggctacagtcaaagttttacttcatcaagtacaca
gacagtatcccaatgtcagctcccagctgtacacgtggagcagacaacccaacctcccgagactgctgcaggttacc
atcctgatggaactgttcaagtaagcaatgggagccttgccttttacccagcacccacgagtatgtttcccagacctgct
cagccatttatcagtagtaggggggctctgagaggatgttcacgtggagggaggttactaatgaatccttatcggtctcc
tggtagctacaaaggttttgatagttacagaggccttccctcagcttcaagtgggacttacagccaactgcagctgcaa
gctagagagtatcctgggacaccttactctcagagggataatttccagcagtgttataaaagatcagggacatctagtg
gtcttcaggcaaattcaagagcagggtggagcgactcctctcaggtgagcagcccagagagagacagcgagacttt
taacagtggagactctggggtaggagactcccggagcatgaccccagtggatgtgccagtgacaagcccagcagc
cgccattctgccagtacacgtctatcctctgcctcagcaaatgcgagttgccttctcagctgccagaacatccaatctgg
ctcctggaactttagaccaacctattgtgtttgatcttctcctgaacaacttgggagagacctttgatcttcaacttggtagat
tcaattgcccagtaaatggcacttacgtgttcatttttcacatgctaaagctggctgtgaatgtaccactgtatgtcaacctc
atgaagaatgaggaggtcttggtgtcagcctatgccaatgatggtgctccagaccatgagacagcaagcaaccatgc
tattctccagctcctccagggagataagatatggttgcgcttacacaggggagcgatttatggaagtagctggaaatac
tctacattttcaggctatcttctttatcaagattga//
2
(b) Protein sequence, rat Caprin2, 1029aa
MKSAKSQVNHTQQGENQRALSPLQSTLSSAASPSQAYETYIDNGLICLKHKIRNIEK
50 KKLKLEDYKDRLKNGEQLNPDQLEAVEKYEEVLHNLEFAKELQKTFSALSQDLLKA
QKKAQRREHMLKLEAEKKKLRTILQIQYVLQNLTQEHVQKDFKGGLNGAMYLPSKE
LDYLIKFSKLTCPERNESLSVEDQMEQSSLYFWDLLEGSEKAVVGTTYKHVKDLLSK
LLHSGYFESVPVLRNSKEKTEEMLMQSEKRKQLLKTESIKESESLKELVQPEIQPQE
FLNRRYMTEVNFSRKQENEEQSWEADYARKPGLLKCWNTLPEPDGQEKKKESLE
SWESSLKSQEVSKPVVSFEQEKLRPTLQEEQKQQISMAPVSQWKPESPKSKVGSP
QEEQNVQETPKPWVVQPQKEQDPKKLPPGSWAVSVQSEQSGSRSWTTPVCREQ
PVSQWKPESPKSKVGSPQEEQNVQETPKPWVVQPQKEQDPKKLPPGSWAVSVQSEQSGSRSWTTPVCREQASVQP
ASVQPGTPVSWENNAENQKHSLVPQSQISLKSWGAASAGLLPNDKVPPRKLNVEP
GTPVSWENNAENQKHSLVPQSQISLKSWGAASAGLLPNDKVPPRKLNVEPKDVPKPMPQPIDSSSPFPKDPALRK
KDVPKPMPQPIDSSSPFPKDPALRKEKLQDLMTQIQGTCNFMQESVLDVDTPSSAI
EKLQDLMTQIQGTCNFMQESVLDVDTPSSAIPSSQPPSASPVSTVSAEQNLSNQSDFLQEPSKASSPVTCSSNAC
LVTTDQASLGSETEFMTSETPEMVAPPCKPASALASPNPPLSKGFQLPPASGSSAAISTAPFQAMQTVFNVNAPL
PSSQPPSASPVSTVSAEQNLSNQSDFLQEPSKASSPVTCSSNACLVTTDQASLGSE
60 PPRKEQAMKESPYSSGYSQSFTSSSTQTVSQCQLPAVHVEQTTQPPETAAGYHPDGTVQVSNGSLAFYPAPTSMF
TEFMTSETPEMVAPPCKPASALASPNPPLSKGFQLPPASGSSAAISTAPFQAMQTV
PRPAQPFISSRGALRGCSRGGRLLMNPYRSPGSYKGFDSYRGLPSASSGTYSQLQLQAREYPGTPYSQRDNFQQC
FNVNAPLPPRKEQAMKESPYSSGYSQSFTSSSTQTVSQCQLPAVHVEQTTQPPET
YKRSGTSSGLQANSRAGWSDSSQVSSPERDSETFNSGDSGVGDSRSMTPVDVPVTSPAAAILPVHVYPLPQQMRV
AAGYHPDGTVQVSNGSLAFYPAPTSMFPRPAQPFISSRGALRGCSRGGRLLMNPY
AFSAARTSNLAPGTLDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVPLYVNLMKNEEVLVS
AYANDGAPDHETASNHAILQLLQGDKIWLRLHRGAIYGSSWKYSTFSGYLLYQD*
!
RSPGSYKGFDSYRGLPSASSGTYSQLQLQAREYPGTPYSQRDNFQQCYKRSGTS
SGLQANSRAGWSDSSQVSSPERDSETFNSGDSGVGDSRSMTPVDVPVTSPAAAIL
PVHVYPLPQQMRVAFSAARTSNLAPGTLDQPIVFDLLLNNLGETFDLQLGRFNCPV
NGTYVFIFHMLKLAVNVPLYVNLMKNEEVLVSAYANDGAPDHETASNHAILQLLQGD
KIWLRLHRGAIYGSSWKYSTFSGYLLYQD*
(c)
70
double underline - nuclear export signal (NES)
wave underline and bold letters - RGG box (weak RNA binding)
3
(d)
CLUSTAL O(1.2.1) multiple sequence alignment
rCaprin2
MKSAKSQVNHTQQGENQRALSPLQSTLSSAASPSQAYETYIDNGLICLKHKIRNIEKKKL 60
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
rCaprin2
XRNG105
MPSATS---------------SKAVPGSTDAAPGNIQTEAMKQILGIIDKKLRNLEKKKG 45
* **.*
*: *:*.:
:.: * :.:*:**:****
KLEDYKDRLKNGEQLNPDQLEAVEKYEEVLHNLEFAKELQKTFSALSQDLLKAQKKAQRR 120
KLDDYQDRLDKGERLNQDQMDAVTKHQEVVANMEFARELQRNFMALGQDMQKTIKKAARR 105
**:**:***.:**:** **::** *::**: *:***:***:.* **.**: *: *** **
EHMLKLEAEKKKLRTILQIQYVLQNLTQEHVQKDFKGGLNGAMYLPSKELDYLIKFSKLT 180
EQLMREEAEQKRLKTVLEFQFVLDKLGDEEVRNDLKQGLDGVLVVSEEELSLLDEFYKLV 165
*:::: ***:*:*:*:*::*:**::* :*.*::*:* **:*.: : .:**. * :* **.
CPERNESLSVEDQMEQSSLYFWDLLEGSEKAVVGTTYKHVKDLLSKLLHSGYFESVPVLR 240
NPDRDTSVRLSDQYEQASIHLWDVLDSKEKSVCGTTYKSLKDLLDRILQSGYFDSAQNHQ 225
*:*: *: :.** **:*:::**:*:..**:* ***** :****.::*:****:*.
:
NSKEKTEEMLMQSEKRKQLLKTESIK------ESESLKEL-VQPEIQPQEFLNRRYMTEV 293
NGLCEEEE-------EEEPLAAPPVEEQAPELEPEPVEEYTEPSEVESTEFVNRQFMTEA 278
*. : **
.:: * : ::
* * ::*
*:: **:**::***.
NFSRKQENEEQSWEADYARKPGLLKCWNTLPEPDGQEKKKESLESWESSLKSQEVSKPVV 353
QYSGSEKEQVDEWTVETVEVVNSLQQA--------------------------------- 305
::* .:::: :.* .: ..
*:
SFEQEKLRPTLQEEQKQQISMAPVSQWKPESPKSKVGSPQEEQNVQETPKPWVVQPQKEQ 413
------------------------------------------------------------ 305
DPKKLPPGSWAVSVQSEQSGSRSWTTPVCREQASVQPGTPVSWENNAENQKHSLVPQSQI 473
------------------------------------------------------------ 305
SLKSWGAASAGLLPNDKVPPRKLNVEPKDVPKPMPQPIDSSSPFPKDPALRKEKLQDLMT 533
---------------------------ATPPIPEPLALNAIVQVQPDPIVRRQRVQDLMA 338
* * * :::
. ** :*::::****:
QIQGTCNFMQESVLDVDTPSS--AIPSSQPPSASPVSTVSAEQNLSNQSDFLQEPSKASS 591
QMQGPYNFMQDSMLEFESQPMDPAIVSAQPMNLSQSMD--LPQMLCPP---VHSEPRPSQ 393
*:** ****:*:*:.::
** *:** . *
* *.
::. : *.
PVTCSS--NACLVTTD-QASLGSETEFMTS--------------ETPE--MVAPPCKPAS 632
PIQVPDTTQVALVSSPSEAYTGSPEIYQPSHPIEARTQNDAMEQIQASLSLNPDPTQTLS 453
*:
. :..**:: :* **
: *
. :
* : *
AL-ASPNPPLSKGFQLPPASGSSAAISTAPFQAMQTVFNVNAPLPPRKEQ-AMKESPYSS 690
SIPAASQPQVFQTGSNKPLHSSGINVNAAPFQSMQTVFNMNAPVPPVNEPETLKQNQYQA 513
:: *: :* : : . * .*. :.:****:******:***:** :* ::*:. *.:
GYSQSFTSSSTQTVSQCQLPAVHVEQTTQPPETAAGYHPDGTVQVS-NGSLAFYPAPTSM 749
SYNQTFPGQPHQVEQ-TELQPEQL------QTVVNSYHATSEQAHQAPSGHQQPTQQNAG 566
.*.*:* .. *. . :*
::
.. .** .
. ..
.:
FPRPAQPFISSRGALRGCSRGGRLLMNPYRSP--GSYKGFDSYRGL-PSASSGTYSQLQL 806
FPRNSQPFYNNRGMARGGQRGNRGMMNGYRGQSNGFRGGYDGYRAAFPNTPNSGYPQAQF 626
*** :*** ..** ** .** * :** **.
*
*:*.**. *.: .. * * *:
QAREYPGTPYSQRDNFQQCYKRSGTSSGLQANSRAGWSDSSQVSSPERDSETFNSGDSGV 866
NAPRDYSN-NYQRDGYQQNFKRGAGQGGPRVAPRGH-GG---PPRPSRGIP--------- 672
:* . ..
*** :** :**.. ..* :. *. .
*.*
GDSRSMTPVDVPVTSPAAAILPVHVYPLPQQMRVAFSAARTSNLAPGTLDQPIVFDLLLN 926
----QMNPQQVN------------------------------------------------ 680
.*.* :*
NLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVPLYVNLMKNEEVLVSAYANDGAPDHE 986
------------------------------------------------------------ 680
TASNHAILQLLQGDKIWLRLHRGAIYGSSWKYSTFSGYLLYQD
1029
------------------------------------------680
!Hypothetical functional domains of the rat Caprin-2 protein, based on alignment to
the Xenopus RNG105 protein sequence, a paralogue of the well-analyzed rat Caprin
1 protein, highly homologous to Caprin-2.
80
Solid underline - coiled-coil domain (strong RNA binding)
Dotted underline - nuclear localization signal (NLS)
Double underline - nuclear export signal (NES)
Wave underline and bold letters - RGG box (weak RNA binding)
Download