Supplementary File 1
RAT CAPRIN 2
(a) cDNA cloned from rat brain and verified by sequencing. Exons marked in black and blue were predicted from the GT and AG dinucleotides flanking the introns.
>atgaagtcagccaagtcccaagtgaaccacactcagcaaggggaaaaccagcgggctctgagccccctgcagt
ctactctcagttctgctgcatctccttcccaggcatacgaaacctatattgataatggacttatatgccttaaacacaaaatt
aggaacatcgagaagaagaagctcaaactggaagattacaaagatcgcctgaaaaatggagagcagcttaaccc
agaccagttggaagcagtggaaaagtatgaagaagtacttcataatttggaatttgccaaggagcttcagaaaaccttt
tctgcactgagccaagatctcctgaaagcgcagaaaaaggcccagagaagggagcacatgctaaaacttgaggc
cgagaagaaaaagcttcgaactatacttcaaattcagtatgtattacagaacttgacacaagaacatgtacagaaag
acttcaaagggggcttgaatggtgcaatgtatttgccttcaaaagaacttgactacctcattaaattctcaaaactgacct
gccctgaaagaaatgaaagtttgagtgttgaagaccagatggagcagtatccttgtacttttgggaccttttggaaggta
gtgagaaagcagtggtaggaacaacatacaaacatgtgaaagacctgctgtccaaattgctgcactcaggttattttg
aaagtgtcccagttctcaggaattctaaggaaaaaacagaagaaatgttaatgcagtcagaaaagagaaagcagtt
actgaagactgagtctatcaaagagtcagaatctctgaaggaacttgtacagccagagatacagccgcaggagtttct
taacagacgctatatgacagaagtaaatttttcaagaaaacaagaaaatgaagaacaatcctgggaagcagattat
gctaggaaaccaggtctcctcaaatgctggaatacacttccagaaccagatggtcaggagaagaagaaggagtcct
tggagtcgtgggagtcttctcttaagtctcaggaggtatccaagcctgtggtgtctttcgaacaggagaagctcaggcca
acattacaggaagagcagaagcagcagatttccatggcacctgtcagtcaatggaagccagaaagccctaagtcc
aaagtgggcagccctcaagaagagcagaatgtacaggagacgccaaagccgtgggtggttcagccacagaaag
aacaagatccaaagaagctacctcctggatcctgggcagtatctgtgcagagtgaacagagtggcagcagatcctg
gaccactcctgtgtgcagagaacaggcttcagtgcagcctgggactccagtatcctgggagaacaatgctgagaacc
agaaacactccttagtaccacaatcacagatctctctgaagtcctggggagcagcttcagcaggcctcttaccaaatg
acaaggtccctcccaggaagttaaatgtagagcccaaagatgtgcctaagcccatgcctcagcctatagactcttcct
ctccctttccaaaggatccagcattgaggaaagaaaaactgcaggacctcatgacccagattcaaggaacttgtaac
tttatgcaagagtctgttctagatgtcgacacaccctcaagtgcaattccatcttctcagccgccttcagcttcgccagtctc
tacagtatctgcagaacaaaacttgtccaaccaaagtgattttcttcaagagccatcaaaggcttcttctccagttacttgt
agctcgaatgcttgcttggttactactgatcaggcttctctgggatctgaaacagagtttatgacctcagagacccctgag
atggtggctcccccctgcaagccagcatctgcacttgcttctccaaatcctccactgtcgaagggcttccagttacctcct
gcaagtgggagctcggcagccattagcacagcaccctttcaggccatgcagacagtatttaatgttaatgcacctctgc
ctccacggaaagaacaagcaatgaaagaatctccttattcatctggctacagtcaaagttttacttcatcaagtacaca
gacagtatcccaatgtcagctcccagctgtacacgtggagcagacaacccaacctcccgagactgctgcaggttacc
atcctgatggaactgttcaagtaagcaatgggagccttgccttttacccagcacccacgagtatgtttcccagacctgct
cagccatttatcagtagtaggggggctctgagaggatgttcacgtggagggaggttactaatgaatccttatcggtctcc
tggtagctacaaaggttttgatagttacagaggccttccctcagcttcaagtgggacttacagccaactgcagctgcaa
gctagagagtatcctgggacaccttactctcagagggataatttccagcagtgttataaaagatcagggacatctagtg
gtcttcaggcaaattcaagagcagggtggagcgactcctctcaggtgagcagcccagagagagacagcgagacttt
taacagtggagactctggggtaggagactcccggagcatgaccccagtggatgtgccagtgacaagcccagcagc
cgccattctgccagtacacgtctatcctctgcctcagcaaatgcgagttgccttctcagctgccagaacatccaatctgg
ctcctggaactttagaccaacctattgtgtttgatcttctcctgaacaacttgggagagacctttgatcttcaacttggtagat
tcaattgcccagtaaatggcacttacgtgttcatttttcacatgctaaagctggctgtgaatgtaccactgtatgtcaacctc
atgaagaatgaggaggtcttggtgtcagcctatgccaatgatggtgctccagaccatgagacagcaagcaaccatgc
tattctccagctcctccagggagataagatatggttgcgcttacacaggggagcgatttatggaagtagctggaaatac
tctacattttcaggctatcttctttatcaagattga//

(b) Protein sequence, rat Caprin2, 1029 aa
MKSAKSQVNHTQQGENQRALSPLQSTLSSAASPSQAYETYIDNGLICLKHKIRNIEK
KKLKLEDYKDRLKNGEQLNPDQLEAVEKYEEVLHNLEFAKELQKTFSALSQDLLKA
QKKAQRREHMLKLEAEKKKLRTILQIQYVLQNLTQEHVQKDFKGGLNGAMYLPSKE
LDYLIKFSKLTCPERNESLSVEDQMEQSSLYFWDLLEGSEKAVVGTTYKHVKDLLSK
LLHSGYFESVPVLRNSKEKTEEMLMQSEKRKQLLKTESIKESESLKELVQPEIQPQE
FLNRRYMTEVNFSRKQENEEQSWEADYARKPGLLKCWNTLPEPDGQEKKKESLE
SWESSLKSQEVSKPVVSFEQEKLRPTLQEEQKQQISMAPVSQWKPESPKSKVGSP
QEEQNVQETPKPWVVQPQKEQDPKKLPPGSWAVSVQSEQSGSRSWTTPVCREQ
ASVQPGTPVSWENNAENQKHSLVPQSQISLKSWGAASAGLLPNDKVPPRKLNVEP
KDVPKPMPQPIDSSSPFPKDPALRKEKLQDLMTQIQGTCNFMQESVLDVDTPSSAI
PSSQPPSASPVSTVSAEQNLSNQSDFLQEPSKASSPVTCSSNACLVTTDQASLGSE
TEFMTSETPEMVAPPCKPASALASPNPPLSKGFQLPPASGSSAAISTAPFQAMQTV
FNVNAPLPPRKEQAMKESPYSSGYSQSFTSSSTQTVSQCQLPAVHVEQTTQPPET
AAGYHPDGTVQVSNGSLAFYPAPTSMFPRPAQPFISSRGALRGCSRGGRLLMNPY
RSPGSYKGFDSYRGLPSASSGTYSQLQLQAREYPGTPYSQRDNFQQCYKRSGTS
SGLQANSRAGWSDSSQVSSPERDSETFNSGDSGVGDSRSMTPVDVPVTSPAAAIL
PVHVYPLPQQMRVAFSAARTSNLAPGTLDQPIVFDLLLNNLGETFDLQLGRFNCPV
NGTYVFIFHMLKLAVNVPLYVNLMKNEEVLVSAYANDGAPDHETASNHAILQLLQGD
KIWLRLHRGAIYGSSWKYSTFSGYLLYQD*

(c) Hypothetical functional domains of the rat Caprin-2 protein, based on alignment to the Xenopus RNG105 protein sequence, a paralogue of the well-analyzed rat Caprin-1 protein, highly homologous to Caprin-2.
Solid underline - coiled-coil domain (strong RNA binding)
Dotted underline - nuclear localization signal (NLS)
Double underline - nuclear export signal (NES)
Wave underline and bold letters - RGG box (weak RNA binding)

(d) CLUSTAL O(1.2.1) multiple sequence alignment

rCaprin2  MKSAKSQVNHTQQGENQRALSPLQSTLSSAASPSQAYETYIDNGLICLKHKIRNIEKKKL 60
XRNG105   MPSATS---------------SKAVPGSTDAAPGNIQTEAMKQILGIIDKKLRNLEKKKG 45
          * **.* *: *:*.: :.: * :.:*:**:****

rCaprin2  KLEDYKDRLKNGEQLNPDQLEAVEKYEEVLHNLEFAKELQKTFSALSQDLLKAQKKAQRR 120
XRNG105   KLDDYQDRLDKGERLNQDQMDAVTKHQEVVANMEFARELQRNFMALGQDMQKTIKKAARR 105
          **:**:***.:**:** **::** *::**: *:***:***:.* **.**: *: *** **

rCaprin2  EHMLKLEAEKKKLRTILQIQYVLQNLTQEHVQKDFKGGLNGAMYLPSKELDYLIKFSKLT 180
XRNG105   EQLMREEAEQKRLKTVLEFQFVLDKLGDEEVRNDLKQGLDGVLVVSEEELSLLDEFYKLV 165
          *:::: ***:*:*:*:*::*:**::* :*.*::*:* **:*.: : .:**. * :* **.

rCaprin2  CPERNESLSVEDQMEQSSLYFWDLLEGSEKAVVGTTYKHVKDLLSKLLHSGYFESVPVLR 240
XRNG105   NPDRDTSVRLSDQYEQASIHLWDVLDSKEKSVCGTTYKSLKDLLDRILQSGYFDSAQNHQ 225
          *:*: *: :.** **:*:::**:*:..**:* ***** :****.::*:****:*. :

rCaprin2  NSKEKTEEMLMQSEKRKQLLKTESIK------ESESLKEL-VQPEIQPQEFLNRRYMTEV 293
XRNG105   NGLCEEEE-------EEEPLAAPPVEEQAPELEPEPVEEYTEPSEVESTEFVNRQFMTEA 278
          *. : ** .:: * : :: * * ::* *:: **:**::***.

rCaprin2  NFSRKQENEEQSWEADYARKPGLLKCWNTLPEPDGQEKKKESLESWESSLKSQEVSKPVV 353
XRNG105   QYSGSEKEQVDEWTVETVEVVNSLQQA--------------------------------- 305
          ::* .:::: :.* .: .. *:

rCaprin2  SFEQEKLRPTLQEEQKQQISMAPVSQWKPESPKSKVGSPQEEQNVQETPKPWVVQPQKEQ 413
XRNG105   ------------------------------------------------------------ 305

rCaprin2  DPKKLPPGSWAVSVQSEQSGSRSWTTPVCREQASVQPGTPVSWENNAENQKHSLVPQSQI 473
XRNG105   ------------------------------------------------------------ 305

rCaprin2  SLKSWGAASAGLLPNDKVPPRKLNVEPKDVPKPMPQPIDSSSPFPKDPALRKEKLQDLMT 533
XRNG105   ---------------------------ATPPIPEPLALNAIVQVQPDPIVRRQRVQDLMA 338
          * * * ::: . ** :*::::****:

rCaprin2  QIQGTCNFMQESVLDVDTPSS--AIPSSQPPSASPVSTVSAEQNLSNQSDFLQEPSKASS 591
XRNG105   QMQGPYNFMQDSMLEFESQPMDPAIVSAQPMNLSQSMD--LPQMLCPP---VHSEPRPSQ 393
          *:** ****:*:*:.:: ** *:** . * * *. ::. : *.

rCaprin2  PVTCSS--NACLVTTD-QASLGSETEFMTS--------------ETPE--MVAPPCKPAS 632
XRNG105   PIQVPDTTQVALVSSPSEAYTGSPEIYQPSHPIEARTQNDAMEQIQASLSLNPDPTQTLS 453
          *: . :..**:: :* ** : * . : * : *

rCaprin2  AL-ASPNPPLSKGFQLPPASGSSAAISTAPFQAMQTVFNVNAPLPPRKEQ-AMKESPYSS 690
XRNG105   SIPAASQPQVFQTGSNKPLHSSGINVNAAPFQSMQTVFNMNAPVPPVNEPETLKQNQYQA 513
          :: *: :* : : . * .*. :.:****:******:***:** :* ::*:. *.:

rCaprin2  GYSQSFTSSSTQTVSQCQLPAVHVEQTTQPPETAAGYHPDGTVQVS-NGSLAFYPAPTSM 749
XRNG105   SYNQTFPGQPHQVEQ-TELQPEQL------QTVVNSYHATSEQAHQAPSGHQQPTQQNAG 566
          .*.*:* .. *. . :* :: .. .** . . .. .:

rCaprin2  FPRPAQPFISSRGALRGCSRGGRLLMNPYRSP--GSYKGFDSYRGL-PSASSGTYSQLQL 806
XRNG105   FPRNSQPFYNNRGMARGGQRGNRGMMNGYRGQSNGFRGGYDGYRAAFPNTPNSGYPQAQF 626
          *** :*** ..** ** .** * :** **. * *:*.**. *.: .. * * *:

rCaprin2  QAREYPGTPYSQRDNFQQCYKRSGTSSGLQANSRAGWSDSSQVSSPERDSETFNSGDSGV 866
XRNG105   NAPRDYSN-NYQRDGYQQNFKRGAGQGGPRVAPRGH-GG---PPRPSRGIP--------- 672
          :* . .. *** :** :**.. ..* :. *. . *.*

rCaprin2  GDSRSMTPVDVPVTSPAAAILPVHVYPLPQQMRVAFSAARTSNLAPGTLDQPIVFDLLLN 926
XRNG105   ----QMNPQQVN------------------------------------------------ 680
          .*.* :*

rCaprin2  NLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVPLYVNLMKNEEVLVSAYANDGAPDHE 986
XRNG105   ------------------------------------------------------------ 680

rCaprin2  TASNHAILQLLQGDKIWLRLHRGAIYGSSWKYSTFSGYLLYQD 1029
XRNG105   ------------------------------------------- 680
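The cDNA in panel (a) and the protein in panel (b) are related by translation in the standard genetic code. The following is a minimal sketch of that step, not the authors' method: the names `CODON_TABLE` and `translate` are illustrative, and the standard code (NCBI translation table 1) with reading frame 1 is an assumption. It only checks the first ten codons and the last three codons of the cDNA against the ends of the protein listing; it makes no claim about the full-length sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna: str) -> str:
    """Translate a cDNA string in frame 1; '*' marks a stop codon."""
    # Keep only nucleotide letters so '>' markers, '//' and line breaks are ignored.
    seq = "".join(ch for ch in cdna.upper() if ch in "ACGTN")
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons containing N
        for i in range(0, usable, 3)
    )


# First 30 nt and last 9 nt of the sequence in panel (a).
print(translate(">atgaagtcagccaagtcccaagtgaaccac"))  # MKSAKSQVNH
print(translate("caagattga//"))  # QD*
```

Both outputs agree with the first ten residues and the last two residues plus stop of the protein in panel (b).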