Supplementary Tables Suppl. Table S2: Primers used for the amplification of antibiotic resistance genes, virulence genes, parts of the E. faecalis PAI, plasmid replicase genes and other genes. Gene GenBank Primer name Primer sequence (5' → 3') Amplicon [bp] Reference vanA I-1 TCTGCAATAGAGATAGCCGC 377 [1] vanA II-2 GGAGTAGCTATCCCAGCATT ermB-F AGCCATGCGTCTGACATCTAT 341 [1] ermB-R TGCTCATAAGTAACGGTACT tetM-F GGTGAACATCATAGACACGC 401 This study tetM-R CTTGTTCGAGTTCCAATGC gen-F TAATCCAAGAGCAATAAGGGC 227 This study gen-R GCCACACTATCATAACCACTA aadE-1 GCAGAACAGGATGAACGTATTCG 369 [1] aadE-2 ATCAGTCGGAACTATGTCCC PAIasc10-1 GCCAAAGTGGAACGTTAAATG 345 [2] PAIasc10-2 TCAATCCAGAAGGTCCTGTG cylM-TQ1 GATGCGTATTACTGTTGTTAGAATGAGAT 150 [2] cylM-TQ2 GAGTCTCCCTGTGATTCTGATATAGAGTT Acc. No. vanA erm(B) tet(M) aacA-aphD aadE asc10 cylM AF516335 AF516335 n.d. n.d. AF516335 EF0005 EF0046 1 esp xyl kinase gls24-like EF0056 EF0083 EF0117 AE016830.1 gelE fsrB cpsA cpsB cpsC cpsF M37185.1 EF_1821* EF_0095 EF_0094 EF_0093 EF_0090 esp-TIM1 CTTTGATTCTTGGTTGTCGGATAC esp-TIM2 TCCAACTACCACGGTTTGTTTATC PAIefc-83F GGAGCTGATAATGCTTGTGC PAIefc-83R AAGAATTACCTGCTGCCAAC gls24-F TGAAGCAAATTCTCCAGTAGC gls24-R TGGAGTGGATGTTGAAGTAGG PAI164 ATGCCATGTTCAGCGAAGTTGCCAATTATC PAI167 ATGTTGGTTGAAAGTTGCTTTTTGGCAAAC gelE-F TATGACAATGCTTTTTGGGAT gelE-R AGATGCACCCGAAATAATATA fsrB-1 GCATTGTTATCTATGTCGCCATACC fsrB-2 GGCTTAGTTCCCACACCATC cpsA-F GTGTCTCCTGAAAAATCAGGCC cpsA-R GTTAAAGTCAATGTAATGGGCTACC cpsB-F CTATCAAAACGATCTAAAATACCACC cpsB-R GATTAACGTTATTAAGTTTTGAAGGCG cpsC-F CCAACGCTTTGCTTCTTGAATGAC cpsC-R CCTGAATATCAATGTATTTGGGCAGTC cpsF-F GGCGATCTATTCTACCATCCGCGC 2 475 [3] 202 [4] 262 [2] * [4,5] 213 [3] 397 [6] 383 [7] 619 [7] 300 [6] 324 [7] CRISPR1-cas CRISPR1 cas_csn CRISPR2 CRISPR3-cas iolB iolG2 iolE cpsF-R CCAAAGAAAGATATTTTGGATTGAG CRISPR1-cas-F GCGATGTTAGCTGATACAAC CRISPR1-cas-R CGAATATGCCTGTGGTGAAA CRISPR1cas_csn1-F CAGAAGACTATCAGTTGGTG AE016830.1 NC_017316.1 CRISPR1cas_csn1-R CCTTCTAAATCTTCTTCATAG CRISPR2-F CTGGCTCGCTGTTACAGCT CRISPR2-R GCCAATGTTACAATATCAAACA CRISPR3-cas-F GATCACTAGGTTCAGTTATTTC CRISPR3-cas-R CATCGATTCATTATTCCTCCAA AE016830.1 AE016830.1 NC_017316.1 NC_017316.1 NC_017316.1 iolR NC_017316.1 PAI 1 AF454824.1: C:427 iolB-F CCATCTGGCACGCCGACAGGA iolB-R GCCAGTGCACGTGATTACCGCTG iolG2-F GCGTTTGCCAGTCGGGCGAAA iolG2-R TGGTACAGGTGGGCTTCATGCGT iolE-F ACGGATTGGCTTTGGCCGGATCT iolE-R TGGGGACAGGAGTCCAAACGACTG iolR-F TCCCTAATCGCCACACTA PAI164 ATGCCATGTTCAGCGAAGTTGCCAATTATC 3 316 [8] 783 [8] variabel [8] 225 [8] 1548 363 This study 465 This study 363 This study 379 [9] [4] PAI 2a PAI 2b PAI 2c PAI 3a PAI 3b PAI 4a PAI 4b PAI 5a PAI 5b AF454824: 240:9913 AF454824: 9869:15934 AF454824: 14033:21999 AF454824: 21566:32079 AF454824: 31320:42730 AF454824: 42395:53473 AF454824: 53440:65000 AF454824: 64944:75572 AF454824: 74158:85588 1 PAI R GGAAGATGGACGGTTGATGAAGCCTCAATATG 2a PAI F CAGTTGTCGAATACGATGCATGTCCCAGCC 2a PAI R AAACCAAAGGAACCGAAACGGAAAAACTTAGCATGG 2b PAI F TTTAACCAGCCATGCTAAGTTTTTCCGTTTCGGTTC 2b PAI R TTTGAAATAATCTCCAACTTTTCCCCCGTTCCACAC 2c PAI F AACCATAAAAAGGAACGGAGGGAGCACAACAAAAGG 2c PAI R ACTTGCAGTGTGACTGTCTGTCGTAACTTCACC 3a PAI F CTCGTCCGTAACGATCTGTTTTATCGCCCTTATC 3a PAI R TCAAGTCCGTACAACAGGCACTTTCTTTATCAAGC 3b PAI F GAAGGCCGTTGCCAATTTTGCATTAGCTTGC 3b PAI R TCCTAAGCCTATGGTAAAACATGCTGGAGTTGTCTC 4a PAI F CAAGGTAGTGGAGATGTTCAGGCTGAGACAACAC 4a PAI R CGGATGTTACTTCTGCTGGACTTAAAACAATCCC 4b PAI F GGGATTGTTTTAAGTCCAGCAGAAGTAACATCCG 4b PAI R ACGCCAAGCACAAGGGATAAAGATTGCGAAAG 5a PAI F GGACGACCTTTATAGACGCCGTTTGCTTTCG 5a PAI R AGTCCCCTTTTTCTGCCATGACACCAGTTAAAATC 5b PAI F GCTGTGGTCAAGATAGATGGGAAAGAGATTGAGCG 5b PAI R GGATCTGAACCGTCTTGTGTCATAGTGTGCCAG 4 9674 [4] 6066 [4] 7697 [4] 11645 [4] 11411 [4] 11079 [4] 11561 [4] 10629 [4] 11431 [4] PAI 6a PAI 6b PAI 7a PAI 7b PAI 8a PAI 8b PAI 9 rep-pCF10 rep-pRE25 AF454824: 85547:94661 AF454824: 93984:102421 AF454824: 101954:113046 AF454824: 113008:126865 AF454824: 125344:136351 AF454824: 135337:146384 AF454824: 146272:C AY885841 X92945 6a PAI F TGTAGCATACTGGCACACTATGACACAAGACGG 6a PAI R CGTGCCCCTAATTACCATAGAGATAGTCGCGTTG 6b PAI F TGGTAAACGCTGCTCCTGAAATGAAGAGTTTGAC 6b PAI R AGGTTTGATACGCAACTACCTTTCCCAACTGACG 7a PAI F TTTTGGGACAGGAACGCTATCAGTTAACGATTGC 7a PAI R CCTGCGGTCAAGCACAGTTGCCTTATCTTAG 7b PAI F ATTAAAGTCAAAAGAGACTGTTACTTGTGCGCCCTG 7b PAI R TCAGCAAACTAAGATAAGGCAACTGTGCTTGACC 8a PAI F TGCTTTAGTGGGTCGTACTAACGGAACAATAG 8a PAI R CAAACAACACGTCGTCGATCTTTACCTTG 8b PAI F CACCAATGCACATAATCAAACAATTCTAGGCGTAG 8b PAI R GTGGACAAGCACAGTCACAATTAGAAGCAATG 9 PAI F CATCATTTCTTCAGCAAATTGGTTGGCACGC PAI167R ATGTTGGTTGAAAGTTGCTTTTTGGCAAAC repCF10-1 GCTCGATCARTTTTCAGAAG repCF10-2 CGCAAACATTTGTCWATTTCTT repRE25-1 GAGAACCATCAAGGCGAAAT repRE25-2 ACCAGAATAAGCACTACGTACAATCT 5 9115 [4] 8432 [4] 10821 [4] 13858 [4] 11008 [4] 11048 [4] 8298 [4] 201 [10] 630 [10] Suppl Table S3: Quality report of 454 sequencing data assembled with Newbler software. Isolate Origin GC content Number of Calculated genome Coverage [%] contigs size [bp] [n-fold] UW6149 B 37.51 163 3,239,149 13.92 UW2860 B 37.51 104 3,062,478 15.78 UW6724 HC 37.31 298 3,050,235 10.04 UW7761 B 37.51 178 2,912,463 11.60 UW7777 HC 37.55 133 2,945,792 34.32 UW7780 HC 37.35 267 3,059,826 15.60 UW7753 B 37.32 148 3,158,895 13.54 UW1833 U 37.10 228 3,190,695 17.29 UW7779 HC 37.21 143 3,024,279 22.45 UW7729 AC 37.26 370 2,946,024 11.16 UW7801 M 37.51 218 2,900,593 10.55 UW6727 HC 37.02 236 3,275,508 18.39 D32 AC 37.63 71 2,840,807 24.36 UW7709 E 37.26 94 2,921,715 23.13 UW7742 AC 37.34 124 2,889,018 22.59 Genomes were de novo sequenced and assembly was done with Newbler assembler software. Assembly of only two strains D32 and UW7709 resulted in less than 100 contigs. Genome size varied between 2.8 to 3.3Mbp, irrespective of the clinical or non-clinical background of the isolates. Coverage of the genomes was between 10 to 34-fold; AC, animal colonizer; B, blood culture; E, endocarditis; HC, human colonizer; M, bovine mastitis; U, urine. 6 Suppl. Table S4: SwissProt and BLASTP analyses of a putative capsule-encoding region within 1891 UDP-glucose 6dehydrogenase UDP-glucose 6dehydrogenase (0.0, 64%, 100%) Glycosyltransferase Gtf1 (2e-07, 26%, 45%) 1892 Glycosyl transferase, group 1 family protein 1895 UDP-glucose 4epimerase UDP-glucose 4epimerase (2e-163, 65%, 98%) 1897 Undecaprenylphosphate galactosephosphotra nsferase 1901 Putative tyrosineprotein phosphatase CapC Undecaprenyl phosphate N,N'diacetylbacillosamine 1-phosphate transferase (3e-77, 58%, 96%)b Tyrosine-protein phosphatase YwqE (3e-61, 41%, 100%) 1902 Tyrosine-protein kinase YwqD Tyrosine-protein kinase YwqD (2e-69, 50%, 93%) 1903 Capsular polysaccharide synthesis enzyme 1904 Transcriptional regulator lytR Probable capsular polysaccharide biosynthesis protein YwqC (8e-42, 38%, 93%) Transcriptional regulator LytR (3e-163, 72%, 96%) None Group 1 glycosyl transferase (2e-08, 26%, 43%) Capsular polysaccharide biosynthesis protein Cps4J (1e-169, 65%, 98%) Capsular polysaccharide biosynthesis protein Cps4E (4e-38, 47%, 91%) Capsular polysaccharide biosynthesis protein Cps4B (9e-25, 27%, 88%) Capsular polysaccharide biosynthesis protein Cps4D (5e-40, 38%, 88%) Transcriptional regulator (1e-70, 41%, 94%) UDP-glucose dehydrogenase (2e-41, 29%, 86%) UDP-glucose: polyglycerol phosphate alphaglucosyltransferase (1e-07, 30%, 36%) EpsC, UDP-sugar epimerase (1e-42, 35%, 88%) Phosphotransferase (6e-78, 56%, 96%)c YwqE, protein tyrosinephosphatase (3e-63, 41%, 100%) YwqD, protein tyrosine kinase (2e-70; 50%, 93%) YwqC, modulator of YwqD protein tyrosine kinase activity (5e-41, 38%, 93%) Membrane-bound transcriptional regulator LytR (4e-81, 41%, 90%) a Values in parentheses are E value, % identity, % query coverage; b second best hit: uncharacterized sugar transferase EpsL (7e-76, 56%, 96%); c second best hit: TuaA, putative undecaprenyl-phosphate Nacetylgalactosaminyl 1-phosphate transferase (4e-26, 40%, 69%). 7 Transcriptional regulator (1e-70, 41%, 94%) BLASTP - best hitsa B. subtilis 168 BLASTP - best hitsa S. pneumoniae TIGR4 SwissProt - best hitsa annotated product name Locus Tag EFD32_ the E. faecalis D32 GI. Suppl. Table S5: Identification of CRISPR loci in selected E. faecalis ST40 strains by PCR. Isolate cas_csn1 CRISPR1-cas Size [bp] CRISPR2 No. of spacer V583 - - 773 + 2 OG1RF + + 1031 + 7 UW1833 + + 870 + 4 UW6724 + + 886 + 4 UW6727 + + 857 + 4 UW7777 + + 864 + 4 UW7779 + + 892 + 4 UW2860 + + 891 + 4 UW6149 + + 892 + 4 UW7801 + + 861 + 4 UW7729 - - 927 + 5 UW7709 + + 834 + 4 UW7742 + + 988 + 6 D32 + + 1432 + 13 UW7761 + + 915 + 5 UW7753 + + 860 + 4 UW7780 + + 831 + 4 Analog to [8], presence of CRISPR loci corresponding to OG1RF CRISPR1-cas and CRISPR2 loci [9,11] was checked by PCR. CRISPR2 locus was also sequenced to identify the integrated spacer. Analog to [11], existence of an empty CRISPR2 locus of the hospital-adapted V583 strain, also lacking the functional cas genes [8,11] was demonstrated. 8 Suppl. Table 6a: Aerobic utilization of carbon sources of Biolog MicroArray™ PM01. C-source Negative Control L-Arabinose N-Acetyl-D-Glucosamine D-Saccharic Acid Succinic Acid D-Galactose L-Aspartic Acid L-Proline D-Alanine D-Trehalose D-Mannose Dulcitol D-Serine D-Sorbitol Glycerol L-Fucose D-Glucuronic Acid D-Gluconic Acid D,L-a-Glycerol- Phosphate D-Xylose L-Lactic Acid Formic Acid D-Mannitol L-Glutamic Acid D-Glucose-6-Phosphate D-Galactonic Acid-g-Lactone D,L-Malic Acid D-Ribose Tween 20 L-Rhamnose D-Fructose Acetic Acid a-D-Glucose Maltose D-Melibiose Thymidine L-Asparagine D-Aspartic Acid D-Glucosaminic Acid 1,2-Propanediol Tween 40 a-Keto-Glutaric Acid a-Keto-Butyric Acid a-Methyl-D-Galactoside a-D-Lactose Lactulose Sucrose Uridine L-Glutamine m-Tartaric Acid D-Glucose-1-Phosphate D-Fructose-6-Phosphate Tween 80 a-Hydroxy Glutaric Acid-g-Lactone a-Hydroxy-Butyric Acid b-Methyl-D-Glucoside Adonitol Maltotriose 2`-Deoxy-Adenosine Adenosine Glycyl-L-Aspartic Acid Citric Acid V583 OG1RF UW6149 UW2860 UW6724 UW7761 UW7777 UW7780 UW7753 UW1833 UW7779 UW7729 UW7801 UW6727 D32 UW7709 UW7742 m-Inositol D-Threonine Fumaric Acid Bromo-Succinic Acid Propionic Acid Mucic Acid Glycolic Acid Glyoxylic Acid D-Cellobiose Inosine Glycyl-L-Glutamic Acid Tricarballylic Acid L-Serine L-Threonine L-Alanine L-Alanyl-Glycine Acetoacetic Acid N-Acetyl-b-D-Mannosamine Mono Methyl Succinate Methyl Pyruvate D-Malic Acid L-Malic Acid Glycyl-L-Proline p-Hydroxy-Phenylacetic Acid m-Hydroxy-Phenylacetic Acid Tyramine D-Psicose L-Lyxose Glucuronamide Pyruvic Acid L-Galactonic Acid-g-Lactone D-Galacturonic Acid b-Phenylethylamine Ethanolamine Mean area values were calculated of three independent experiments performed at 37°C for a 72h incubation time. For reasons of simplification, mean area values have been replaced by a color code. Grey color indicated no or only weak substrate utilization (mean area value below 10,000) and pink symbolized middle values (mean area value between 10,000 and 19,999). The green color represented strains with a high capability of utilization of the respective carbon source (mean area value greater or equal to 20,000). The red box highlighted mInositol as the carbon source of main interest. 9 Suppl. Table 6b: Aerobic utilization of carbon sources of Biolog MicroArray™ PM02. C-source V583 Negative Control Chondroitin Sulfate C a-Cyclodextrin b-Cyclodextrin g-Cyclodextrin Dextrin Gelatin Glycogen Inulin Laminarin Mannan Pectin N-Acetyl-D-Galactosamine N-Acetyl-Neuraminic Acid b-D-Allose Amygdalin D-Arabinose D-Arabitol L-Arabitol Arbutin 2-Deoxy-D-Ribose i-Erythritol D-Fucose 3-0-b-D-Galactopyranosyl-D-Arabinose Gentiobiose L-Glucose D-Lactitol D-Melezitose Maltitol a-Methyl-D-Glucoside b-Methyl-D-Galactoside 3-O-Methyl-Glucose b-Methyl-D-Glucuronic Acid a-Methyl-D-Mannoside b-Methyl-D-Xyloside Palatinose D-Raffinose Salicin Sedoheptulosan L-Sorbose Stachyose D-Tagatose Turanose Xylitol N-Acetyl-D-Glucosaminitol g-Amino-Butyric Acid d-Amino-Valeric Acid Butyric Acid Capric Acid Caproic Acid Citraconic Acid D,L-Citramalic Acid D-Glucosamine 2-Hydroxy-Benzoic Acid 4-Hydroxy-Benzoic Acid b-Hydroxy-Butyric Acid g-Hydroxy-Butyric Acid a-Keto-Valeric Acid Itaconic Acid 5-Keto-D-Gluconic Acid D-Lactic Acid Methyl Ester Malonic Acid Melibionic Acid Oxalic Acid Oxalomalic Acid Quinic Acid D-Ribono-1,4-Lactone Sebacic Acid Sorbic Acid Succinamic Acid D-Tartaric Acid L-Tartaric Acid Acetamide L-Alaninamide N-Acetyl-L-Glutamic Acid L-Arginine Glycine L-Histidine L-Homoserine 4-Hydroxy-L-Proline (trans) L-Isoleucine L-Leucine L-Lysine L-Methionine L-Ornithine L-Phenylalanine L-Pyroglutamic Acid L-Valine D,L-Carnitine Butylamine (sec) D,L-Octopamine Putrescine Dihydroxy-Acetone 2,3-Butanediol 2,3-Butanone 3-Hydroxy-2-Butanone OG1RF UW6149 UW2860 UW6724 UW7761 UW7777 UW7780 UW7753 UW1833 UW7779 UW7729 UW7801 UW6727 D32 UW7709 UW7742 Mean area values were calculated of three independent experiments performed at 37°C for a 72h incubation time. For reasons of simplification, mean area values have been replaced by a color code. Grey color indicated no or only weak substrate utilization (mean area value below 10,000) and pink symbolized middle values (mean area value between 10,000 and 19,999). The green color represented strains with a high capability of utilization of the respective carbon source (mean area value greater or equal to 20,000). 10 References 1. Werner G, Hildebrandt B, Witte W (2003) Linkage of erm(B) and aadE-sat4-aphA-3 in multipleresistant Enterococcus faecium isolates of different ecological origins. MicrobDrug Resist 9 Suppl 1: S9-16. 2. Shankar N, Baghdayan AS, Gilmore MS (2002) Modulation of virulence within a pathogenicity island in vancomycin-resistant Enterococcus faecalis. Nature 417: 746-750. 3. Vankerckhoven V, Van Autgaerden T, Vael C, Lammens C, Chapelle S, et al. (2004) Development of a multiplex PCR for the detection of asa1, gelE, cylA, esp, and hyl genes in enterococci and survey for virulence determinants among European hospital isolates of Enterococcus faecium. J Clin Microbiol 42: 4473-4479. 4. Laverde Gomez JA, Hendrickx AP, Willems RJ, Top J, Sava I, et al. (2011) Intra- and interspecies genomic transfer of the Enterococcus faecalis pathogenicity island. PLoS One 6: e16720. 5. Laverde Gomez JA (2011) Horizontally Transferable Elements among Enterococci. [Dissertation (Dr. rer. nat.)]: Technischen Universität Carolo-Wilhelmina zu Braunschweig. 6. McBride SM, Fischetti VA, Leblanc DJ, Moellering RC, Jr., Gilmore MS (2007) Genetic diversity among Enterococcus faecalis. PLoS One 2: e582. 7. Hancock LE, Gilmore MS (2002) The capsular polysaccharide of Enterococcus faecalis and its relationship to other polysaccharides in the cell wall. Proc Natl Acad Sci U S A 99: 1574-1579. 8. Palmer KL, Gilmore MS (2010) Multidrug-resistant enterococci lack CRISPR-cas. MBio 1: e0022700210. 9. Bourgogne A, Garsin DA, Qin X, Singh KV, Sillanpaa J, et al. (2008) Large scale variation in Enterococcus faecalis illustrated by the genome analysis of strain OG1RF. Genome Biol 9: R110. 10. Jensen LB, Garcia-Migura L, Valenzuela AJ, Lohr M, Hasman H, et al. (2010) A classification system for plasmids from enterococci and other Gram-positive bacteria. J Microbiol Methods 80: 2543. 11. Horvath P, Coute-Monvoisin AC, Romero DA, Boyaval P, Fremaux C, et al. (2009) Comparative analysis of CRISPR loci in lactic acid bacteria genomes. Int J Food Microbiol 131: 62-70. 11