1 Supplementary Methods 2 3 Description of Homologous Recombination (HR) hypotheses at the polymorphic 4 Mph region (Mph Cluster A gene) in the B. cereus group. 5 6 According to the differences in phylogenetic trees constructed from six housekeeping genes (Fig. 7 3), the 16 Kbp, 5.5 Kbp, and 4 Kbp surrounding regions of the Mph Cluster A gene, and the Mph 8 gene polymorphic region (Suppl. Fig. S3), seven homologous recombination hypotheses involving 9 the polymorphic Mph region were proposed. For all hypotheses, the recombinant is defined as the 10 strain in which the HR event have occurred (or its direct ancestor), the minor parent is defined as 11 the strain from which the acquired sequence belongs to, and the major parent is defined as the 12 closest strain to the recombinant which did not undergone the HR event. 13 14 1- B. cereus Rock 1-3 isolate have acquired the Mph allele from a strain related to B. cereus 15 Rock 3-28 isolate. 16 17 Phylogenetic information 18 B. cereus Rock 3-28 and B. cereus Rock 1-3 are sister strains in the polymorphic Mph region 19 phylogeny (Suppl. Fig. S3D), with B. cereus Rock 4-18 as an outgroup, while B. cereus Rock 3-28 20 is an outgroup of B. cereus Rock 4-18 /B. cereus Rock 1-3 strains in all other phylogenies (Fig. 3 21 and Suppl. Fig. S3A-C). 22 In the 16 Kbp phylogeny (Suppl. Fig. S3A), B. cereus Rock 3-28 is a sister group of B.cereus 23 VD115, while it is the sister group of B. cereus Rock 4-18/B. cereus Rock 1-3 in the housekeeping 24 genes phylogeny (Fig. 3) and in both 5 Kbp phylogenies (Suppl. Fig. S3B-C). 25 26 Hypothesis 27 This means that B. cereus Rock 1-3 and B. cereus Rock 3-28 Mph genes are more similar to each 28 other than to the B. cereus Rock 4-18 Mph gene, while the overall sequence of B. cereus Rock 1-3 1 29 is more similar to B. cereus Rock 4-18 than to B. cereus Rock 3-28. A HR event thus probably 30 occurred at the polymorphic Mph region, with B. cereus Rock 1-3 as the recombinant, a strain 31 related to B. cereus Rock 3-28 as the minor parent, and a strain related to B. cereus Rock 4-18 as 32 the major parent. 33 The inconsistency of B. cereus Rock 3-28 position in the 16 Kbp tree compared to other trees 34 may be explained by another HR event which may have occurred outside of the 5.5 kbp region but 35 in the 16 Kbp region, involving B. cereus Rock 3-28 as the recombinant, a strain related to B. 36 cereus VD115 as the minor parent, and a strain related to B. cereus Rock 3-28 as the major parent. 37 38 RDP analysis 39 Analysis A: 40 Included isolates: B. cereus Rock 1-3, B. cereus Rock 3-28, B. cereus Rock 4-18, B. cereus VD156 41 (outgroup). Analysis performed on the 16 Kbp region with the polymorphic Mph region included. 42 Four methods provided support with p-value < 10-3 for the Mph replacement in B. cereus Rock 43 1-3 by an allele related to the B. cereus Rock 3-28 allele (Suppl. Table S3). All methods infer a ~1 44 Kbp recombination event including ~ 500 bp from the Mph polymorphic region. 45 Analysis B: 46 Included isolates: B. cereus Rock 1-3, B. cereus Rock 3-28, B. cereus Rock 4-18, B. cereus 47 VD115, B. cereus HuB5-5 and B. cereus ST196 (outgroup). Analysis performed on the 16 Kbp 48 region without the polymorphic Mph region. 49 Five methods provided highly significant support for a ~2 Kbp recombination event that 50 occurred in B. cereus Rock 3-28 with B. cereus VD115 as the minor parent (Suppl. Table S2). All 51 methods locate the recombination event between positions ~14300 and ~16300 in the alignment, 52 thus outside of the 5.5 Kbp region. 53 54 2- B. cereus HuB5-5 isolate have lost the Mph allele after RH with a strain related to B. 55 cereus VD115 isolate. 56 57 Phylogenetic information 2 58 B. cereus HuB5-5 is included in a monophyletic group of Mph-carrying isolates (including B. 59 cereus Rock 1-3 and B. cereus Rock 4-18) in the housekeeping genes phylogeny (Fig. 3), while B. 60 cereus HuB5-5 branches as a sister group of B. cereus Rock 1-3 and B. cereus Rock 4-18 in the 16 61 Kbp and both 5.5 Kbp phylogenies (Suppl. Fig. S3A-C). 62 63 Hypothesis 64 As B. cereus HuB5-5 do not carry the polymorphic Mph region, this strain may have lost it 65 through RH with another strain not carrying the Mph gene. B. cereus HuB5-5 is the recombinant, 66 and any of the B. cereus Rock 1-3 / B. cereus Rock 3-28 / B. cereus Rock 4-18 may be the major 67 parent. B. cereus HuB5-5 is located in the same group with 100% bootstrap support in all 68 phylogenies, suggesting that the minor parent also belongs to this group, thus related to B. cereus 69 VD115. 70 71 RDP analysis 72 Included isolates: B. cereus HuB5-5, B. cereus Rock 1-3, B. cereus Rock 3-28, B. cereus Rock 73 4-18, B. cereus VD115, and B. cereus ST196 (outgroup). Analysis performed on the 16 Kbp 74 region without the polymorphic Mph region. 75 The RDP analysis did not returned any putative HR event in B. cereus HuB5-5 in the region of 76 interest, which does not support the Mph gene removal through HR with a B. cereus VD115 77 relative (Suppl. Table S2). Moreover no HR with B. cereus ST196 was inferred, excluding a 78 potential event with any of the known Mph-free isolates. 79 A potential explanation would be that a HR event larger than the 16 Kbp region have occurred 80 between B. cereus HuB5-5 and a strain related to the B. cereus VD115 isolate, but such a putative 81 RH event cannot be inferred by RDP with the present 16 kbp window. 82 83 3- B. cereus Rock 3-42 isolate have acquired the Mph allele from a strain related to B. 84 thuringiensis BGSC 4BA1 isolate. 85 86 Phylogenetic information 3 87 B. cereus Rock 3-42 groups with B. cereus ATCC 4342 with 100% bootstrap value in the 88 housekeeping genes phylogeny (Fig. 3), while it branches as the sister group of B. anthracis 89 isolates in the 16 kbp and both 5.5 Kbp phylogenies (Suppl. Fig. S3A-C). In the polymorphic Mph 90 region phylogeny, it branches as the sister group of B. thuringiensis BGSC 4BA1 (Suppl. Fig. 91 S3D). 92 93 Hypothesis 94 The inconsistent topology between housekeeping genes phylogeny and other phylogenies suggests 95 that B. cereus Rock 3-42 may have acquired the polymorphic Mph region from a relative of B. 96 anthracis strains. The major parent would be B. cereus ATCC 4342, and the minor parent would 97 be any of the B. anthracis strains. The fact that B. cereus Rock 3-42 still branches with B. 98 anthracis strains when the polymorphic Mph region is removed (Suppl. Fig. S3C) suggests that 99 the RH event may span the whole 5.5 Kbp region. A second HR event then may have occurred in 100 the polymorphic Mph region with B. cereus Rock 3-42 as the recombinant, a B. anthracis strain as 101 the major parent, and B. thuringiensis BGSC 4BA1 as the minor parent. 102 103 RDP analysis 104 Analysis A: 105 Included isolates: B. cereus Rock 3-42, B. anthracis BF1, B. thuringiensis BGSC 4BA1, B. 106 thuringiensis BGSC 4AW1, B. cereus ATCC 4342, and B. cereus MM3 (outgroup). Analysis 107 performed on the 16 Kbp region without the polymorphic Mph region. 108 The RDP analysis did not returned any putative HR event in B. cereus Rock 3-42 in the region 109 of interest, which does not support the Mph gene acquisition through HR from a B. anthracis 110 relative. 111 112 Again, a potential explanation would be that B. cereus Rock 3-42 has acquired the Mph gene through a recombination larger than the 16 Kbp region. 113 114 Analysis B: 115 Included isolates: B. cereus Rock 3-42, B. anthracis BF1, B. thuringiensis BGSC 4BA1, B. 4 116 thuringiensis BGSC 4AW1, B. cereus BGSC 6E1, and B. cereus VD156 (outgroup). Analysis 117 performed on the 16 Kbp region with the polymorphic Mph region included. 118 Four methods provided support for a ~2 Kbp recombination event that occurred in B. cereus 119 Rock 3-42 with B. thuringiensis BGSC 4BA1 as the minor parent (Suppl. Table S2). The 120 recombined section spans the whole polymorphic Mph region for most methods. 121 122 4- B. cereus R309803 isolate have acquired the Mph gene from a strain related to B. cereus 123 VD156 isolate. 124 125 Phylogenetic information 126 B. cereus R309803 groups as the root of Clade 1 and Clade 2 in the housekeeping genes 127 phylogeny (Fig. 3) and the 16 Kbp phylogeny (Suppl. Fig. S3A), while it branches as the sister 128 group of B. cereus VD156 and B. thuringiensis BGSC 4BD1 in all other phylogenies (Suppl. Fig. 129 S3B-D). 130 131 Hypothesis 132 B. cereus R309803 is the recombinant with B. cereus BAG2x1-1 as the major parent, and has 133 acquired the Mph gene from a minor parent related to B. cereus VD156. 134 135 RDP analysis 136 Included isolates: B. cereus R309803, B. cereus VD156, B. cereus BAG2x1-1, B. cereus ST196 137 (outgroup). Analysis performed on the 16 Kbp region without the polymorphic Mph region. 138 139 Five methods provided highly significant support for a ~3.5 Kbp recombination event that 140 occurred in B. cereus R309803 with B. cereus VD156 as the minor parent (Suppl. Table S2). The 141 recombined section spans the whole polymorphic Mph region. 142 143 5- B. cereus m1293 and B. cereus LCT BC244 isolates have acquired the Mph gene from a 144 strain related to the group B. anthracis/B. cereus Rock 3-42 /B. thuringiensis BGSC 4BA1/B. 5 145 thuringiensis BGSC 4AW1/B. cereus BGSC 6E1 isolates (hereafter referred to the Ba++ 146 group). 147 148 Phylogenetic information 149 B. cereus m1293/B. cereus LCT BC244 is the sister group of B. cereus VD102/B. cereus BDRD 150 ST26 in the housekeeping gene phylogeny (Fig. 3) and the 16 Kbp phylogeny (Suppl. Fig. S3A) 151 but become the sister group of Ba++ in the 5.5 Kbp phylogeny including the polymorphic Mph 152 region (Suppl. Fig. S3B). When the polymorphic region is removed, B. cereus m1293/B. cereus 153 LCT BC244 branches back in the B. cereus VD102/B. cereus BDRD ST26 group (Suppl. Fig. 154 S3C). 155 156 Hypothesis 157 The group composed of B. cereus m1293 and B. cereus LCT BC244 is the recombinant, with B. 158 cereus BDRD ST26 as the major parent, and have acquired the Mph gene from a minor parent 159 related to the Ba++ group. B. cereus VD102 is not chosen as the major parent because it is a 160 potential recombinant, see Hypothesis 6 below. 161 162 RDP analysis 163 Included isolates: B. cereus m1293, B. cereus BDRD ST26, B. thuringiensis BGSC 4AW1 164 (representative of the Ba++ group), B. cereus MM3 (outgroup). Analysis performed on the 16 Kbp 165 region without the polymorphic Mph region. 166 All six methods provide highly significant support for a ~2 Kbp recombination event that 167 occurred in B. cereus m1293 and B. cereus BDRD ST26, with B. thuringiensis BGSC 4AW1 as 168 the minor parent (Suppl. Table S2). The recombined section spans the whole polymorphic Mph 169 region. 170 171 6- B. cereus VD102 isolate have lost the Mph gene after RH with a strain related to B. cereus 172 BDSD ST26. 173 6 174 Phylogenetic information 175 B. cereus VD102 is the sister group of B. cereus m1293/B. cereus CT BC244 in the housekeeping 176 gene phylogeny (Fig. 3) and the 16 Kbp phylogeny (Suppl. Fig. S3A) but in the 5.5 Kbp 177 phylogeny without the polymorphic Mph region, B. cereus VD102 branches as the sister group of 178 B. cereus LCT BC244 with high bootstrap support, with B. cereus m1293 as the outgroup (Suppl. 179 Fig. S3C). 180 181 Hypothesis 182 B. cereus VD102 may be the closest relative of B. cereus LCT BC244, and may have lost the Mph 183 gene through HR with a Mph-free Clade 1 strain. Because of the extreme nucleotide similarity 184 between B. cereus m1293, B. cereus LCT BC244 and B. cereus VD102, the presence of the 185 polymorphic Mph region in B. cereus m1293 and B. cereus LCT BC244 may have artificially 186 inflated their genetic similarity compared to B. cereus VD102. B. cereus VD102 would thus be the 187 recombinant, with B. cereus LCT BC244 as the major parent and any of the Mph-free Clade 1 188 strain as the minor parent. 189 190 RDP analysis 191 Included isolates: B. cereus LCT BC244, B. cereus VD102, B. cereus BDRD ST26, B. cereus 192 ATCC 4342, B. cereus F, B. cereus BAG6X1-1 (outgroup). Analysis performed on the 16 Kbp 193 region without the polymorphic Mph region. 194 Four methods provided significant support for a ~1.5-2 Kbp recombination event that occurred 195 in B. cereus VD102, with B. cereus ATCC 4342 as the minor parent (Suppl. Table S2). The 196 recombined section spans the whole polymorphic Mph region. 197 The same recombination event is also detected in B. cereus BDRD ST26 by RDP and 198 GENECONV methods, but with a corrected p-value significant only for GENECONV. This event 199 thus does not fulfill our prerequisites and is therefore considered as an artefactual result. 200 201 7- B. cereus BGSC 6E1 isolate have acquired the Mph allele from a strain related to B. 202 thuringiensis BGSC 4AW1. 7 203 204 Phylogenetic information 205 In all phylogenies except the polymorphic Mph gene phylogeny, B. cereus BGSC 6E1 branches as 206 the root of the Ba++ group, while in the polymorphic Mph gene phylogeny, B. cereus BGSC 6E1 207 branches as the sister group of B. thuringiensis BGSC 4AW1 (Suppl. Fig. S3D). 208 209 Hypothesis 210 B. cereus BGSC 6E1 is the recombinant and have acquired the Mph allele of B. thuringiensis 211 BGSC 4AW1 (the minor parent). In this case, the major parent would be a relative of B. cereus 212 BGSC 6E1, root of the Ba++ group. 213 214 RDP analysis 215 Included isolates: B. cereus BGSC 6E1, B. thuringiensis BGSC 4AW1, all other members of the 216 Ba++ group, B. cereus VD156 (outgroup). Analysis performed on the 16 Kbp region with the 217 polymorphic Mph region included. 218 Five methods provided significant support for a ~1-2 Kbp recombination event that occurred in 219 B. cereus BGSC 6E1, with B. thuringiensis BGSC 4AW1 as the minor parent (Suppl. Table S2). 220 The recombined section spans over 1 Kbp of the polymorphic Mph region. 8