gb-2014-15-3-r44-S5

advertisement
Additional file 5. The 100 features describing each organism which were used in the search for
the phenotypes predictive of the changes in translation efficiency within COGs. All features are
binary variables, and can be undefined for some organisms. 70 features describing the phylogeny
(left/middle columns) and the 6 features describing genome size and GC content (right column, top)
are included to ensure that correlations detected with the remaining 24 features (phenotypes, right
column) could not be explained by the phylogeny or the genomic size/GC. "#pos" is the number of
organisms marked as positive for the feature, and "#neg" as negative.
domain / phylum /
class
#pos
#neg
domain: Archaea
domain: Bacteria
phylum: Actinobacteria
85
826
97
826
85
813
phylum: Bacteroidetes
phylum: Chlamydiae
phylum: Chlorobi
phylum: Chloroflexi
34
8
10
15
876
902
900
895
order
(taxonomy)
#pos
#neg
Actinomycetales
Alteromonadales
Bacillales
83
28
46
801
856
838
Bacteroidales
Bifidobacteriales
Burkholderiales
Campylobacterales
10
6
47
18
874
878
837
866
genomic features
G+C: above median(>47.5%)
G+C: high(>59.9%)
G+C: low(<37.6%)
Genome size: above
median(>3.2Mb)
Genome size: large(>4.8Mb)
Genome size: small(<2Mb)
23
28
887
882
Chlamydiales
Chlorobiales
8
10
876
874
phenotypes
10
58
900
852
Chroococcales
Clostridiales
20
37
864
847
phylum: Firmicutes
145
765
Desulfovibrionales
8
876
phylum: Proteobacteria
383
527
Enterobacteriales
46
838
phylum: Spirochaetes
phylum: Tenericutes
phylum: Thermotogae
17
28
11
893
882
899
Flavobacteriales
Halobacteriales
Lactobacillales
14
12
36
870
872
848
class: Actinobacteria
97
775
Legionellales
3
881
107
82
10
68
8
10
61
10
33
22
765
790
862
804
864
862
811
862
839
850
Methanococcales
Mycoplasmatales
Neisseriales
Pasteurellales
Prochlorales
Pseudomonadales
Rhizobiales
Rhodobacterales
Rickettsiales
Spirochaetales
9
22
5
10
1
18
47
12
25
17
875
862
879
874
883
866
837
872
859
867
14
858
5
879
152
720
Sulfolobales
Thermoanaerobacterales
20
864
class: Halobacteria
12
860
Thermococcales
8
876
class: Methanococci
9
863
Thermotogales
11
873
14
28
17
23
11
858
844
855
849
861
Thiotrichales
Vibrionales
Xanthomonadales
4
9
8
880
875
876
Gram Stain=positive
Growth in groups
Habitat=aquatic(pos) vs.
terrestrial(neg)
Habitat=free-living(pos) vs.
hostAssociated(neg)
Habitat=multiple(pos) vs.
single(neg)
Halophilic
Motility
Oxygen = aerotolerant (pos) vs.
strictly anaerobic (neg)
Oxygen = facultative aerobe (pos)
vs. strict aerobe (neg)
Psychrophilic
Radioresistance
Shape = coccus (pos) vs. rod (neg)
Thermophilic
Pathogenic in plants
Pathogenic in mammals
Mammalian pathogen = blood
Mammalian pathogen =enteric
Mammalian pathogen = heart
Mammalian pathogen = nervous
system
Mammalian pathogen =
oportunistic/nosocomial
Mammalian pathogen = oral
cavity
Mammalian pathogen =
respiratory
Mammalian pathogen = skin/soft
tissues
phylum: Crenarchaeota
phylum: Cyanobacteria
phylum: DeinococcusThermus
phylum: Euryarchaeota
class: α-proteobacteria
class: Bacilli
class: Bacteroidia
class: β-proteobacteria
class: Chlamydiae
class: Chlorobia
class: Clostridia
class: Deinococci
class: δ-proteobacteria
class: ε-proteobacteria
class: Flavobacteria
class: γ-proteobacteria
class: Methanomicrobia
class: Mollicutes
class: Spirochaetes
class: Thermoprotei
class: Thermotogae
Endospores
#pos
#ne
g
425
235
188
387
576
624
454
226
195
443
671
702
#pos
#ne
g
75
337
216
61
466
228
167
71
238
222
190
40
379
587
140
231
514
214
217
25
16
105
142
23
163
35
31
8
296
760
887
506
643
304
166
128
132
155
17
146
16
147
8
155
43
120
24
139
Download