IMG/M User Tasks List - Microbial Genome Program

advertisement
IMG User Scenario February11, 2011
IMG/M User Tasks List
A. Using Genome Browser and Metagenome Details
1. How many metagenomes are there in IMG/M?
2. How many bins are there in Acid Mine Drainage metagenome?
3. To which phyla the proteins in Acid Mine Drainage metagenome have
the highest number of hits?
4. How many scaffolds in Acid Mine Drainage metagenome have GC
content below 30%?
5. Using Phylogenetic Profiler on Microbiome Details page, find how
many genes with COG hits are present in Methylococcaceae bin of
Lake Washington (combined v2) metagenome, but not in the genome
of Methylococcus capsulatus.
B. Using Phylogenetic Distribution of Genes and Scaffold Cart
1. How many genes in Sludge/US, Phrap Assembly metagenome have
the best hit to Alphaproteobacteria with percent identity between 60 and
90%?
2. How many genes in Sludge US Phrap metagenome with the best hit to
Alphaproteobacteria at 60-90% identity belong to the COG Functional
Category of “Amino acid transport and metabolism”?
3. Which family of Betaproteobacteria has the highest number of best
hits from the genes in Sludge US Phrap metagenome (cumulative
above 30% identity)?
4. Which archaeal genome has most hits with >90% identity from the
metagenomes of human gut subject 7 and human gut subject 8?
5. What are the functions of genes in the region between 270 kb and 330
kb of isolate Methanobrevibacter smithii that are missing from
Methanobrevibacter smithii from human gut subject 7 metagenome?
IMG User Scenario February11, 2011
Are these genes also absent from Methanobrevibacter smithii from
human gut subject 8 metagenome?
6. What is the range of GC content of contigs assigned to the bin
“Accumulibacter” (binning method PhyloPythia) in Sludge US Phrap
metagenome? What is the range of read depths for this bin?
C. Using Find Functions
1. Which Pfams describe carbohydrate-binding modules (CBM)?
2. Are there any CBM-containing genes in human gut subject 7 and
subject 8?
3. Do A. phosphatis bins in Sludge/Australian, Phrap Assembly and
Sludge/US, Jazz Assembly metagenomes have all COGs assigned to
“Histidine biosynthesis” pathway? Do they have a complete pathway?
4. Representatives of which phyla are likely to be present in the
metagenome of human gut community subject 7 when COG0200
(Ribosomal protein L15) is used?
D. Using Find Genes
1. Which domains are associated with carbohydrate binding module
family 6 (CBM_6, pfam03422) in Soil microbial communities from
Minnesota farm metagenome?
E. Using Compare Genomes
1. Using Abundance Profile Search, find, how many Pfams are at least
twice as abundant in the metagenome of human gut community
subject 7 as compared to human gut community subject 8 using
frequency normalization. Which Pfam has the highest frequency in
human gut community subject 7 metagenome? Is it more abundant
than in human gut community subject 8 metagenome by frequency?
By raw counts?
2. Using Function Comparisons, find which COG is most
overrepresented in the metagenome of human gut community subject
7 as compared to human gut community subject 8. Is this result
IMG User Scenario February11, 2011
statistically significant? Which COGs are significantly
overrepresented in human gut community subject 7 as compared to
human gut community subject 8? Use D-score as statistic and gene
count.
3. Using Function Category Comparisons, find which COG Pathways
are overrepresented with p-value less than 1.0e-01 in human gut
subject 7 metagenome as compared to human gut subject 8
metagenome. How these results could be interpreted with respect to
the overrepresentation of individual COGs in the same metagenome?
4. Using Genome Clustering, find, which of 5 mouse gut community
metagenomes are the closest according to their COG frequency
distributions? According to their Pfam frequency distribution?
5. Using Phylogenetic Distribution, find, which phyla are
underrepresented in metagenomes of 2 obese mice as compared to 3
lean mice using “Gene count” and BLAST hits with identity above
60%. Which phyla are overrepresented?
F. Using SNP BLAST and SNP VISTA.
1. Using SNP BLAST and SNP VISTA find whether there are any
populations within Leptospirillum sp. group II bin of Acid Mine
Drainage metagenome.
2. Is there any evidence of recombination between populations within
Ferroplasma acidarmanus type I bin of Acid Mine Drainage
metagenome?
Download