Bioinformatics lab 2 Assignment

advertisement
MCB7300: Bioinformatics Lab 2 (60 points)
Programming
1) Select 10 random protein sequence IDs and write a Perl code to retrieve their
protein sequences from the protein database of your favorite species;
2) Use a key word and write a Perl code to retrieve 10 protein sequences that
are encoded by your favorite gene superfamily;
3) Apply a phylogenetic analysis to cluster the 20 protein sequences;
4) Write an R code to retrieve the expression data of the 20 encoding genes
from 5 different tissues and/or organs of your favorite species;
5) Use R to perform expression data analysis.
6) You may also apply the web-based programs learned from the first lab to
perform a further data analysis.
Final Report
Write a PNAS-format paper (5-6 figures) to summarize what you learn about the
relationship among the 20 protein coding genes.
(due in 2 weeks, 03/25/2015).
Download