Workshop databases tools

Seminar & workshop Bioinformatics, 18 Oct 2011
Workshop Databases and algorithms
Part 1

Skills: Blast, alignment and database browsing
Given human protein sequence:
MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQ
CPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYANSYNFAKKENNSPEHLKD
EVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLGTVRTLRTKQRIQPQKTSVYI
ELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISLDSAKKAACEFSETDVTNTEHHQ
1) Use the appropriate database to identify this protein and look up its IDs.
Protein name: …………………………
UniProt ID: ……………………………….
RefSeq ID: …………………………………
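Sequence search tools such as BLAST generally accept the query in FASTA format. As a minimal sketch, the snippet below joins the four sequence lines given above into one FASTA record that can be pasted into a search form. The header label `query_protein` and the helper name `to_fasta` are made up for illustration.

```python
# The sequence lines of the query protein, copied from the exercise text.
SEQUENCE_LINES = [
    "MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQ",
    "CPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYANSYNFAKKENNSPEHLKD",
    "EVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLGTVRTLRTKQRIQPQKTSVYI",
    "ELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISLDSAKKAACEFSETDVTNTEHHQ",
]


def to_fasta(header, sequence, width=60):
    """Return a FASTA record: a '>' header line, then the sequence wrapped at `width`."""
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(chunks) + "\n"


# Join the lines into one uninterrupted sequence (no spaces or line breaks).
sequence = "".join(line.strip() for line in SEQUENCE_LINES)

# "query_protein" is a hypothetical label; any header text will do.
fasta_record = to_fasta("query_protein", sequence)
print(fasta_record)
```

Because the query is a protein sequence, a protein-versus-protein search is the natural choice.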
2) Find its homolog in M. musculus using the databases. Give its RefSeq ID and explain what
the prefix of this ID means.
RefSeq ID mouse homolog: ………………………..
ID prefix means: ………………………………………
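To make clear which part of the ID is the "prefix", here is a minimal sketch that splits a RefSeq-style accession into prefix, number and version. It assumes the common form of two capital letters, an underscore, digits and an optional version suffix. The accession used in the example is a made-up placeholder, not a real record, and `split_refseq` is a hypothetical helper name.

```python
import re

# Assumed pattern: two-letter prefix, underscore, digits, optional ".version".
REFSEQ_PATTERN = re.compile(r"^([A-Z]{2})_(\d+)(?:\.(\d+))?$")


def split_refseq(accession):
    """Split an accession into its prefix, number and (optional) version."""
    match = REFSEQ_PATTERN.match(accession.strip())
    if match is None:
        raise ValueError("not a RefSeq-style accession: " + repr(accession))
    prefix, number, version = match.groups()
    return {
        "prefix": prefix,
        "number": number,
        "version": int(version) if version is not None else None,
    }


# Made-up placeholder accession, used only to show the three parts.
parts = split_refseq("AB_123456.7")
print(parts["prefix"], parts["number"], parts["version"])
```

The two letters before the underscore are what the question asks you to explain.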
3) Align the human and mouse protein sequences using an online tool (e.g. ClustalW). Download
the alignment file (e.g. .aln, ClustalW format).
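Once the alignment file is downloaded, it can be inspected programmatically. The following is a minimal sketch, assuming the usual Clustal text layout (a header line starting with "CLUSTAL", then blocks of "name sequence" lines, each block followed by a conservation line that starts with whitespace). The names `seqA` and `seqB` and their residues are toy data, not the real human and mouse sequences.

```python
def parse_clustal(text):
    """Parse Clustal-format alignment text into {sequence_name: aligned_sequence}."""
    sequences = {}
    for line in text.splitlines():
        # Skip the header, blank lines, and conservation lines (leading whitespace).
        if not line.strip() or line.startswith("CLUSTAL") or line[0].isspace():
            continue
        fields = line.split()
        name, fragment = fields[0], fields[1]  # a trailing residue count is ignored
        sequences[name] = sequences.get(name, "") + fragment
    return sequences


def percent_identity(a, b):
    """Percentage of identical residues among columns where neither sequence has a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Toy two-block alignment in Clustal layout (illustrative data only).
EXAMPLE = """CLUSTAL W (1.83) multiple sequence alignment

seqA       MDLSALRVEE
seqB       MDLSAVQIQE
           *****    *

seqA       VQNV-INA
seqB       VQNVLNNA
           ****  **
"""

alignment = parse_clustal(EXAMPLE)
identity = percent_identity(alignment["seqA"], alignment["seqB"])
print(f"{identity:.1f}% identity over ungapped columns")
```

Other definitions of percent identity exist (for example, dividing by the full alignment length), so state which one you use when reporting a value.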
4) Look up the 3D structure of the human (H. sapiens) protein and use the appropriate database
to find out which known protein domains it contains. Name three.
i) ……………………………..
ii) ……………………………..
iii) ……………………………..
Part 2

Skills: SNP database/genome browsing, OMIM database
Sickle cell anemia is caused predominantly by a specific mutation in the HBB gene. The
mutant is called HbS.
1) Locate the HBB gene on the genomic map below:
2) Write down the first ten amino acids of the HBB protein and of the HbS mutant. Within these
you should find a difference in the amino acid sequence, caused by the mutation in the gene.
Make use of the NCBI SNP and OMIM databases. What is the genomic position (bp) of the mutation
according to the HuRef genome sequence?
SNPdb ID: …………………………
HBB: …………………………
HbS: …………………………
Position on the genome (HuRef): ……………………
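Comparing the two ten-residue peptides by eye works, but the comparison can also be sketched in a few lines. The helper `differing_positions` and the two toy peptides below are illustrative assumptions. They are deliberately not the real HBB and HbS sequences, which you should fill in from the databases.

```python
def differing_positions(reference, variant):
    """Return (position, reference_residue, variant_residue) per mismatch; positions are 1-based."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (position, ref, var)
        for position, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


# Toy peptides for illustration only, NOT the real HBB / HbS sequences.
toy_reference = "ACDEFGHIKL"
toy_variant = "ACDEFGHVKL"

differences = differing_positions(toy_reference, toy_variant)
for position, ref, var in differences:
    # Print in the usual "reference residue, position, variant residue" notation.
    print(f"{ref}{position}{var}")
```

Note that databases may number residues with or without the initial methionine, so check which convention a given entry uses before reporting the position.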