Supplemental File S4. Homologous Chromosomes

advertisement
Investigation 2: Non-homologous elements: The Sex Determining Region of The Y
chromosome
Let's return to the Y chromosome to explore some non-homologous elements. There are
obviously many more genes on the X chromosome that are not present on the Y chromosome,
but for our exercise, we will investigate a gene found only on the Y chromosome- the gene Sexdetermining Region of the Y chromosome, or SRY.
To conduct a search for the human SRY gene: Navigate back to "Browse your Genome"
http://www.ncbi.nlm.nih.gov/projects/genome/guide/human/index.shtml
Under the chromosomes image is the search field “Find a Gene.” One can search for a specific
gene by name here.
Type in "sex determining region of the Y" or “SRY” in the text field, and select Homo sapiens
in the drop-down menu.
Hit the go button.
In the results, find the record for the human SRY gene on chromosome Y, cytogenetic location
Yp11.3
Click the link for the human SRY gene record.
This gene is also known as TDF and TDY (for testis-determining factor and testis determining
region on Y, respectively).
Click the ‘reference sequence details’ link. This will take you to another section of the report
that links to the Reference Sequences for the SRY gene transcript and the SRY protein.
Scroll down to the “NCBI Reference Sequences (RefSeq)” heading, or click the “Reference
Sequences” option from the Table on contents on the far right of the SRY record page.
Click on the record for the mRNA transcript: NM_003140.2, as shown below. This is the
accession number, or unique identification number, for the nucleotide record for the human SRY
gene.
Clicking the record number will take you to the full GenBank database record for the nucleotide
coding sequence of the human SRY gene. The first line of the record gives the accession number
and length of the sequence. At the end of the record, the full nucleotide sequence is given.
Answer the following question in your laboratory notebook in the “Answers/Conclusions”
section:
Question 2-1: According to the Nucleotide database sequence record, how many base pairs (bp)
is the human SRY transcript?
Now go back to the Entrez Gene record, where you selected the record for the SRY transcript
sequence. This time, select the record for the protein sequence: NP_003131.1
Answer the following questions in your laboratory notebook in the “Answers/Conclusions”
section:
Question 2-2: According to the record, how many amino acids long is the SRY protein?
Question 2-3: Why do you think the amino acid sequence length is NOT the same number that
you would get by simply dividing the number of nucleotides in the SRY nucleotide sequence by
3?
Although having the gene sequence provides a lot of information, just having the sequence is
not enough. Again go back to the Entrez Gene record, where you selected the record for the SRY
transcript and protein sequence. This time, select the link for “CCDS”. This link will take you to
the coding sequence for SRY, which is a sequence that results from consistent annotation by
researchers.
The annotation of genes is a collaborative effort between international partners. The links below
will give you more information about how gene records are annotated by researchers.
http://www.ncbi.nlm.nih.gov/projects/RefSeq/
http://www.ncbi.nlm.nih.gov/projects/CCDS/
Note how the nucleotide sequence of the consensus record is shorter than the Nucleotide
database sequence for SRY that you reported above because the consensus sequence has been
revised/edited by researchers to show only those nucleotide bases that are translated into the
SRY protein.
Download