UCSC Genome Browser

advertisement
UCSC Genome Browser
Accessing the browser and tools:
1. Genome Wiki
 http://genomewiki.ucsc.edu/index.php/Main_Page
2. Genomes:
 Gateway Page: Genomes on the left blue bar and Genome Browser on top.
 Photo Gateway: Photographs of each species whose genome is available in the UCSC
genome browser
3. Genome Browser Interface
 Click on the hyperlink on the track name under track groups for track description.
 Right Click and View image to get image for publications.
 View-> Configure Browser
4. Searching the database:
1. Searching genes:
 PPP1R1B
2. Searching coordinates:
 chr21:33,031,597-33,041,570
3. Cytogenetic location
 17q12
5. Specific Tracks
1. Genetic Association Database (GAD) view
a. Example BRCA1, GAD
b. Mouse over red box to see the list of associated diseases
2. Gene and Gene Prediction Tracks
a. UCSC genes
b. TransMap Cross_species Alignments
c. Refseq
d. mRNA and EST Tracks
e. Conservation and Regulation Data
f. And many more.
6. Blat: On top blue bar: Tools -> Blat)
a. Fast alignment tool
b. BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of
length 25 bases or more.
c. BLAT on proteins finds sequences of 80% and greater similarity of length 20 amino acids
or more.
d. Allows user to align DNA/protein to the genome assemblies.
e. Example: Align ‘PORIN_HUMAN’ from GCBA815 dataset page.
7. Tables: On the top blue bar and table browser on the left blue bar.
a. Retrieve data associated with tracks
b. Example: Mammal/Human/hg19/Genes and Gene Prediction Tracks/Refseq
Genes/refGene/genome
8. Gene Sorter : Allows to Explore gene families and relationships among genes. It displays a list of
genes that are related to each other in the selected genome.
a. Filter: human/hg19/ FOXA2/Expression(GNF Atlas 2)/configure(columns to display)
b. Description page:
i. http://genome.ucsc.edu/goldenPath/help/hgNearHelp.html
9. Genome Graphs:
a. Upload data set (genome-graph-example.txt)
i. Use hg18 as reference genome.
b. Description:
i. http://genome.ucsc.edu/goldenPath/help/hgGenomeHelp.html
10. PCR: Search for PCR primer sequence in a genome assembly.
a. Human/hg19/genome assembly/ TAACAGATTGATGATGCATGAAATGGG/
CCCATGAGTGGCTCCTAAAGCAGCTGC
11. VisiGene: a virtual microscope for viewing in situ hybridization images.
a. NM_007492
12. Utilities : tools to remove non-sequence-related characters from DNA or protein, to create a gif
image for a phylogenetic tree, and to convert genome coordinates between assemblies
a. http://genome.ucsc.edu/util.html
13. Downloads: Dumps of UCSC genome browser databases and open source tools.
14. Custom Tracks: Displaying your own annotations in the genome browser.
a. Session:
i. Creating account
ii. Resetting session from My Data -> Session -> Session management -> Click here
to reset
b. Example:
i. Upload test file “custom-track-example”
ii. URL example from golgi: http://genome.unmc.edu/ucsc/gcba_815/customtrack-example.bed
c. Description: http://genome.ucsc.edu/goldenPath/help/customTrack.html
Other Resources:
https://genome.ucsc.edu/training.html
Reference:
Harte, R. A., Diekhans, M., Kent, W. J. & Haussler, D. Guide to the UCSC Genome Browser. Cambridge,
MA: NPG Education, 2010
Galaxy (https://usegalaxy.org/)
Account Creation:
1. User -> Register
a. Required for saving history and accessing data in the future.
Input data:
1. Upload File from local computer
a. Download data from UCSC genome browser
i. Table Browser -> Mammal/Human/hg19/Genes and Gene Prediction
Tracks/UCSC Genes/knownGene/position:chr22/…./(output format) BEDbrowser extensible data/(output file)Exons/plain text/get output
2. UCSC Main
a. UCSC Main -> Mammal/Human/hg19/Variation/All
SNPs(138)/snp138/(position)chr22/…./(output format) BED-browser extensible
data/Galaxy checked/plain text/get output
b. Next Page -> Whole Gene/Send query to Galaxy
3. Other sources
Simple Data Manipulation
1. Finding Exons with highest number of SNPs
a. Operate on Genomic Intervals -> Join
i. Joins 1:Exons with 2:SNPs with min overlap 1
ii. Return: Only records that are joined(INNER) JOIN)
iii. Execute
b. Output:
chr2
2
16258185 16258303 uc002zlh.1_cds_1_0_chr22_16258186_r 0 - chr22 16258278 16258279 rs2845178
chr2
2
16258185 16258303 uc002zlh.1_cds_1_0_chr22_16258186_r 0 - chr22 16258285 16258286 rs113311305 0 +
chr2
2
16258185 16258303 uc002zlh.1_cds_1_0_chr22_16258186_r 0 - chr22 16258284 16258285 rs199586979 0 +
chr2
2
16258185 16258303 uc002zlh.1_cds_1_0_chr22_16258186_r 0 - chr22 16258293 16258294 rs200891952 0 +
chr2
2
16266928 16267095 uc002zlh.1_cds_2_0_chr22_16266929_r 0 - chr22 16266963 16266964 rs10154680
chr2
2
16266928 16267095 uc002zlh.1_cds_2_0_chr22_16266929_r 0 - chr22 16266984 16266985 rs376011635 0 +
c. Group the exons to count SNPs
i. Join, Subtract, and Group -> Group
1. Select Data ->Join on data 2 and data 1
2. Group by column 4
0 +
0 +
2.
3.
4.
5.
3. Operation =>Insert operation => Type -> count; On Column: 4
4. Exit
Sorting exons by SNP count
a. Filter and Sort -> Sort
Selecting top hits
a. Text Manipulation -> Select First
Recovering exon info and displaying data in genome browser
a. Join, Subtract and Group -> Compare two Datasets
i. 1:Exons/c4/7:Select first on data 6/c1/Matching rows of 1st dataset
Visualizing selected exons in genome browser
a. display at UCSC main
6. Histories
a. History of data, analysis and results.
7. Workflows
a. Pipeline that can be used to re run the analysis with minimal clicking
b. History Settings -> Extract Workflow -> give name -> Create Workflow
c. Select Workflow from top menu bar.
8. Other tools.
Reference
http://galaxyproject.org/
Download