Search by genomic coordinate

advertisement
Manual
seeQTL: A searchable database for human eQTLs
seeQTL portal
Kai Xia (kxia@bios.unc.edu)
Overview
Search by Gene Symbol
Search by SNP
Search by genomic coordinate
Select interested tracks
Upload user P-Value
Study example using seeQTL
Overview
seeQTL is a genomic browser based eQTL database developed under Gbrowse 2.0.
The Landmark or Region: genomic coordinate of current genomic window. User can also input the search
items here by Gene Symbol, dbSNP ID or genomic coordinate.
Overview section
CYT track: cytoband of the current chromosomes
Region section
GWAS track: GWAS results from other study
OMIM track: OMIM information from Online Mendelian Inheritance in Man (OMIM)
UCSC RefGenes track: UCSC annotated RefSeq genes
Details section
UCSC RefGenes track: UCSC annotated RefSeq genes in a zoom in view
HapMap LCL cis-eQTL meta-analysis track: segment plot for meta-analysis of cis-eQTL of HapMap
sample
HapMap LCL cis-eQTL meta-analysis Manhattan plot track: Manhattan plot for meta-analysis result of
cis-eQTL of HapMap sample
Other tracks: either segment plot or Manhattan plot for other eQTL study
Search by Gene Symbol
Input the Official Gene Symbol in the ‘Landmark or Region’ and click ‘Search’ on the right
Users will be directed to another page showing whole genome view of CD47 associated SNPs. The detail
of each association is also shown below in the tabular form. Users can also click each association and will
be directed to the new genomic view page for particular SNP.
Search by SNP
Same place to input dbSNP ID
Users will be directed to another page showing whole genome view of rs187462 associated genes. The
detail of each association is also shown below in the tabular form. Users can also click each association
and will be directed to the new genomic view page for particular genes.
Search by genomic coordinate
Users can also search by inputting genomic coordinate format like:
chr3:107714174..107914173
Select interested tracks
Users can also select their favored datasets and tracks to view. Once they go back to the ‘Brower’ window,
all the tracks will be updated based on the track they selected. The order of tracks can also be changed
just by dragging in the in window when placing on each track name
Upload user P-values
Upload user P-values
Users can freely upload their customized sets of tracks. The tracks can be either gene, SNP, SNP P-values
or other quantitative. The full description of uploading data and configuration can be found at
(http://gbrowse.csbio.unc.edu/gbrowse2/annotation_help.html#remote). For user customized eQTLs or
GWAS P-value, we suggest user to upload one track for each sets of SNP P-value. Users have to follow
the WIG rules to prepare the file before upload. To properly visualize the tracks, users also need to set up
the CONF files. We also suggest users to use –log10(P) to for each SNP P-value. We have prepared an
example for the WIG file (gwas_snp_stat_Teslovich_HDL_CEU.txt.wig) and CONF file (wig_plot.conf)
so that user can easily try to see how it works. See the documentation online for other details. Other
important steps are shown below:
To help users convert their own P-value files (example : gwas_snp_stat_Teslovich_HDL_CEU.txt) to
WIG format, we also provide a simple Python code: pval2wig.py, which can be used in Windows, Linux
or Mac.
The example SNP P-value is from Genome Wide Associations Scans for HDL-C
(http://www.sph.umich.edu/csg/abecasis/public/lipids2010/)
CONF file example:
[track_1]
database = database_0 # do not change this!
category = Custom Tracks:gwas_snp_stat_Teslovich_HDL_CEU.txt.wig
show summary = 0
glyph
= wiggle_xyplot
graph_type = points
fgcolor
= salmon
bgcolor
= salmon
height
= 100
point_symbol = filled_disc
point_radius = 7
key
= SNP_Pval (-log10)
feature = 001
label density = 10000
bump density = 10000
Select the file to upload:
After finishing uploading the track, Gbrowse will generate a default configure file for the track you
upload. It defines the track to show as box plot. By clicking the ‘edit’ button on the right, users will be
able to edit the configuration of their P-value track.
If users want to show their SNP P-value as Manhattan plot, we recommend users to use the following
configuration (CONF file). Please keep in mind that the users cannot change ‘database = database_0’
because it is where the uploaded tracks locate. After input the proper configuration file, user can submit
and go back to the browser.
A new track named as SNP_Pval is shown in the browser, where users can easily navigate across the
genome.
Study example using seeQTL
Let’s assume I am interested in gene CD47 and I would like to know how its gene expression is
associated with other SNPs. I can just use the search engine of seeQTL to find the top associated SNPs
across the genome among different eQTL datasets. All I need to do is:
1) Search the CD47. A whole genomic view of CD47 associated map will be shown. I can easily find out
where the CD47 associated SNPs locates across the genome. In this case, chr3 is enriched with a lot of
associated SNPs. Since CD47 is also in chr3, the enrichment of associated SNPs just shows strong cisassociations of CD47 with adjacent SNPs (cis-eQTL or local-eQTL). Below the genomic view, I can also
find the top associated list of SNPs together with their location and link. Each link will direct me to the
genomic location it points to.
Where to search:
Genome wide association map of CD47:
2) Then I want to see how the local eQTL affect the gene expression of CD47 gene location, I just click
the location link of CD47 and directed to the genomic location of CD47. In the new window, I found
many tracks of eQTLs and I am just interested the meta-analysis results here, I just look at the track
named as ‘hapmap LCL cis-eQTL meta-analysis Manhattan plot’, which shows the Manhattan plot of
location association of CD47. In the pictures below, I didn’t find many strong associated SNPs in the
gene body of CD47. The reason is that sometimes the regulatory element is upstream (3’-UTR) or
downstream (5’-UTR) of gene body. In order to check other SNPs close to CD47, I just select to zoomout into a 200 kb windows. Then I can find a strong associated SNP in the 5’-UTR of CD47 with q-value >
10 (marked in green). That particular SNP could be the SNP regulating the gene expression of CD47.
Click the gene link:
Directed to genomic location of CD47:
Zoom out to a 200 kbp window in the top-right panel of the page:
New windows showing Manhattan plot of more SNPs associated with CD47:
3) In the figure above, I also observed similar pattern of Manhattan plot for IFT57, which is a different
gene upstream of CD47, this could be an indication that both genes are correlated and regulated by the
same genomic regions. This example is just an illustration on how to explore seeQTL and find scientific
interesting results. All the other genes and SNPs can be searched and browsed in the same way.
Download