Manual seeQTL: A searchable database for human eQTLs seeQTL portal Kai Xia (kxia@bios.unc.edu) Overview Search by Gene Symbol Search by SNP Search by genomic coordinate Select interested tracks Upload user P-Value Study example using seeQTL Overview seeQTL is a genomic browser based eQTL database developed under Gbrowse 2.0. The Landmark or Region: genomic coordinate of current genomic window. User can also input the search items here by Gene Symbol, dbSNP ID or genomic coordinate. Overview section CYT track: cytoband of the current chromosomes Region section GWAS track: GWAS results from other study OMIM track: OMIM information from Online Mendelian Inheritance in Man (OMIM) UCSC RefGenes track: UCSC annotated RefSeq genes Details section UCSC RefGenes track: UCSC annotated RefSeq genes in a zoom in view HapMap LCL cis-eQTL meta-analysis track: segment plot for meta-analysis of cis-eQTL of HapMap sample HapMap LCL cis-eQTL meta-analysis Manhattan plot track: Manhattan plot for meta-analysis result of cis-eQTL of HapMap sample Other tracks: either segment plot or Manhattan plot for other eQTL study Search by Gene Symbol Input the Official Gene Symbol in the ‘Landmark or Region’ and click ‘Search’ on the right Users will be directed to another page showing whole genome view of CD47 associated SNPs. The detail of each association is also shown below in the tabular form. Users can also click each association and will be directed to the new genomic view page for particular SNP. Search by SNP Same place to input dbSNP ID Users will be directed to another page showing whole genome view of rs187462 associated genes. The detail of each association is also shown below in the tabular form. Users can also click each association and will be directed to the new genomic view page for particular genes. Search by genomic coordinate Users can also search by inputting genomic coordinate format like: chr3:107714174..107914173 Select interested tracks Users can also select their favored datasets and tracks to view. Once they go back to the ‘Brower’ window, all the tracks will be updated based on the track they selected. The order of tracks can also be changed just by dragging in the in window when placing on each track name Upload user P-values Upload user P-values Users can freely upload their customized sets of tracks. The tracks can be either gene, SNP, SNP P-values or other quantitative. The full description of uploading data and configuration can be found at (http://gbrowse.csbio.unc.edu/gbrowse2/annotation_help.html#remote). For user customized eQTLs or GWAS P-value, we suggest user to upload one track for each sets of SNP P-value. Users have to follow the WIG rules to prepare the file before upload. To properly visualize the tracks, users also need to set up the CONF files. We also suggest users to use –log10(P) to for each SNP P-value. We have prepared an example for the WIG file (gwas_snp_stat_Teslovich_HDL_CEU.txt.wig) and CONF file (wig_plot.conf) so that user can easily try to see how it works. See the documentation online for other details. Other important steps are shown below: To help users convert their own P-value files (example : gwas_snp_stat_Teslovich_HDL_CEU.txt) to WIG format, we also provide a simple Python code: pval2wig.py, which can be used in Windows, Linux or Mac. The example SNP P-value is from Genome Wide Associations Scans for HDL-C (http://www.sph.umich.edu/csg/abecasis/public/lipids2010/) CONF file example: [track_1] database = database_0 # do not change this! category = Custom Tracks:gwas_snp_stat_Teslovich_HDL_CEU.txt.wig show summary = 0 glyph = wiggle_xyplot graph_type = points fgcolor = salmon bgcolor = salmon height = 100 point_symbol = filled_disc point_radius = 7 key = SNP_Pval (-log10) feature = 001 label density = 10000 bump density = 10000 Select the file to upload: After finishing uploading the track, Gbrowse will generate a default configure file for the track you upload. It defines the track to show as box plot. By clicking the ‘edit’ button on the right, users will be able to edit the configuration of their P-value track. If users want to show their SNP P-value as Manhattan plot, we recommend users to use the following configuration (CONF file). Please keep in mind that the users cannot change ‘database = database_0’ because it is where the uploaded tracks locate. After input the proper configuration file, user can submit and go back to the browser. A new track named as SNP_Pval is shown in the browser, where users can easily navigate across the genome. Study example using seeQTL Let’s assume I am interested in gene CD47 and I would like to know how its gene expression is associated with other SNPs. I can just use the search engine of seeQTL to find the top associated SNPs across the genome among different eQTL datasets. All I need to do is: 1) Search the CD47. A whole genomic view of CD47 associated map will be shown. I can easily find out where the CD47 associated SNPs locates across the genome. In this case, chr3 is enriched with a lot of associated SNPs. Since CD47 is also in chr3, the enrichment of associated SNPs just shows strong cisassociations of CD47 with adjacent SNPs (cis-eQTL or local-eQTL). Below the genomic view, I can also find the top associated list of SNPs together with their location and link. Each link will direct me to the genomic location it points to. Where to search: Genome wide association map of CD47: 2) Then I want to see how the local eQTL affect the gene expression of CD47 gene location, I just click the location link of CD47 and directed to the genomic location of CD47. In the new window, I found many tracks of eQTLs and I am just interested the meta-analysis results here, I just look at the track named as ‘hapmap LCL cis-eQTL meta-analysis Manhattan plot’, which shows the Manhattan plot of location association of CD47. In the pictures below, I didn’t find many strong associated SNPs in the gene body of CD47. The reason is that sometimes the regulatory element is upstream (3’-UTR) or downstream (5’-UTR) of gene body. In order to check other SNPs close to CD47, I just select to zoomout into a 200 kb windows. Then I can find a strong associated SNP in the 5’-UTR of CD47 with q-value > 10 (marked in green). That particular SNP could be the SNP regulating the gene expression of CD47. Click the gene link: Directed to genomic location of CD47: Zoom out to a 200 kbp window in the top-right panel of the page: New windows showing Manhattan plot of more SNPs associated with CD47: 3) In the figure above, I also observed similar pattern of Manhattan plot for IFT57, which is a different gene upstream of CD47, this could be an indication that both genes are correlated and regulated by the same genomic regions. This example is just an illustration on how to explore seeQTL and find scientific interesting results. All the other genes and SNPs can be searched and browsed in the same way.