Department of Genetics Data Archival Policy

advertisement
The Department of Genetics and Genome Sciences
Genomics Core Sequencing Services:
Frequently Asked Questions (FAQs)
Thanks for your interest in the CWRU Genomics Core and deciding to undertake Next Gen Sequencing! It can
be a daunting task if you are new to Genomics and we are here to help. We ask that you please review our
FAQ to help answer some general questions you may have. After reviewing the FAQ please contact us at
nmb64@case.edu with any further questions or to begin the project planning process.
About the Genomics Core
1. Why should I choose the CWRU Genomics Core?
Experience: The CWRU Genomics Core has been providing quality services for over eight years to the
Case Western Reserve Community, University Hospitals, Cleveland Clinic, the surrounding Cleveland
area, as well as institutions outside of Ohio.
Personalized service: We are happy to help you through every step of your project from consultation on
the proper sequencing technology, sample preparation, through bioinformatics analysis.
Professional Project Management: Your samples, work-order, and time are very important to us.
Better Quality Data: Our unique post-sequencing pipeline offers sophisticated read processing and
contaminant filtering so that you receive only the highest quality data possible.
Premiere Journal Publications: Our clients have published in several high impact journals including
Science, Nature, and Cell.
Discounts: Please contact us to discuss specialized pricing for large volume orders and pilot projects.
2. What services does the Genomics Core have to offer?
The CWRU Genomics Core offers a wide range of services to all investigators at CWRU, its partnering
institutions, as well as external collaborators.
Sequencing: We leverage the Illumina Next Gen Sequencing (NGS) platforms consisting of the
HiScan SQ and Genome Analyzer IIx (GAIIx) instruments. Additionally, we also perform Sanger
sequencing on an Applied Biosystems 3730 DNA Analyzer.
NGS Library Prep: The Genomics Core has hands-on experience with various Illumina NGS DNA
and RNA library prep kits as well as target enrichment technologies for custom capture projects. We
can help you decide which of these kits fits best with your project needs and handle the work for you.
Genotyping/Gene Expression: The Genomics Core provides a variety of Illumina-based
genotyping array options including genome-wide studies with up to 5.0 million markers, customdesigned sets of up to several thousand markers, and small-scale projects comprising only a few
markers. DNA methylation profiling, with single CpG resolution, as well as human and mouse linkage
analysis projects can also be accommodated.
We can also assist with the design of custom genotyping projects.
Next Gen Sequencing Services
3. What types of NGS libraries do you construct?
For DNA libraries we typically use the Illumina library kits, TruSeq and Nextera, for samples including
ChIP-Seq and Whole Genome Sequencing. We also employ the Illumina TruSeq and Epicenter Scriptseq
kits for RNA library construction. We have worked with other library preparation kits including Agilent
and Illumina Custom Capture kits.
4. What are the sample submission requirements for library prep services?


Submit all PURIFIED DNA/RNA samples in 1.5ml tubes, clearly labeled with sample name, the
date, and your name/initials.
Fill out the Library Prep Request form and submit/email any QC performed on the samples.
DNA Libraries
Quantity
Volume
QC required
TruSeq DNA
Nextera DNA
chIP-seq DNA
1ug
50ng
~10ng
< 50ul
< 20ul
< 30ul
260/280: 1.8-2.0
260/280: 1.8-2.0
260/280: 1.8-2.0
0.1-4.0ng
50-250 ng
100-250 ng
< 50ul
< 10ul
< 10ul
RIN: 8.0-10.0
RIN: 8.0-10.0
RIN: 8.0-10.0
3ug
< 100ul
260/280: 1.8-2.0
RNA Libraries
TruSeq RNA
Script Seq – rRNA-depleted
Script-Seq – poly(A) eukaryotic RNA per reaction.
Custom Capture
Agilent Sure Select Target Enrichment
5. How many libraries can I pool/index/plex/barcode in one lane?
For indexing standard DNA and RNA libraries we use the Illumina TruSeq indexing protocol which allows
for a maximum of 24 uniquely indexed sample libraries to be pooled together and sequenced in one
lane. The number of reads needed per sample will dictate how many samples can be pooled per lane.
It is suggested to reduce the number of libraries in a pool when possible to minimize the chances of
uneven sequencing read distribution results. As a general warning read distributions may vary for each
sample in a multi-plexed lane increasing potential variation with each additional sample.
It is possible to pool prokaryotic samples using the Nextera DNA Library Prep kit which has the ability to
pool up to 96 samples. The Nextera kit is a specialized kit and cannot be used for all types of samples
and analysis. If you want to know if your samples fit the kit, contact the Genomics Core for a
consultation.
6. Can I supply my own libraries? What are the requirements?
Yes, the Genomics Core accepts and runs libraries that are not made in our lab. We ask that you
provide any QC (qPCR, Bioanalyzer HS chip, etc) performed on the completed library and fill out the
NGS User Agreement Form. The Genomics Core does offer a library quantification package to check
library quality and will QC all samples that are submitted to determine if they can continue onto
sequencing. When filling out the Agreement form make sure to include all pertinent information,
including samples names, pooling/indexing information and any custom sequencing primers required
for the run.



Submit at least 10ul of DNA/RNA library sample in 1.5ml tube, clearly labeled with sample
name, the date, and your name/initials.
For custom primers, please label the given concentration.
Fill out the Illumina NGS Agreement form and submit/email any QC performed on the libraries.
Please feel free to contact us with questions as you are prepping your samples to ensure that you get
only the highest quality results.
7. How much does it cost?
Please refer to the CWRU Genomics Core pricing list at the end of this document or check out our
website http://genomics.case.edu. We partner with several other cores on campus to provide a full
complement of services to allow investigators to go from tissue to data. One of our partners, the
Translational Research Core, provides DNA/RNA extraction or purification services from all manner of
tissue samples.
Special discounts are available for large scale projects. Please contact Nick Beckloff at
nmb64@case.edu for more information.
8. How much sequencing coverage/sequencing depth/many reads do I need?
The ideal amount of coverage is variable and depends on your specific project. Please read this very
helpful technical note from Illumina on how to best estimate sequencing coverage required for
different types of projects.
www.illumina.com/documents/products/technotes/technote_coverage_calculation.pdf
9. How many reads can I expect per lane on the Illumina HiScan SQ and the GAIIx?
For high quality “Standard DNA” or “Standard RNA” libraries sequenced on our HiScan SQ platform
(using V3 chemistry) we typically estimate reads results of approximately 90M reads (+/- 20%) per lane.
For high quality “Standard DNA” or “Standard RNA” libraries sequenced on our GAIIx platform, we
typically estimate reads results of approximately 35M reads (+/- 20%) per lane.
Please inquire with us to receive our most current sequencing results approximations for specialized
libraries (sRNA, ChIP, RRBS, etc.).
10. What is the average turnaround time?
While we try to process all samples as efficiently as possible the following time frames may vary.
Library Prep: We strive to make libraries within 1-2 weeks upon receipt of samples and reagents. The
total amount of time to process is ultimately dependent on the number, type, and quality of samples.
All samples are QC’ed to ensure quality and if the libraries fail the QC process the PI will be contacted
and we will discuss how to move forward.
Sequencing: We try to obtain a turnaround time of approximately 4-6 weeks (for a full flow cell project
of 7 lanes) upon receipt of samples/libraries passing our quality control testing. This can be extended
by several factors including sample failure, machine down time, or holidays. For expedited service it is
easier for us to run projects that can fill an entire flow cell.
Please note that turnaround times for non-full flow cell projects are completely dependent on other
projects filling the remaining lanes of the flow cell before sequencing can commence.
11. How long will Core hold onto my samples?
The Genomics Core stores all remaining library at -20 degrees for a period of 90 days after full project
completion. Users are welcome to pick up samples at any time before this point.
12. How can I set up a research support contract with the Genomics Core?
Please contact Dr. Nicholas Beckloff (nmb64@case.edu). He will discuss your project with you and
provide you with plans and cost estimates. We can also assist in the preparation of your grants
proposals and manuscripts.
Bioinformatics
13. How will my results be delivered?
Standard Results (base calling, quality scores, FASTQC and FASTQ/BAM files) can be delivered in one of
several ways:




Via a secure FTP site/Helix account
External hard drive
CWRU Galaxy delivery
CWRU High Performance Computing Cluster
CWRU Genomics Core’s unique post-sequencing pipeline offers sophisticated read processing and
contaminant filtering so that you receive only the highest quality data possible.
14. How can I work with the Genomics Core to analyze my sequencing data?
Depending upon your specific needs and in-house resources there are two approaches you may
consider:
1. Typically we QC the data off the sequencer and will do a post-run alignment converting the
data to BAM files. Some labs take their data as FASTQ files and others ask for alignments.
When your needs extend beyond this realm we have the second option available.
2. We collaborate with the Cancer Center Statistics & Bioinformatics Core to provide informatics
and statistics support for all manner of projects.
15. What are the standard bioinformatics options?
Through the activities of our researchers we developed informatics protocols for analyzing sequence
data. At present these include:

Expression Quantitation & Differential Expression Analysis (RNA-Seq)



SNP & Small Indel Prediction
Exome analysis and Variant Detection
De Novo Transcript Assembly of RNA-Seq Data
16. What is your data archiving policy?
The Genetics and Genome Sciences Department has a Data Archival Policy that is required to be signed
before sequencing with the Genomics Core. Once sequencing data has been delivered, the raw images
are discarded and standard output is retained for six months.
Special data storage and delivery requests may be accommodated.
Thank you very much for considering us for your project, if you have any additional questions please email:
nmb64@case.edu. We hope to hear from you soon!
Pricing
Illumina NGS
*Prices include necessary reagents and processing fees; does NOT include library prep fees or QC fees of libraries.
GAIIX
Single Read
36-base run
76-base run
100-base run
HiScan SQ
Price per Lane*
$650
$875
$1,075
Paired-End Read
2x 36-base run
2x 76-base run
2x 100-base run
Single Read
50 base run
$695
100 base run
$925
Paired-End Read
2x 50-base run
$1,500
Price per Lane*
2x 100-base run
$1,200
$1,450
$1,925
$2,385
Informatics
$100/hr
Illumina NGS Library Prep
*Prices include necessary reagents and processing fees; does NOT include sequencing fees or QC fees of libraries
Illumina chIP-seq Sample Prep
Total/sample
Price *
$400
Illumina TruSeq DNA prep
Total/sample
$365
Nextera DNA Prep Kit
Total/1 sample
$280
Total/8 samples
$850
Special Illumina Seq Reagent required for Nextera libraries per flow cell
$90
Illumina TruSeq RNA Prep
Total/1 sample
$370
Script-Seq RNA prep
Total/1 sample
TBD
Total/8 samples
TBD
Exome Capture Library Prep (Agilent)
Total/1 sample
$350
Covaris Shearing and Quality Control (QC) Services
Covaris Shearing
Price *
Total/sample
$10
Qubit of submitted total RNA , DNA, or prepared library
Fewer than 10 samples
n/c
More than 10 samples
$1.00/sample
qPCR of prepared library
Total per sample
$30
Special Illumina Seq Reagent required for Nextera libraries per flow cell
$90
Agilent Bioanalyzer HS Chip of prepared library
Total per chip – 11 samples per chip
$100
Other Services
*For Illumina projects, prices include ONLY processing fee. Reagents are extra and should be purchased directly through Illumina.
Instrument
Applied Biosystems
3730 DNA Analyzer
Price
Sanger Sequencing
$8.00
Per reaction
$6.00
Per reaction per full 96-well plate
EXO/SAP PCR Clean-up
$1.00
Per reaction
Microsatellite/SNPs/SNaPshot
$5.00
Per reaction
$3.00
Per reaction per full 96-well plate
Price *
Instrument
Illumina HiScan &
BeadXpress
Infinium assay
$150
Per 12-sample chip
$200
Per 24-sample chip
GoldenGate assay
$500
Per 96-well plate
VeraCode assay
$500
Per 96-well plate
Gene Expression assay
$150
Per 2 chips (12/24 samples)
Download