RAD Sequencing Design and Data Analysis John Davey RAD Sequencing Meeting

RAD Sequencing Design and Data Analysis
John Davey
Institute of Evolutionary Biology
University of Edinburgh
RAD Sequencing Meeting
Wednesday 21 October 2009
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Illumina GAIIx Sequencing Machine
15 million reads per lane
50 base pairs per read (soon to be 100)
750 Mb per lane
8 lanes per run
6 Gb per run
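The per-lane and per-run yields quoted above follow from multiplying read count by read length. A minimal sketch of that arithmetic, using only the figures on the slide (the function names are illustrative and not from the talk):

```python
# Figures quoted on the slide for the Illumina GAIIx.
READS_PER_LANE = 15_000_000
READ_LENGTH_BP = 50
LANES_PER_RUN = 8


def lane_yield_bp(reads_per_lane: int, read_length_bp: int) -> int:
    """Bases sequenced in one lane."""
    return reads_per_lane * read_length_bp


def run_yield_bp(reads_per_lane: int, read_length_bp: int, lanes: int) -> int:
    """Bases sequenced in one run of several lanes."""
    return lane_yield_bp(reads_per_lane, read_length_bp) * lanes


lane_bp = lane_yield_bp(READS_PER_LANE, READ_LENGTH_BP)
run_bp = run_yield_bp(READS_PER_LANE, READ_LENGTH_BP, LANES_PER_RUN)
print(f"per lane: {lane_bp / 1e6:.0f} Mb")
print(f"per run:  {run_bp / 1e9:.0f} Gb")
```

Passing a read length of 100 instead of 50 gives the yield for the longer reads the slide anticipates, assuming the read count per lane stays the same.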
How many individuals?
How much coverage?
Genome size (and GC content)
Restriction enzyme
Number of reads per lane
Estimated number of RAD tags
                                 Plutella xylostella   Lymnaea stagnalis   H. sapiens
Genome size (and GC content)     339 Mb                1.2 Gb              3.4 Gb
Number of reads per lane         15 million            15 million          15 million

Restriction enzyme
SbfI     5’...CCTGCAGG...3’
         3’...GGACGTCC...5’
EcoRI    5’...GAATTC...3’
         3’...CTTAAG...5’
BbvCI    5’...CCTCAGC...3’
         3’...GGAGTCG...5’
Plutella xylostella with SbfI (5’...CCTGCAGG...3’), p = 0.25^8

Genome size (and GC content)     339 Mb
Restriction enzyme               SbfI
Average fragment size            1 / 0.25^8 = 65 Kb
Symmetric?                       Yes
Number of reads per lane         15 million
Estimated number of RAD tags     339 Mb / 65 Kb = 5,172 tags; doubled to 10,345 tags for a symmetric site
Number of reads per tag          15m / 10,345 = 1,450 reads with 1 individual per lane; 120 reads with 12; 60 reads with 24
Number of individuals per lane   24
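The SbfI estimate for Plutella xylostella can be reproduced with a few lines of arithmetic. This is a sketch under the slide's own assumptions (every base equally likely at each position, and the 15 million reads shared evenly across tags and individuals). The function names are hypothetical, and results can differ from the slide by a tag or a read because of rounding.

```python
def site_probability(site_length: int) -> float:
    """Chance that a given position starts the recognition site,
    assuming each of the four bases has probability 0.25."""
    return 0.25 ** site_length


def average_fragment_size(site_length: int) -> float:
    """Expected distance in bp between neighbouring sites: 1 / p."""
    return 1 / site_probability(site_length)


def estimated_tags(genome_bp: float, site_length: int, symmetric: bool) -> float:
    """Expected RAD tags; the slide doubles the count for a symmetric site."""
    sites = genome_bp * site_probability(site_length)
    return 2 * sites if symmetric else sites


def reads_per_tag(reads_per_lane: float, tags: float, individuals: int) -> float:
    """Average reads per tag per individual when a lane is shared evenly."""
    return reads_per_lane / tags / individuals


# Plutella xylostella (339 Mb) cut with SbfI (8-base symmetric site).
tags = estimated_tags(339e6, 8, symmetric=True)
print(f"average fragment size: {average_fragment_size(8):,.0f} bp")
print(f"estimated RAD tags:    {tags:,.0f}")
for individuals in (1, 12, 24):
    depth = reads_per_tag(15e6, tags, individuals)
    print(f"{individuals:2d} individuals per lane: about {depth:,.0f} reads per tag")
```

Turning the last function around (fixing a target depth and solving for the number of individuals) is the design question the slide poses: how many individuals, at how much coverage.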
SbfI (5’...CCTGCAGG...3’), p = 0.25^8

                                 Plutella xylostella   Lymnaea stagnalis   H. sapiens
Genome size (and GC content)     339 Mb                1.2 Gb              3.4 Gb
Restriction enzyme               SbfI                  SbfI                SbfI
Average fragment size            65 Kb                 65 Kb               65 Kb
Symmetric?                       Yes                   Yes                 Yes
Number of reads per lane         15 million            15 million          15 million
Estimated number of RAD tags     10,345 tags           36,621 tags         103,759 tags
Number of reads per tag          60 reads              17 reads            6 reads
Number of individuals per lane   24                    24                  24
EcoRI (5’...GAATTC...3’), p = 0.25^6

                                 Plutella xylostella   Lymnaea stagnalis   H. sapiens
Genome size (and GC content)     339 Mb                1.2 Gb              3.4 Gb
Restriction enzyme               EcoRI                 EcoRI               EcoRI
Average fragment size            4 Kb                  4 Kb                4 Kb
Symmetric?                       Yes                   Yes                 Yes
Number of reads per lane         15 million            15 million          15 million
Estimated number of RAD tags     165,526 tags          585,936 tags        1,660,156 tags
Number of reads per tag          90 reads              25 reads            9 reads
Number of individuals per lane   1                     1                   1
BbvCI (5’...CCTCAGC...3’), p = 0.25^7

                                 Plutella xylostella   Lymnaea stagnalis   H. sapiens
Genome size (and GC content)     339 Mb                1.2 Gb              3.4 Gb
Restriction enzyme               BbvCI                 BbvCI               BbvCI
Average fragment size            16 Kb                 16 Kb               16 Kb
Symmetric?                       No                    No                  No
Number of reads per lane         15 million            15 million          15 million
Estimated number of RAD tags     20,690 tags           73,242 tags         207,519 tags
Number of reads per tag          60 reads              17 reads            6 reads
Number of individuals per lane   12                    12                  12
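The "Symmetric?" answer for each enzyme can be checked by comparing the recognition site with its reverse complement. The sketch below does that and then recomputes the expected tag counts for all three enzymes and genomes. It assumes equal base frequencies and the doubling rule for symmetric sites used on the slides. The helper names are illustrative, and the printed counts may differ from the slide values by a tag or two because of rounding.

```python
# Recognition sites and genome sizes as given on the slides.
SITES = {"SbfI": "CCTGCAGG", "EcoRI": "GAATTC", "BbvCI": "CCTCAGC"}
GENOMES_BP = {
    "Plutella xylostella": 339e6,
    "Lymnaea stagnalis": 1.2e9,
    "H. sapiens": 3.4e9,
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(site: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return site.translate(COMPLEMENT)[::-1]


def is_symmetric(site: str) -> bool:
    """True when the site reads the same on both strands."""
    return site == reverse_complement(site)


def expected_tags(genome_bp: float, site: str) -> float:
    """Expected RAD tags, doubled for symmetric sites as on the slides."""
    sites = genome_bp * 0.25 ** len(site)
    return 2 * sites if is_symmetric(site) else sites


for enzyme, site in SITES.items():
    symmetry = "symmetric" if is_symmetric(site) else "not symmetric"
    print(f"{enzyme} ({site}): {symmetry}")
    for species, genome_bp in GENOMES_BP.items():
        print(f"  {species:20} {expected_tags(genome_bp, site):>12,.0f} tags")
```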
Plutella xylostella with SbfI (5’...CCTGCAGG...3’), adjusted for GC content

                                 p = 0.25^8          p = 0.2^6 * 0.3^2
Genome size (and GC content)     339 Mb              339 Mb (40%)
Restriction enzyme               SbfI                SbfI
Average fragment size            65 Kb               87 Kb
Symmetric?                       Yes                 Yes
Number of reads per lane         15 million          15 million
Estimated number of RAD tags     10,345 tags         3,905 tags
Number of reads per tag          60 reads            160 reads
Number of individuals per lane   24                  24
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
60 reads
Symmetric?
Yes
Number of individuals per lane
24
Plutella xylostella
15 million
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
60 reads
Symmetric?
Yes
Number of individuals per lane
24
Plutella xylostella
15 million
Example read (labelled parts: Barcode, SbfI site):
ACGTATGCAGGGTAGTGTTGTGCCTTTTAGCATGTGCCTTTTAGCATGTG
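A minimal sketch of how such a read could be split into its parts. It assumes a 5-base barcode (inferred from the example read, not stated on the slide) followed by TGCAGG, the part of the SbfI site (CCTGCA^GG) that remains on the fragment after cutting. `split_read` and `RadRead` are hypothetical names.

```python
from typing import NamedTuple, Optional

SBFI_REMNANT = "TGCAGG"  # portion of CCTGCAGG left on the read after the cut


class RadRead(NamedTuple):
    barcode: str
    site: str
    sequence: str


def split_read(read: str, barcode_len: int = 5) -> Optional[RadRead]:
    """Split a read into barcode, site remnant and genomic sequence.

    Returns None when the site remnant is not found right after the barcode.
    The barcode length of 5 is an assumption taken from the example read.
    """
    barcode = read[:barcode_len]
    site_end = barcode_len + len(SBFI_REMNANT)
    site = read[barcode_len:site_end]
    if site != SBFI_REMNANT:
        return None
    return RadRead(barcode, site, read[site_end:])


example = "ACGTATGCAGGGTAGTGTTGTGCCTTTTAGCATGTGCCTTTTAGCATGTG"
parts = split_read(example)
print(parts)
```

Reads that return None here would be set aside rather than assigned to an individual.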
[Figure: % of all reads with p>0.01 at each position (0 to 49), for RAD tags and genomic DNA; y-axis 0 to 50%]
[Figure: % of all reads by number of errors in read (0 to 49), for RAD tags and genomic DNA; y-axis 0 to 15%]
Individuals: F M C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 E1 E2 E3 E4 E5 E6 E7 E8 E9 E10 E11 E12
F = Father (R/R)
M = Mother (R/S)
E = Experiment (R/R)
C = Control (R/R or R/S)
Plutella xylostella
Genome size (and GC content): 339 Mb (40%)
Restriction enzyme: SbfI
Estimated number of RAD tags: 10,345 tags
Average fragment size: 65 Kb
Symmetric? Yes
Number of reads per lane: 7.5 million
Number of reads per tag: 30 reads
Number of individuals per lane: 24
[Diagram: four rows of counts beneath the 24 individuals, 24 values per row]
Row 1: 24 7 72 45 121 122 3 90 157 141 211 108 25 77 107 89 124 180 147 36 15 117 155 140
Row 2: 9 0 38 0 0 1 1 51 49 32 25 27 0 0 0 35 0 55 45 0 5 44 47 43
Row 3: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
Row 4: 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 1 0 1 0 0 0 0 0
Tag length = 26

Reads per individual by read error threshold
Individual         p<0.01      p<0.1        p<1
Father              3,048     23,402     42,746
Mother             15,702    148,940    294,441
Control 1          22,739    173,717    315,764
Control 2          19,447    136,710    242,588
Control 3          63,508    189,824    284,603
Control 4          72,155    208,394    298,976
Control 5           3,081      8,435     12,113
Control 6          58,195    192,807    291,177
Control 7         123,558    352,517    522,155
Control 8          78,095    199,258    275,231
Control 9         100,405    224,210    302,329
Control 10         64,353    154,814    216,053
Experiment 1      105,588    250,483    338,414
Experiment 2      112,224    282,414    399,781
Experiment 3       67,902    171,671    239,129
Experiment 4       30,581     77,581    107,889
Experiment 5       43,969    155,502    244,158
Experiment 6       57,167    169,644    255,064
Experiment 7       78,945    231,898    335,162
Experiment 8       68,111    211,327    316,014
Experiment 9       94,087    261,847    408,983
Experiment 10      79,528    229,096    337,275
Experiment 11      19,954    174,902    326,866
Experiment 12       1,728     13,078     24,111
TOTAL           1,384,070  4,242,471  6,431,022
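One possible reading of the p<0.01, p<0.1 and p<1 columns is that a read is counted under p<x when every base in its first 26 positions has an error probability below x. That reading is an assumption, as is the use of Phred-scaled qualities (error = 10^(-Q/10)); the talk does not define p. Under those assumptions, the counting could be sketched as follows, with hypothetical function names:

```python
from typing import Dict, Iterable, Sequence


def phred_to_error(q: float) -> float:
    """Convert a Phred-scaled quality to an error probability (assumed scale)."""
    return 10 ** (-q / 10)


def worst_base_error(quals: Sequence[float], length: int = 26) -> float:
    """Largest per-base error probability within the first `length` bases."""
    return max(phred_to_error(q) for q in quals[:length])


def count_by_threshold(
    reads_quals: Iterable[Sequence[float]],
    thresholds: Sequence[float] = (0.01, 0.1, 1.0),
) -> Dict[float, int]:
    """Count reads whose worst base error is below each threshold."""
    counts = {t: 0 for t in thresholds}
    for quals in reads_quals:
        worst = worst_base_error(quals)
        for t in thresholds:
            if worst < t:
                counts[t] += 1
    return counts


# Three toy reads with uniform qualities 30, 15 and 5.
toy = [[30] * 50, [15] * 50, [5] * 50]
print(count_by_threshold(toy))
```

Because each threshold includes the reads passing the stricter ones, the counts can only grow from left to right, as they do in every row of the table.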
Mother: 294,441 / 10,345 = 28.4    Threshold: 28.4 * 0.1 = 2.8
Father: 42,746 / 10,345 = 4.1      Threshold: 4.1 * 0.1 = 0.4

With thresholding of p<0.1 reads:
Reads: 4,242,471 -> 3,978,032
Tags: 126,004 -> 15,541
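The thresholding arithmetic above can be sketched in Python. The threshold is taken as 10% of an individual's reads divided by the estimated number of RAD tags, as on the slide; keeping tags whose count is at least the threshold (rather than strictly above it) is an assumption, and the function names are hypothetical.

```python
from collections import Counter
from typing import Dict, Mapping


def coverage_threshold(total_reads: int, expected_tags: int,
                       fraction: float = 0.1) -> float:
    """Threshold = fraction of the mean number of reads per expected tag."""
    return fraction * total_reads / expected_tags


def filter_tags(tag_counts: Mapping[str, int], total_reads: int,
                expected_tags: int, fraction: float = 0.1) -> Dict[str, int]:
    """Keep tags seen at least `threshold` times (assumed comparison)."""
    threshold = coverage_threshold(total_reads, expected_tags, fraction)
    return {tag: n for tag, n in tag_counts.items() if n >= threshold}


EXPECTED_TAGS = 10_345
print(round(coverage_threshold(294_441, EXPECTED_TAGS), 1))  # Mother
print(round(coverage_threshold(42_746, EXPECTED_TAGS), 1))   # Father

# Toy example: one well-covered tag and one tag seen only once.
counts = Counter(["A" * 26] * 5 + ["C" * 26])
print(filter_tags(counts, 294_441, EXPECTED_TAGS))
```

A threshold below 1, as for the Father, removes nothing, so low-coverage individuals keep every observed tag, including likely sequencing errors.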
Read error threshold: p<0.1
Read trim length: 26 bp

Individual      All reads  Unique tags
Father             23,402        3,389
Mother            148,940        7,097
Control 1         173,717        6,410
Control 2         136,710        5,960
Control 3         189,824        6,836
Control 4         208,394        6,318
Control 6         192,807        6,886
Control 7         352,517        6,874
Control 8         199,258        6,817
Control 9         224,210        6,484
Control 10        154,814        6,453
Experiment 1      250,483        6,949
Experiment 2      282,414        7,167
Experiment 3      171,671        6,632
Experiment 4       77,581        5,987
Experiment 5      155,502        6,507
Experiment 6      169,644        6,698
Experiment 7      231,898        6,737
Experiment 8      211,327        6,355
Experiment 9      261,847        6,724
Experiment 10     229,096        6,452
Experiment 11     174,902        6,578
TOTAL           4,220,958
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
SbfI site number variation
with GC content
7.5 million
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
SbfI site number variation
with GC content
50%: 10,345
7.5 million
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
SbfI site number variation
with GC content
50%: 10,345
45%: 6,652
40%: 3,905
7.5 million
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
7.5 million
SbfI site number variation
with GC content
50%: 10,345
45%: 6,652
40%: 3,905
From 42 454 contigs
with SbfI sites
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
7.5 million
SbfI site number variation
with GC content
From 42 454 contigs
with SbfI sites
50%: 10,345
45%: 6,652
40%: 3,905
28 tag pairs found
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Genome size (and GC content) 339 Mb (40%)
Number of reads per lane
Restriction enzyme
SbfI
Estimated number of RAD tags 10,345 tags
Average fragment size
65 Kb
Number of reads per tag
30 reads
Symmetric?
Yes
Number of individuals per lane
24
Read error threshold
p<0.1
Read trim length
All reads
Father
26 bp
Unique tags
23,402
3,389
Mother
148,940
7,097
Control 1
173,717
6,410
Control 2
136,710
5,960
Control 3
189,824
6,836
Control 4
208,394
6,318
Control 6
192,807
6,886
Control 7
352,517
6,874
Control 8
199,258
6,817
Control 9
224,210
6,484
Control 10
154,814
6,453
Experiment 1
250,483
6,949
Experiment 2
282,414
7,167
Experiment 3
171,671
6,632
Experiment 4
77,581
5,987
Experiment 5
155,502
6,507
Experiment 6
169,644
6,698
Experiment 7
231,898
6,737
Experiment 8
211,327
6,355
Experiment 9
261,847
6,724
Experiment 10
229,096
6,452
Experiment 11
174,902
6,578
TOTAL
4,220,958
Plutella xylostella
7.5 million
SbfI site number variation
with GC content
From 42 454 contigs
with SbfI sites
50%: 10,345
45%: 6,652
40%: 3,905
28 tag pairs found
11 singletons found
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Plutella xylostella
Genome size (and GC content)      339 Mb (40%)
Number of reads per lane          7.5 million
Restriction enzyme                SbfI
Estimated number of RAD tags      10,345 tags
Average fragment size             65 Kb
Number of reads per tag           30 reads
Symmetric?                        Yes
Number of individuals per lane    24
Read error threshold              p<0.1
Read trim length                  26 bp

Individual       All reads    Unique tags
Father              23,402          3,389
Mother             148,940          7,097
Control 1          173,717          6,410
Control 2          136,710          5,960
Control 3          189,824          6,836
Control 4          208,394          6,318
Control 6          192,807          6,886
Control 7          352,517          6,874
Control 8          199,258          6,817
Control 9          224,210          6,484
Control 10         154,814          6,453
Experiment 1       250,483          6,949
Experiment 2       282,414          7,167
Experiment 3       171,671          6,632
Experiment 4        77,581          5,987
Experiment 5       155,502          6,507
Experiment 6       169,644          6,698
Experiment 7       231,898          6,737
Experiment 8       211,327          6,355
Experiment 9       261,847          6,724
Experiment 10      229,096          6,452
Experiment 11      174,902          6,578
TOTAL            4,220,958

SbfI site number variation with GC content
50%: 10,345
45%: 6,652
40%: 3,905

From 42 454 contigs with SbfI sites
28 tag pairs found
11 singletons found
3 tag pairs not found
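The design figures on this slide can be checked with a little arithmetic. The sketch below is not the author's calculation. It assumes that the SbfI recognition site is CCTGCAGG (six G/C and two A/T bases), that bases occur independently at frequencies set by the GC content, that every cut site yields two RAD tags, and that results are truncated to whole tags. The function names are hypothetical.

```python
# Assumption: SbfI recognises CCTGCAGG (6 G/C bases, 2 A/T bases).
SBFI_SITE = "CCTGCAGG"


def expected_rad_tags(genome_size_bp, gc_content, site=SBFI_SITE):
    """Expected RAD tags, assuming independent bases and two tags per cut site."""
    p_gc = gc_content / 2        # probability of one specific base, G or C
    p_at = (1 - gc_content) / 2  # probability of one specific base, A or T
    p_site = 1.0
    for base in site:
        p_site *= p_gc if base in "GC" else p_at
    return int(2 * genome_size_bp * p_site)  # truncate to whole tags


def reads_per_tag(reads_per_lane, n_tags, n_individuals):
    """Mean reads per tag per individual if reads were spread evenly."""
    return reads_per_lane / (n_tags * n_individuals)


genome = 339_000_000  # 339 Mb
for gc in (0.50, 0.45, 0.40):
    print(f"GC {gc:.0%}: {expected_rad_tags(genome, gc):,} tags")

tags = expected_rad_tags(genome, 0.50)
# One fragment per cut site, i.e. per pair of tags.
print(f"Average fragment size: {genome / (tags / 2) / 1000:.1f} Kb")
print(f"Reads per tag: {reads_per_tag(7_500_000, tags, 24):.1f}")
```

Under these assumptions the sketch gives 10,345, 6,652 and 3,905 tags for 50%, 45% and 40% GC, a fragment size of about 65 Kb, and about 30 reads per tag, in line with the slide.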
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Plutella xylostella
[Figure: tag patterns across the 22 individuals (columns F, M, C1-C4, C6-C10, E1-E11), marked with ●, - and *, with the number of tags showing each pattern. Tag counts shown: 602, 5, 440, 5, 100, 62.]
62 / 15,441 = 0.4%
2^22 = 4,194,304
1 / 4,194,301 = 0.00002%
1 / 15,441 = 0.006%
Most common ‘interesting’ segmentation pattern (of 6408 segmentation patterns)
F = Father (R/R)
M = Mother (R/S)
E = Experiment (R/R)
C = Control (R/R or R/S)
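The percentages on this slide follow from counting patterns. As a rough sketch, assuming each tag is simply scored as present or absent in each of the 22 individuals, and taking 15,441 (the slide's denominator) as the total number of tags considered, the arithmetic can be reproduced as follows. The variable and function names are illustrative only.

```python
# Individuals as labelled in the column header (there is no C5).
individuals = (
    ["F", "M"]
    + [f"C{i}" for i in (1, 2, 3, 4, 6, 7, 8, 9, 10)]
    + [f"E{i}" for i in range(1, 12)]
)

# Assumption: a tag is either present or absent in each individual,
# so the number of possible patterns is 2 to the power of the sample size.
n_patterns = 2 ** len(individuals)

total_tags = 15_441   # denominator used on the slide (assumed: all tags)
pattern_tags = 62     # tags showing the pattern highlighted on the slide


def percent(numerator, denominator):
    """Express a ratio as a percentage."""
    return 100 * numerator / denominator


print(len(individuals), "individuals,", f"{n_patterns:,}", "possible patterns")
print(f"tags with the highlighted pattern: {percent(pattern_tags, total_tags):.1f}%")
print(f"a single tag among all tags:       {percent(1, total_tags):.3f}%")
print(f"a single pattern among all:        {percent(1, n_patterns):.5f}%")
```

The comparison shows why the pattern stands out: 62 tags share one pattern, whereas a pattern drawn uniformly at random from all possible patterns would be expected to account for far less than one tag.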
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Plutella xylostella
[Figure: per-individual read counts (columns F, M, C1-C4, C6-C10, E1-E11) and tag counts for the 62 tags.]
36 of 62 tags have variants 1bp apart
F = Father (R/R)
M = Mother (R/S)
E = Experiment (R/R)
C = Control (R/R or R/S)
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Plutella xylostella
[Table: read counts per individual (columns F, M, C1-C4, C6-C10, E1-E11) for each variant of the tags below.]

CTACACGCTGAAAGACCCATATTCGA
CTACACGCTGAAAGACCCATGTTCGA
CTACACGCTGAAAGACCCATTTTCGA

ATATCAGTGATCTTCCAAGTGCGATC
ATATCAGTGATCTTCCAAGTGCGGTC
ATATCAGTAATCTTCCAAGTGCGATC

GCGCCCCGCGCTTTGTCCGTGTGTAA
GCGCCCCGCGCTTTGTCCGTGTGTAG

CTTATAGGGACATGCTGGTTAAGGCT
CTTATAGGGACATGCTGGTGAAGGCT

TAGACCAGATGTCTGATGAATGGTGA
TAGACCAGATGTCTGATGACTGGTGA

TCATAATGGGCTCTTTTCCACCCACT
TCATAATGGGCTCTTTTCCACCTACT

F = Father (R/R)
M = Mother (R/S)
E = Experiment (R/R)
C = Control (R/R or R/S)
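Tags that are "1bp apart" can be found by comparing equal-length reads position by position. The following is a minimal sketch of that idea using a Hamming distance; it is an illustration, not necessarily the method used for the analysis shown here. The three 26 bp sequences are copied from the slide; the function and variable names are hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# Three 26 bp variants of one tag, copied from the slide.
variants = [
    "ATATCAGTGATCTTCCAAGTGCGATC",
    "ATATCAGTGATCTTCCAAGTGCGGTC",
    "ATATCAGTAATCTTCCAAGTGCGATC",
]

# Report the distance between every pair of variants.
for a, b in combinations(variants, 2):
    print(a, b, hamming(a, b))

# Keep only the pairs that differ at exactly one position.
one_bp_pairs = [
    (a, b) for a, b in combinations(variants, 2) if hamming(a, b) == 1
]
print(len(one_bp_pairs), "pairs are 1 bp apart")
```

Comparing all pairs is quadratic in the number of tags, which is fine for a few thousand tags per individual; a larger data set would likely need an indexed approach.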
RAD Sequencing Design and Data Analysis
Experimental Design
Restriction Enzymes
Data Quality
SNP Discovery
Mark Blaxter
Marian Thomson
Urmi Trivedi
Karim Gharbi
Simon Baxter
Maureen Liu
Eric Johnson
Paul Etter