DATASET 1

Sarah Carrier
SILS/MRC
February 24, 2009
This document includes a very simple, informal example of metadata for the datasets
associated with one publication, arranged in the structure of the Dryad Application
Profile, version 1.0. Notes for discussion and lingering issues are at the bottom;
other notes appear inline alongside certain elements.
DATASET 1
dryad:status: incomplete
PUBLICATION
dc:type: article
dc:creator: Arnar Palsson
dc:creator: Ann Rouse
dc:creator: Rebecca Riley-Berger
dc:creator: Ian Dworkin
dc:creator: Greg Gibson
dc:contributor: Greg Gibson (He is the corresponding author!)
dc:title: Nucleotide Variation in the Egfr Locus of Drosophila melanogaster
dcterms:issued: 2004
dc:publisher: The Genetics Society of America
dc:subject: (NO KEYWORDS ASSOCIATED WITH THIS JOURNAL)
dcterms:abstract: The Epidermal growth factor receptor is an essential gene with diverse
pleiotropic roles in development throughout the animal kingdom. Analysis of sequence
diversity in 10.9 kb covering the complete coding region and 6.4 kb of potential
regulatory regions in a sample of 250 alleles from three populations of Drosophila
melanogaster suggests that the intensity of different population genetic forces varies
along the locus. A total of 238 independent common SNPs and 20 indel polymorphisms
were detected, with just six common replacements affecting >1475 amino acids, four of
which are in the short alternate first exon. Sequence diversity is lowest in a 2-kb portion
of intron 2, which is also highly conserved in comparison with D. simulans and D.
pseudoobscura. Linkage disequilibrium decays to background levels within 500 bp of
most sites, so haplotypes are generally restricted to up to 5 polymorphisms. The two
North American samples from North Carolina and California have diverged in allele
frequency at a handful of individual SNPs, but a Kenyan sample is both more divergent
and more polymorphic. The effect of sample size on inference of the roles of population
structure, uneven recombination, and weak selection in patterning nucleotide variation in
the locus is discussed.
dcterms:temporal: (n/a)
dcterms:spatial: (n/a)
darwincore:Scientific Name: Drosophila melanogaster
darwincore:Scientific Name: D. simulans
darwincore:Scientific Name: D. pseudoobscura
dcterms:isPartOf: 1943-2631 (this is the ONLINE ISSN, not print)
dc:identifier: 10.1534/genetics.104.026252
dcterms:bibliographicCitation: Palsson, Arnar, Rouse, Ann, Riley-Berger, Rebecca,
Dworkin, Ian, Gibson, Greg. Nucleotide Variation in the Egfr Locus of Drosophila
melanogaster. Genetics 2004 167: 1199-1212
dcterms:hasPart: (handles for datasets here)
DATA OBJECT 1
dc:type: data
dc:creator: Arnar Palsson
dc:creator: Ann Rouse
dc:creator: Rebecca Riley-Berger
dc:creator: Ian Dworkin
dc:creator: Greg Gibson
dc:contributor: Greg Gibson (corresponding author)
dc:title: GenBank File 17571116 (Egfr Sequence)
dc:identifier: (Dryad handle)
dcterms:isPartOf: 10.1534/genetics.104.026252
DDI:depositr: Sarah Carrier
dcterms:available: 2/19/06
dcterms:issued: 2/19/06
dcterms:extent: (however big it is)
dcterms:format: Microsoft Word
dcterms:temporal: (n/a)
dcterms:spatial: (n/a)
darwincore:Scientific Name: Drosophila melanogaster
darwincore:Scientific Name: D. simulans
darwincore:Scientific Name: D. pseudoobscura
dc:subject: (keywords would be by default inherited from the article…NO KEYWORDS
ASSOCIATED WITH THIS JOURNAL)
dc:rights: (Dryad statement)
dc:description: (nothing immediately available to cut and paste into this field)
DATA OBJECT 2
dc:type: data
dc:creator: Arnar Palsson
dc:creator: Ann Rouse
dc:creator: Rebecca Riley-Berger
dc:creator: Ian Dworkin
dc:creator: Greg Gibson
dc:contributor: Greg Gibson (corresponding author)
dc:title: List of Amino Acid Replacement variants in DER
dc:identifier: (Dryad handle)
dcterms:isPartOf: 10.1534/genetics.104.026252
DDI:depositr: Sarah Carrier
dcterms:available: 2/19/06
dcterms:issued: 2/19/06
dcterms:extent: (however big it is)
dcterms:format: PDF
dcterms:temporal: (n/a)
dcterms:spatial: (n/a)
darwincore:Scientific Name: Drosophila melanogaster
darwincore:Scientific Name: D. simulans
darwincore:Scientific Name: D. pseudoobscura
dc:subject: (NO KEYWORDS ASSOCIATED WITH THIS JOURNAL)
dc:rights: (Dryad statement)
dc:description: (nothing immediately available to cut and paste into this field)
DATA OBJECT 3
dc:type: data
dc:creator: Arnar Palsson
dc:creator: Ann Rouse
dc:creator: Rebecca Riley-Berger
dc:creator: Ian Dworkin
dc:creator: Greg Gibson
dc:contributor: Greg Gibson (corresponding author)
dc:title: LD plot for all common polymorphisms
dc:identifier: (Dryad handle)
dcterms:isPartOf: 10.1534/genetics.104.026252
DDI:depositr: Sarah Carrier
dcterms:available: 2/19/06
dcterms:issued: 2/19/06
dcterms:extent: (however big it is)
dcterms:format: PDF
dcterms:temporal: (n/a)
dcterms:spatial: (n/a)
darwincore:Scientific Name: Drosophila melanogaster
darwincore:Scientific Name: D. simulans
darwincore:Scientific Name: D. pseudoobscura
dc:subject: (NO KEYWORDS ASSOCIATED WITH THIS JOURNAL)
dc:rights: (Dryad statement)
dc:description: (nothing immediately available to cut and paste into this field)
...etc. There are many other files associated with this publication.
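The data object records above repeat most of the publication's values (creators, scientific names) and point back to it through dcterms:isPartOf. A minimal sketch of that inheritance follows, assuming records are held as ordered (element, value) pairs because elements such as dc:creator repeat. The names INHERITED, values, and new_data_object are hypothetical, made up for this illustration; this is not how Dryad or DSpace actually builds records.

```python
from typing import List, Tuple

# A record is an ordered list of (element, value) pairs, since elements repeat.
Record = List[Tuple[str, str]]

# Assumption for illustration: these elements are copied from the article.
INHERITED = ("dc:creator", "dc:subject", "darwincore:Scientific Name")

publication: Record = [
    ("dc:type", "article"),
    ("dc:creator", "Arnar Palsson"),
    ("dc:creator", "Ann Rouse"),
    ("dc:creator", "Rebecca Riley-Berger"),
    ("dc:creator", "Ian Dworkin"),
    ("dc:creator", "Greg Gibson"),
    ("dc:identifier", "10.1534/genetics.104.026252"),
    ("darwincore:Scientific Name", "Drosophila melanogaster"),
    ("darwincore:Scientific Name", "D. simulans"),
    ("darwincore:Scientific Name", "D. pseudoobscura"),
]


def values(record: Record, element: str) -> List[str]:
    """Return every value recorded for one element, in order."""
    return [value for name, value in record if name == element]


def new_data_object(pub: Record, title: str, fmt: str) -> Record:
    """Start a data object record that inherits from its publication."""
    record: Record = [
        ("dc:type", "data"),
        ("dc:title", title),
        ("dcterms:format", fmt),
        # Link back to the article by its identifier (the DOI here).
        ("dcterms:isPartOf", values(pub, "dc:identifier")[0]),
    ]
    for element in INHERITED:
        record.extend((element, value) for value in values(pub, element))
    return record


ld_plot = new_data_object(publication, "LD plot for all common polymorphisms", "PDF")
for element, value in ld_plot:
    print(f"{element}: {value}")
```

Because this journal supplies no keywords, the publication has no dc:subject values and the data object simply inherits none.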
WHERE TO PUT THIS CONTACT INFORMATION?
--remember: DSpace cannot use hierarchical elements such as DDI:contact, which is why
we have to drop this element.
Department of Genetics, North Carolina State University, Raleigh, North Carolina
27513-7614
Corresponding author: (GREG) Department of Genetics, Gardner Hall, North Carolina
State University, Raleigh, NC 27695-7614.
E-mail: ggibson@unity.ncsu.edu
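For discussion only: one way to picture the problem is to flatten a nested contact block into single-level element names. The sketch below is an assumption-laden illustration, not a proposal from the application profile. The flatten function and the flat names it produces (contact.name and so on) are hypothetical and are not elements of the Dryad Application Profile or of DDI.

```python
from typing import Dict, List, Tuple, Union

Nested = Union[str, Dict[str, "Nested"]]


def flatten(element: str, value: Nested, path: Tuple[str, ...] = ()) -> List[Tuple[str, str]]:
    """Turn a nested element into flat (name, value) pairs with dotted names."""
    if isinstance(value, dict):
        pairs: List[Tuple[str, str]] = []
        for key, child in value.items():
            pairs.extend(flatten(element, child, path + (key,)))
        return pairs
    return [(".".join((element,) + path), value)]


# Contact details taken from the note above; the nesting itself is hypothetical.
contact = {
    "name": "Greg Gibson",
    "affiliation": "Department of Genetics, North Carolina State University",
    "email": "ggibson@unity.ncsu.edu",
}

flat = flatten("contact", contact)
for name, value in flat:
    print(f"{name}: {value}")
```

Each flattened name would still need a home in some flat schema, which is the open question this note raises.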