Repository_notes - CBO

advertisement
Repository notes
Started: 2/28/2011
The goal is a distributed database of images, biological data and models. The hope is that if the files
are adequately annotated, using an agreed upon set of reference ontologies, then the distributed
files can be located via a Google search. The user should not need to know where the files are
located. Furthermore, the annotation syntax should be as non-restrictive as possible and annotation
of a file should not change the usability of the file for its original porpose.
Longer range goals might include annotation styles linking annotations to entities in the data file.
For example, in a CC3D model linking cell names and/or type IDs to annotations. Annotations in an
SBML file that link variable names to protein names.
File and data types:
Major issue is that the file type has to be "crawlable" and "indexable" by Google. File types to try
include:






Any plain text files including computational scripts in python, C++ etc.
Tab or csv data table files containing numeric values (e.g., from ToxCast, a high throughput
assay, or an archive of common physiochemical parameters like affinity constants, diffusion
rates and the like).
MSWord, PowerPoint and PDF files.
HTML files.
Images (jpg, gif …) either as photographs (or micrographs) of tissues, cells etc. or images
generated as output of a modeling program.
Movies of tissues, cells etc. or images generated as from the output of a modeling program.
For the image type of files it would be nice if Google would index the file format's native annotation
block. That would allow the annotation to actually live in the file. (See the file
somatogenesis_image_CC3D_model.jpg for an example.) Unfortunately, it doesn't look like Google
does that. To associate annotation with an image file we need to place both in an HTML (or PDF or
…) file.
Annotations:
The goal is to include the annotation without interfering with the native file format. Annotations in
code or script files should live inside comment delimiters suitable for the particular language. In
non-code files (like HTML, word, csv) this isn't as much of an issue. Major issue is that there needs
to be an agreed upon set of reference ontologies so that end users know which ontologies to search
for query terms. A secondary issue is that a user friendly tool for finding terms in the reference
ontologies would be very helpful.
Possible Reference Ontologies:
Major issue is multi-species requirements. FMA is human can it be adequately used for other
species? In some cases yes, in many cases no.
Ontologies / vocabularies searchable by the querying tools marked with *.











Mammalian body parts, locations and processes: FMA*
(http://sig.biostr.washington.edu/projects/fm/FME/index.html)
Non-mammalian body parts: ?
Gene Processes: GO*
(http://amigo.geneontology.org/cgi-bin/amigo/go.cgi)
Gene and protein names: PRO*
(http://pir.georgetown.edu/pro/)
Small molecules: ChEBI*
(http://www.ebi.ac.uk/chebi/)
Images: FBbi*
(http://www.obofoundry.org/cgi-bin/detail.cgi?id=image)
Experimental descriptions: ?
Model descriptions: EDAM*
Diseases: ?
Catch-all Reference vocabulary: MeSH
(http://www.nlm.nih.gov/mesh/MBrowser.html)
Another Catch-all Reference Ontology: UMLS
(http://www.nlm.nih.gov/research/umls/ don't know of any online search engines.)
Possible user tools for querying the reference ontologies:
http://www.ebi.ac.uk/ontology-lookup/ (doesn't include UMLS or MeSH!)
http://bioportal.bioontology.org/
Annotation Style:
The key requirement is that the annotation contains the terms, IDs and reference ontology
identifier. For example "FMA:82840 embryonic cell" identifies the ontology (FMA), ID (82840) and
the term name ("embryonic cell"). There should be a unique ontology identifier for the typical
ontologies so that the ID and ontology can be closely linked, perhaps to a single "word" level, such
as "FMA:82840". An explicit listing of the ontologies being used might also be useful. It may be
advantageous to have a text template that lists the major types of annotations that might be
included along with the suitable reference ontology names. The template is not part of the data
structure, it is simply meant to help the annotator work through the process in an orderly fashion.
Major sections of the annotation template might be something along the lines of:
1. Process being modeled/studied (examples: tumor growth, immune response,
embryogenesis)
2. Tissues in the model/system (examples: liver, mammary tumor)
3. Cells in the model/system (examples: macrophage, neuron, epithelium)
4. Non-cell structures in the model/system (examples: bone, basement membrane)
5. Genes and gene products in the model system (examples: VEGF-A, IL-1b)
6. Molecules not covered by 5. (examples: oxygen, glucose, potassium ions)
7. Processes and behaviors of the things (cells, genes, tissues) in the model/system (examples:
gene expression, cell growth, division and death, diffusion)
Some annotated files:
Jpeg image from Julio & Susan’s poster of somitogenesis. Annotations added to the jpeg file
somitogenesis_image.jpg:
Somitogenesis image
Foundational Model of Anatomy FMA:
FMA:69068 embryo
FMA:82840 embryonic cell
Gene Ontology GO:
GO:0009790 embryogenesis
GO:0009792 embryo development ending in birth or egg hatching
GO:0016055 Wnt receptor signaling pathway
Protein Ontology PRO:
PRO:000017443 protein Wnt-3a
Biological Imaging Methods FBbi:
FBbi:00000265 micrograph
MeSH:A16.504.660.750 somites
HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY
Indiana University Bloomington
University of Washington
CC3D (CompuCell3D) Result Image CC3D_somitogenesis_image.jpg:
Somitogenesis computational model image
Foundational Model of Anatomy FMA:
FMA:69068 embryo
FMA:82840 embryonic cell
Gene Ontology GO:
GO:0009790 embryogenesis
GO:0009792 embryo development ending in birth or egg hatching
GO:0016055 Wnt receptor signaling pathway
Protein Ontology PRO:
PRO:000017443 protein Wnt-3a
EMBRACE Ontology for Data and Methods EDAM:
EDAM:0000950 Biological model
Medical Subject Heading MeSH:
MeSH:E05.599 Models, Theoretical
MeSH:E05.599.395 Models, Biological
MeSH:L01.224.160 Computer Simulation
MeSH:A16.504.660.750 somites
Authors:
Susan Hester, Julio Belmonte
Indiana University, Bloomington
Computational Platform:
CompuCell3D CC3D
HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY
Indiana University Bloomington
University of Washington
MP4 Movie file: Crawling_Neutrophil_chasing_bacterium.mp4
Neutrophil chasing a bacterium
UniProt Taxonomy Database NEWT:
NEWT:2 Bacteria
Cell Type Ontology CL:
CL:0000775 neutrophil "polynuclear neutrophilic leucocyte" neutrocyte
Foundational Model of Anatomy FMA:
FMA:62860 "Polymorphonuclear leukocyte"
FMA:82840 embryonic cell
Gene Ontology GO:
GO:0006909 phagocytosis
GO:0060326 "cell chemotaxis"
GO:0048870 "cell motility"
Crawling_Neutrophil_chasing_bacter
Protein Ontology PRO:
Biological Imaging Methods FBbi:
FBbi:00000265 micrograph
FBbi:00000267 animation
HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY
Indiana University Bloomington
University of Washington
Current file list at IU:
8 March 2011 4:38 PM















0_Read_me.html
3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.html
3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.txt
BIOMD0000000201.xml
Crawling_Neutrophil_chasing_bacterium.mp4
Crawling_Neutrophil_chasing_bacterium_annot.wmv
Crawling_Neutrophil_chasing_bacterium_annotation.html
JimsStyle.css
Repository_notes.docx
document_list_IU.html
somitogenesis_CC3D_script.txt
somitogenesis_image.jpg
somitogenesis_image_CC3D_model.jpg
somitogenesis_image_CC3D_model_annotation.html
somitogenesis_image_annotation.html
3/8/2011
Google finds the pages using, e.g., "CHEBIO:15379 cc3d herbs and jims …"
But problem with the svn paths FIXED
MeSH terms used for typical biomodels.
Abbas et al. Tumor
Niko et al. Tumor
Cardiac ventricular action
potential
PLoS One. 2009 Oct 16;4(10):e7190.
PLoS One. 2010 May 26;5(5):e10641.
Circ Res. 1994 Jun;74(6):1071-96.
"3D multi-cell simulation of
tumor growth and
angiogenesis" Shirinifard A,
Gens JS, Zaitlen BL, Popławski
NJ, Swat M, Glazier JA.
"Front instabilities and
invasiveness of simulated 3D
avascular tumors" Poplawski
NJ, Shirinifard A, Agero U, Gens
JS, Swat M, Glazier JA.
"A dynamic model of the
cardiac ventricular action
potential. I. Simulations of ionic
currents and concentration
changes" Luo CH, Rudy Y.
MH
MH
MH
MH
-
MH
MH
MH
MH
-
MH
MH
MH
-
MH
MH
MH
MH
MH
MH
MH
-
MH
MH
MH
-
MH
MH
MH
MH
MH
-
MH
-
MH
MH
-
MH
MH
MH
MH
-
Animals
Anoxia
Cell Proliferation
Cell Transformation,
Neoplastic
Computer Simulation
Humans
Imaging, ThreeDimensional
Models, Biological
Models, Molecular
Neoplasms/*blood
supply/*pathology
*Neovascularization,
Pathologic
Oxygen/chemistry
Software
Spheroids, Cellular
Time Factors
MH
-
Cell Proliferation
*Computer Simulation
Humans
*Models, Biological
Neoplasm Invasiveness
Neoplasms/*blood
supply/*pathology
- Time Factors
*Action Potentials
Animals
Calcium/metabolism
Calcium
Channels/physiology
Carrier
Proteins/physiology
Guinea Pigs
Heart/*physiology
Ion Channels/*physiology
Models, Biological
Potassium
Channels/physiology
Sodium-Calcium Exchanger
Sodium-PotassiumExchanging ATPase
Annotations for typical high throughput screens
3/16/2011
Google with "BIOMD:0000000201 BIOMODELS" returns one link, the correct one to the biomodels
DB.
Google with "GO:0007219 biomodels" tags from within the SBML file doesn't find the Biomodels file
at the BioModelsDB but does find a pdf version of the model at CalTech. (Three links total returned)
Google with "GO:0016055 PRO:000017443 HERBS AND JIMS BIG DATA AND COMPUTATIONAL
MODEL REPOSITORY" find the somitogenesis files in HERBS/JIMS but not the original or reannotated SBML file. (Google hasn't indexed it yet?)
3/23/2011
Still don't seem to find the biomodel files. Used the Biomodels DB tool to create a PDF summary
description of the model file (BIOMD0000000201.pdf) and added it to the repository.
Current file list on server:
model_repo_v0 - Revision 209:






















.3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.html.swp
.3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.html.swx
.DS_Store
.TemporaryItems/
0_Read_me.html
3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.html
3D_multi-cell_simulation_of_tumor_growth_and_angiogenesis.txt
BIOMD0000000201.pdf
BIOMD0000000201.xml
BIOMD0000000201_mod.xml
Crawling_Neutrophil_chasing_bacterium.mp4
Crawling_Neutrophil_chasing_bacterium_annot.wmv
Crawling_Neutrophil_chasing_bacterium_annotation.html
JimsStyle.css
Repository_notes.docx
Schoetz_cell_sorting/
document_list_IU.html
somitogenesis_CC3D_script.txt
somitogenesis_image.jpg
somitogenesis_image_CC3D_model.jpg
somitogenesis_image_CC3D_model_annotation.html
somitogenesis_image_annotation.html
4/5/2011
Checked again using "GO:0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL
REPOSITORY" and "urn:miriam:obo.go:GO:0008543 HERBS AND JIMS BIG DATA AND
COMPUTATIONAL MODEL REPOSITORY", still not finding the original BioModels or the LaTex
versions of the BioModels file. Perhaps Google doesn't index terms such as
"urn:miriam:obo.go:GO:0008543"?
Added a table to the top of the BioModels pdf file with:
------------- added annotations jps -------------------------------HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY
Annotations added to a BioModels generated LaTex version of a BioModels SBML file.
urn:miriam:obo.go:GO%3A0007219
GO%3A0007219
urn:miriam:obo.go:GO%3A0008543
GO%3A0008543
urn:miriam:obo.go:GO%3A0016055
GO:0008543
GO:0016055
Notch signaling pathway
fibroblast growth factor receptor
signaling pathway
Wnt receptor signaling pathway
------------- end added annotations jps -------------------------------The new pdf file is BIOMD0000000201_mod.pdf


Fixed misspelling of "biological" in several of the html file's keywords.
Gave the various HTML pages suitable HTML <title> tags.
4/7/2011
Google with:
"GO:0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
"Wnt receptor signaling pathway HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
"urn:miriam:obo.go:GO%3A0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
"urn:miriam:obo.go:GO:0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
"urn Miriam obo go GO:0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
"urn Miriam obo go GO 0008543 HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY"
No hits. Wait a couple more days and try again.
Multiple data sources should share a common biological description.
("C:\Users\James Sluka\Desktop\Biocomplexity\Ontologies\Common markup across multiple data source.pptx")
There is a newer version of the above figure (with an SBML model)
4/20/11
From Herbert:
James, sorry to take so long but I now have a url for you:
http://sys-bio.org/TestRepository/model_repo_v0.htm
This has a list of 10 biomodels in sbml and pdf format. To each sbml file I added this
text: "HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY Indiana University
Bloomington University of Washington"
Is there anything else I need to do?
Herb
Added his list of files to the index file (document_list_IU.html) and sent him a copy.
8/2/2011
FOAF annotations in RDF so Google will index
A FOAF file in RDF:
<?xml version="1.0"?>
<rdf:RDF xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<foaf:Person rdf:about="#JW">
<foaf:name>Jimmy Wales</foaf:name>
<foaf:mbox rdf:resource="mailto:jwales@bomis.com" />
<foaf:homepage rdf:resource="http://www.jimmywales.com/" />
<foaf:nick>Jimbo</foaf:nick>
<foaf:depiction rdf:resource="http://www.jimmywales.com/aus_img_small.jpg" />
<foaf:interest rdf:resource="http://www.wikimedia.org" rdfs:label="Wikipedia" />
<foaf:knows>
<foaf:Person>
<foaf:name>Angela Beesley</foaf:name>
</foaf:Person>
</foaf:knows>
</foaf:Person>
</rdf:RDF>
In RDF/XML
<?xml version="1.0"?>
<rdf:RDF xmlns="http://www.semanticweb.org/owl/owlapi/turtle#"
xml:base="http://www.semanticweb.org/owl/owlapi/turtle"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:protege="http://protege.stanford.edu/plugins/owl/protege#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:xsp="http://www.owl-ontologies.com/2005/08/07/xsp.owl#"
xmlns:p1="#"
xmlns:xml="http://www.w3.org/XML/1998/namespace"
xmlns:daml="http://www.daml.org/2000/12/daml+oil#"
xmlns:oiled="http://oiled.man.example.net/test#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:swrl="http://www.w3.org/2003/11/swrl#"
xmlns:ns0="http://cohse.semanticweb.org/ontologies/people#"
xmlns:swrlb="http://www.w3.org/2003/11/swrlb#"
xmlns:map="http://www.gate.ac.uk/ns/ontologies/MappingMetamodel.owl#"
xmlns:owl="http://www.w3.org/2002/07/owl#"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:skos="http://www.w3.org/2004/02/skos/core#">
<!-///////////////////////////////////////////////////////////////////////////////////////
// Annotation properties
///////////////////////////////////////////////////////////////////////////////////////
-->
<!-///////////////////////////////////////////////////////////////////////////////////////
// Individuals
///////////////////////////////////////////////////////////////////////////////////////
-->
<!-- #JW -->
<foaf:Person rdf:about="#JW">
<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#NamedIndividual"/>
<foaf:interest rdf:resource="http://www.wikimedia.org"/>
<foaf:homepage rdf:resource="http://www.jimmywales.com/"/>
<foaf:depiction rdf:resource="http://www.jimmywales.com/aus_img_small.jpg"/>
<foaf:mbox rdf:resource="mailto:jwales@bomis.com"/>
<foaf:knows>
<foaf:Person/>
</foaf:knows>
</foaf:Person>
<!-///////////////////////////////////////////////////////////////////////////////////////
// Annotations
///////////////////////////////////////////////////////////////////////////////////////
-->
<rdf:Description rdf:about="http://www.wikimedia.org">
<rdfs:label>Wikipedia</rdfs:label>
</rdf:Description>
</rdf:RDF>
<!-- Generated by the OWL API (version 3.2.1.1753) http://owlapi.sourceforge.net -->
RDF converters:
http://owl.cs.manchester.ac.uk/converter/convert
http://www.rdfabout.com/demo/validator/validate.xpd
My version using urn:Miriam:obo tags (GO:0008543 is FGFr signaling pathway). Not sure if the
GO:0008543 part should use something else for the ":". Not sure if the Miriam markup should be the
"about", the "name" or something else. Same with the "HERBS AND JIMS…"
<?xml version="1.0"?>
<rdf:RDF xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<foaf:Person rdf:about="#urn:miriam:obo.go:GO:0008543">
<foaf:name>fibroblast growth factor receptor signaling pathway</foaf:name>
<foaf:nick>FGFr signaling pathway</foaf:nick>
<foaf:knows>
<foaf:Person>
<foaf:name>HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY</foaf:name>
</foaf:Person>
</foaf:knows>
</foaf:Person>
</rdf:RDF>
Added to the top of the BIOMD0000000201_mod.xml file and saved as
BIOMD0000000201_mod_FOAF.xml.
8/8/2011
Added biomodel with the DBBR annotations in the <notes> block of the header.
BIOMD0000000201_DBBR_notes.xml
8/10/2011
Google q's for the BIOMD0000000201_DBBR_notes.xml:
urn:miriam:interpro:IPR013284 Distributed Biodata and Biomodel Resource
IPR013284 Distributed Biodata and Biomodel Resource
Beta-catenin Distributed Biodata and Biomodel Resource Distributed Biodata
and Biomodel Resource
No hits yet
Created Word and Excel versions of the raw XML BIOMD0000000201.xml file. Also created
versions of the DBBR annotated file.
BIOMD0000000201_DBBR_notes.xlsx
BIOMD0000000201.xlsx
BIOMD0000000201_DBBR_notes.docx
BIOMD0000000201.docx
8/15/2011
Still not found
8/22/2011
Google with query:
urn:miriam:interpro:IPR013284 Distributed Biodata and Biomodel Resource
Finds:
 BIOMD0000000201_DBBR_notes.xml - Cell Behavior Ontology
cellbehaviorontology.org/.../BIOMD0000000201_DBBR_notes.xml - Cached
Distributed Biodata and Biomodel Resource Annotations ... ERK1/2 MAP kinase ## DBBR: Bp
urn:miriam:interpro:IPR013284 Beta-catenin ## DBBR: Fgf ...
 [DOC]
BIOMD0000000201_DBBR_notes.docx - Cell Behavior Ontology
cellbehaviorontology.org/models/BIOMD0000000201_DBBR_notes.docx
Distributed Biodata and Biomodel Resource Annotations ######################## ..... <rdf:li
rdf:resource="urn:miriam:interpro:IPR013284"/> ...
 [XLS]
BIOMD0000000201_DBBR_notes.xlsx - Cell Behavior Ontology
cellbehaviorontology.org/models/BIOMD0000000201_DBBR_notes.xlsx
10, ########## Distributed Biodata and Biomodel Resource Annotations ..... 301, <rdf:li
rdf:resource="urn:miriam:interpro:IPR013284"/> ...
File name:
Q: urn:miriam:interpro:IPR013284 Distributed Biodata and Biomodel Resource
(IPR013284 is Beta-catenin) 4 hits total, first hit is the repository_notes.doc file.
BIOMD0000000201.docx
BIOMD0000000201.pdf
BIOMD0000000201.xlsx
BIOMD0000000201.xml
BIOMD0000000201_DBBR_notes.docx
BIOMD0000000201_DBBR_notes.xlsx
BIOMD0000000201_DBBR_notes.xml
BIOMD0000000201_mod.pdf
BIOMD0000000201_mod.xml
BIOMD0000000201_mod_FOAF.xml
Found by
Google?
no
no
no
no
Yes
Yes
Yes
no
no
no
Q: IPR013284 Distributed Biodata and Biomodel Resource
Finds the same three files
Q: Beta-catenin Distributed Biodata and Biomodel Resource
Finds the same three plus five non-DBBR files (relevant hits are all first)
Note that no files are found directly in the BioModels database!
5 April 2012: the sentence above no longer appears to be true. There are now static pages, e.g.,
http://www.ebi.ac.uk/biomodels-main/static-pages.do?page=ModelMonth%2F2009-04
File: BIOMD0000000201_DBBR_notes.xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- This model was downloaded from BioModels Database -->
<!-- Thu Mar 03 22:08:56 GMT 2011
-->
<!-- http://www.ebi.ac.uk/biomodels/
-->
<sbml xmlns="http://www.sbml.org/sbml/level2/version3" metaid="metaid_0000001" level="2" version="3">
<model metaid="metaid_0000002" id="Goldbeter2008_Somite_Segmentation_Clock_Notch_Wnt_FGF">
<notes>
########## Distributed Biodata and Biomodel Resource Annotations ########################
##
In this file
Ontology Term URN
Ontology Term
Synonyms
## DBBR: Rasi
urn:miriam:interpro:IPR001806
Small GTPase superfamily
## DBBR: D
urn:miriam:interpro:IPR003351
Dishevelled protein
## DBBR: N
urn:miriam:interpro:IPR008297
Notch
## DBBR: ERKa
urn:miriam:interpro:IPR008349
ERK1/2 MAP kinase
## DBBR: Bp
urn:miriam:interpro:IPR013284
Beta-catenin
## DBBR: Fgf
urn:miriam:interpro:IPR017338
Fibroblast growth factor, 15/19/21
## DBBR: MF
urn:miriam:interpro:IPR017374
Fringe
## DBBR: Bp
urn:miriam:kegg.compound:C00562
Phosphoprotein
## DBBR:
urn:miriam:kegg.pathway:ko04330
Notch signaling pathway
## DBBR: Rasa
urn:miriam:obo.chebi:CHEBI%3A15996
GTP
Guanosine 5'-triphosphate
## DBBR: Rasi
urn:miriam:obo.chebi:CHEBI%3A17552
GDP
Guanosine 5'-diphosphate, Guanosine diphosphate
## DBBR:
urn:miriam:obo.chebi:CHEBI%3A33699
messenger RNA
## DBBR:
urn:miriam:obo.go:GO%3A0001756
somitogenesis
formation of mesodermal clusters
## DBBR: cytosol
urn:miriam:obo.go:GO%3A0005829
cytosol
## DBBR:
urn:miriam:obo.go:GO%3A0006402
mRNA catabolic process
mRNA breakdown, mRNA catabolism, mRNA degradation
## DBBR:
urn:miriam:obo.go:GO%3A0006412
translation
protein biosynthesis, protein formation
## DBBR:
urn:miriam:obo.go:GO%3A0006468
protein phosphorylation
protein amino acid phosphorylation
## DBBR:
urn:miriam:reactome:REACT_11045.1
Pathway: Signaling by Wnt (Homo sapiens)
## DBBR:
urn:miriam:reactome:REACT_11065.1
Pathway: Beta-catenin phosphorylation cascade (Homo sapiens)
## DBBR:
urn:miriam:reactome:REACT_1482.4
Pathway: ERK activation (Homo sapiens)
## DBBR:
urn:miriam:reactome:REACT_299.2
Pathway: Signaling by Notch (Homo sapiens)
## DBBR:
urn:miriam:reactome:REACT_9470.2
Pathway: Signaling by FGFR (Homo sapiens)
## DBBR:
urn:miriam:taxonomy:32524
Amniota
amniotes
## DBBR: D
urn:miriam:uniprot:O14640
DVL1
Segment polarity protein dishevelled homolog DVL-1
Dishevelled-1, DSH homolog 1
## DBBR: Bp
urn:miriam:uniprot:P35222
Catenin beta-1
Beta-catenin, CTNNB1, CTNNB
## DBBR: N
urn:miriam:uniprot:P46531
Notch 1
Neurogenic locus notch homolog protein 1, hN1, Translocation-associated notch protein TAN1, Notch1, Tan1
## DBBR: K
urn:miriam:uniprot:P49841
GSK-3 beta
Glycogen synthase kinase-3 beta, GSK3B
## DBBR: MDusp
urn:miriam:uniprot:Q16828
DUSP6
Dual specificity protein phosphatase 6, Dual specificity protein phosphatase PYST1,
Mitogen-activated protein kinase phosphatase 3, MKP3, PYST1
## DBBR: MF
urn:miriam:uniprot:Q8NES3
LFNG
Beta-1,3-N-acetylglucosaminyltransferase lunatic fringe, O-fucosylpeptide 3-beta-Nacetylglucosaminyltransferase
## DBBR: A
urn:miriam:uniprot:Q9Y2T1
AXIN2
Axin-2, Axin-like protein, Axil, Axis inhibition protein 2, Conductin
<p xmlns="http://www.w3.org/1999/xhtml">This is a model of the coupled Natch, Wnt and FGF modules as described in:<br/>
<b>A. Goldbeter and O. Pourqui�</b>,
<em>Modeling the segmentation clock as a network of coupled oscillations in the Notch, Wnt and FGF signaling pathways.</em> J Theor Biol. 2008 Jun 7;252(3):574-85, pubmed ID:
<a href="http://www.ncbi.nlm.nih.gov/pubmed/18308339">18308339</a>
<br/>
To uncouple the modules remove the reaction
<em>MAx_trans_Xa</em> and set
<em>vsFK=vsF</em>.
<br/>
The SBML version of the model was converted from the CellML
<a href="http://www.cellml.org/models/goldbeter_pourquie_2008_version02">version</a> by Catherine Lloyd for the
<a href="http://www.cellml.org/models"> CellML repository </a>.
</p>
<p xmlns="http://www.w3.org/1999/xhtml">This model originates from BioModels Database: A Database of Annotated Published Models (http://www.ebi.ac.uk/biomodels/). It is copyright (c)
2005-2010 The BioModels.net Team.<br/>
For more information see the <a href="http://www.ebi.ac.uk/biomodels/legal.html" target="_blank">terms of use</a>.<br/>
To cite BioModels Database, please use: <a href="http://www.ncbi.nlm.nih.gov/pubmed/20587024" target="_blank">Li C, Donizelli M, Rodriguez N, Dharuri H, Endler L, Chelliah V, Li L, He E,
Henry A, Stefan MI, Snoep JL, Hucka M, Le Nov�re N, Laibe C (2010) BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models. BMC Syst Biol.,
4:92.</a></p>
</notes>
<annotation>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:vCard="http://www.w3.org/2001/vcard-rdf/3.0#" xmlns:bqbiol="http://biomodels.net/biology-qualifiers/" xmlns:bqmodel="http://biomodels.net/model-qualifiers/">
<rdf:Description rdf:about="#metaid_0000002">
<dc:creator rdf:parseType="Resource">
<rdf:Bag>
<rdf:li rdf:parseType="Resource">
<vCard:N rdf:parseType="Resource">
<vCard:Family>Lloyd</vCard:Family>
<vCard:Given>Catherine</vCard:Given>
Test to see if Google indexes Mathematica and MatLab files. File BIOMD0000000201.xml converted
to Mathematica (BIOMD0000000201_jasrnacLite2_2_mathematica.m) and MatLab
(BIOMD0000000201_jasrnacLite2_2_matlab.m) using JarnacLite2.
Added DBBR annotations from BIOMD0000000201_DBBR_notes.xml
BIOMD0000000201_jasrnacLite2_2_mathematica_DBBR_notes.m (In the "Synopsis" section.)
BIOMD0000000201_jasrnacLite2_2_matlab_DBBR_notes.m (% comments in the "Synopsis"
section.)
9/9/2011
Google with query: (same as last time I searched)
urn:miriam:interpro:IPR013284 Distributed Biodata and Biomodel Resource
Found the same 4 files (one is the word doc) as before
Google query for the somitogenesis files (using the old tag for DBBR) :
FMA:82840 embryonic cell GO:0016055 Wnt receptor signaling pathway HERBS AND
JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY
Finds just four files, all in the IU model repository (http://cbo.biocomplexity.indiana.edu/):
/models/somitogenesis_CC3D_script.txt
/svn/model_repo_v0/Repository_notes.docx
/svn/model_repo_v0/somitogenesis_image_CC3D_model_annotation.html
/svn/model_repo_v0/somitogenesis_image_annotation.html
(There are two paths to the same directory, http://cbo.biocomplexity.indiana.edu/models/ and
http://cbo.biocomplexity.indiana.edu/svn/model_repo_v0/ )
FOAF test. Looking for the FOAF annotated file BIOMD0000000201_mod_FOAF.xml
Google query:
HERBS AND JIMS BIG DATA AND COMPUTATIONAL MODEL REPOSITORY urn:miriam:obo.go:GO:0008543
Finds the word index doc and BIOMD0000000201_mod-2.pdf but not the FOAF marked file. Not
sure if FOAF just doesn't work or if it is a syntax problem and the xmlns:foaf tag isn't working right.
Biomodels style file (local) PDF version test, looking for BIOMD0000000201.pdf.
Google query (terms selected from "Table 3: Properties of each species.":
nuclear beta-catenin Notch protein Dusp6
Finds 29,700 hits
http://www.ebi.ac.uk/biomodels-main/BIOMD0000000201 (first non-"scholarly article" hit)
http://cytosolve.mit.edu/remote-model.php?model=../Models/Public/BioModels/BIOMD0000000201.xml
(2nd hit, page has a "species" table extracted from the SBML file)
http://antimony.sourceforge.net/examples/biomodels/BIOMD0000000201.txt (4th hit,
Jarnac type fie from the origianl SBML)
BIOMD0000000201_DBBR_notes.xml (7th hit)
BIOMD0000000201_DBBR_notes.docx (8th hit)
So, it doesn’t find the Biomodel in the Biomodel repository but does find a couple reformatted
versions (one in Jarnac and a page with the extracted table of included species).
A "light-weight" annotation block
Include URN to naming resources, include the name in the file being annotated, include any
available aliases and AKAs, include DBBR tag line
########## Distributed Biodata and Biomodel Resource Annotations ########################
##
In this file
Ontology Term URN
Ontology Term
## DBBR: Rasi
urn:miriam:interpro:IPR001806
Small GTPase superfamily
## DBBR: D
urn:miriam:interpro:IPR003351
Dishevelled protein
## DBBR: N
urn:miriam:interpro:IPR008297
Notch
## DBBR: ERKa
urn:miriam:interpro:IPR008349
ERK1/2 MAP kinase
## DBBR: Bp
urn:miriam:interpro:IPR013284
Beta-catenin
## DBBR: Fgf
urn:miriam:interpro:IPR017338
Fibroblast growth factor, 15/19/21
## DBBR: MF
urn:miriam:interpro:IPR017374
Fringe
## DBBR: Bp
urn:miriam:kegg.compound:C00562
Phosphoprotein
## DBBR:
urn:miriam:kegg.pathway:ko04330
Notch signaling pathway
## DBBR: Rasa
urn:miriam:obo.chebi:CHEBI%3A15996
GTP
## DBBR: Rasi
urn:miriam:obo.chebi:CHEBI%3A17552
GDP
## DBBR:
urn:miriam:obo.chebi:CHEBI%3A33699
messenger RNA
## DBBR:
urn:miriam:obo.go:GO%3A0001756
somitogenesis
## DBBR: cytosol
urn:miriam:obo.go:GO%3A0005829
cytosol
Synonyms
Guanosine 5'-triphosphate
Guanosine 5'-diphosphate
formation of mesodermal clusters
Columns are tab delimited. Within a column individual entries are comma delimited. Multiword
entries are not quoted but imbedded commas need to be escaped. If a given entity or concept has
multiple urns then each gets its own line in the table?
Colum
n
Contents
1
2
File type specific comment
Label/tag for this entity in this file
3
Ontology terms as a urn
4
5
Ontology term official name
Pseudonym list
Style
Notes
e.g. ## DBBR:
text
urn
text
text list
Often end is a numeric, e.g.,
urn:miriam:obo.chebi:CHEBI%3A15996
single name
list of names
4/2/2012
Added three files as test classes for OWL ontologies:
http://cbo.biocomplexity.indiana.edu/svn/model_repo_v0/pizz_jps_embed_edited_xml_in_comment.owl
http://cbo.biocomplexity.indiana.edu/svn/model_repo_v0/pizz_jps_manchester.owl
http://cbo.biocomplexity.indiana.edu/svn/model_repo_v0/pizz_jps.owl
The pizz_jps_embed_edited_xml_in_comment.owl file has the entire xml file "de-xml-ed" then
embedded into a comment tag.
The server appears to be MIA, sent Alin an email about it. Fixed.
TO DO: Check if Google indexes any of these ontology files.
11/9/2012
Added an annotated CC3D Python script:
angio_growth_steppables_08052009_01_45_36__CBOannotated.py
# CBO CBO_Object SystemQuality ComputationalPlatform Compucell3D
# CBO CBO_Process CellProcess CellGrowth
class VolumeParamSteppable(SteppablePy):
...
# CBO CBO_Process CellProcess CellDeath Necrosis
# GO:0008219 cell death
#Necrotic Cells
if cell.type == 3:
#set growth rate equation
cell.targetVolume-=0.5
cell.lambdaSurface=0
Download