navigator - SRI International

advertisement
The Pathway/Genome Navigator
(These slides are a guide as you
experiment with the Navigator)
SRI International
Bioinformatics
Pathway/Genome Navigator – topics
 Getting
started with the Navigator
 Organism pages
 Queries in general
 Object displays and queries
 Miscellaneous commands
 User preferences
 Lab exercises
Introduction
 Navigator
SRI International
Bioinformatics
runs both on the desktop and on the
web
 The
desktop version generally has more
capabilities, but each has unique features
Desktop Mode -- Introduction
SRI International
Bioinformatics
Desktop Layout
 One
Large Window
 Several
Panes:
 Display pane
 Command menu
 LISP listener
Desktop Menus
 Main
SRI International
Bioinformatics
command menu
 Single-choice
menu
 Multiple-choice
 Aborting
menu (e.g. after a search)
out of menus
 Click Cancel or No Select
 Click outside the menu
 Type ^z
Using the Mouse
SRI International
Bioinformatics
 Left
mouse button: to invoke specific commands
and for hypertext navigation
 Right
mouse button: to bring up menus of
additional operations (for example, when editing a
frame)
 Middle
mouse button: for very specialized uses
(you probably won’t use it)
 Mouse
documentation line (shows what you’re
over, what you can do)
Organism Pages
 All
Organisms Page
 Organism grouping
 Summary of organisms
 Single
organism page
 Summary of organism stats
 Notion
of current organism
 Command mode queries
 Comparative analyses
 Clicking through links – organism continuity
SRI International
Bioinformatics
SRI International
Bioinformatics
Queries with multiple answers
 Results
in form of a menu to:
 select one
 some
 all
 Answer
List
 Next Answer
Indirect Queries
 Related
objects
 Objects
are clickable
 Objects
are color-coded by type
SRI International
Bioinformatics
History List
 Backward
 Forward
 Select
history
history
from history
SRI International
Bioinformatics
SRI International
Bioinformatics
Writing Complex Queries
 Web
Search  Advanced
 Write
queries in LISP
 Must understand features of schema


class names
slot names
Pathway tools site has example searches
 Definitely learnable
 Can place results on the answer list

Using the BioCyc Web Site
and Other
Pathway Tools Based Web Sites
BioCyc Searches
SRI International
Bioinformatics

Multiple searches available for finding information in different
ways
The easiest searches to use are fairly coarse

Start by selecting database to search

Simplest search: Quick Search
 At upper right of most pages

Search Menu
 Object-specific searches
 Advanced search
 Ontologies search
 Google search
 BLAST search
 Search of full-text articles (EcoCyc only)

Database Selector
SRI International
Bioinformatics
 Click
on word “change” under Search menu or
under Quick Search button
 In
resulting selector, choose a PGDB
 Start typing a word in organism name
 Click on letter to navigate to organisms starting with that
letter
 Click a frequently used PGDB
 All
subsequent searches will apply to that
database
Quick Search




SRI International
Bioinformatics
Enter a search string in quick search box
Gene Search button searches gene names and synonyms
Quick Search button searches names and synonyms of
 Genes
 Proteins
 RNAs
 GO terms
 Compounds
 Reactions
 Pathways
 Transcription Units
Result page shows all matching objects, organized by
object type
Object Specific Searches
SRI International
Bioinformatics
 Allow
you to create detailed searches against
single types of objects
 Specify
multiple constraints on the object you
want to find
 Click
on triangles at the left to expand or hide
filters
 Note
that if a filter is hidden it will not be used in a
search
SRI International
Bioinformatics
BioCyc Web Site: Object Page Types

Genes + Gene Products
 Enzymes, transporters, transcription factors, others
 Extensive comments and citations
 Monomers and multimers are represented, and in different chemical
states
Reactions
Pathways
Compounds

Searches are organized around these page types



Textpresso Search
 EcoCyc
SRI International
Bioinformatics
only
 Searches
full text of a large corpus of articles
(2010: 24,000 articles)
 Keyword
 Scope
and/or ontology-based searches
is sentence by default
 Results
presented at bottom of page
Textpresso Search
 Keyword
search examples:
 tryptophan yanofsky -- keyword and
 tryptophan,yanofsky -- keyword or
 attenuation –tryptophan – exclude tryptophan
 “excess tryptophan” -- phrase
 Ontology
search examples:
SRI International
Bioinformatics
Additional Searches
SRI International
Bioinformatics
 Advanced
search allows user to formulate
detailed multi-object searches
 Ontology
searches allow the user to drill down
through a classification system to find
information of interest
 Google
search performs search across all web
pages within this site for words of interest
 BLAST
search performs sequence search against
a single specified genome
Web Account System
 Page
formatting preferences
 Omics
viewer preferences
 Comparative
 Receive
analysis preferences
email updates on new releases
SRI International
Bioinformatics
Overview of Object Displays
SRI International
Bioinformatics
Shared Display Characteristics
 Gene-Reaction
schematic
 Citations
and comments
 Database
Links
 Classes
SRI International
Bioinformatics
Gene-Reaction Schematic
 Drawn
in reaction, protein, and gene windows
 Representations
(ArgB)
 Genes are boxes on the right
 Proteins are circles in the middle – numbers show complexes
 Reactions in box on left, with E.C. number if available
 Allows navigation between genes, proteins, rxns
 Links proteins with shared reactions
 ArgD
 Links members of protein complexes
 Pol III – extreme example
SRI International
Bioinformatics
Citations and Comments
 Citations
 Click
in mnemonic form
on citation – go to citations at bottom of
page
 Click
there, go to PubMed ref, if available
Database Links
 Unification
SRI International
Bioinformatics
links (info about the protein
elsewhere)
 PDB
 PIR
 RefSeq
 UniProt
 Gene page: For coli, we added links to coli-specific sites
 Relationship
links:
 PDB-Homolog-P34554
Class Hierarchies
SRI International
Bioinformatics
 Reactions

Enzyme-nomenclature system (full EC system in MetaCyc
only)
 Proteins
Gene Ontology terms are assigned to proteins
 Can also assign MultiFun terms

 Compounds
 Pathways
Menu Categories
 Pathway
 Reaction
 Protein
 (RNA)
 Gene
 Compound
SRI International
Bioinformatics
SRI International
Bioinformatics
Pathway Mode Commands
 Search
by pathway name
 Search
by substring
 Search
by class
 Search
by substrates (can pick role in pathway)
SRI International
Bioinformatics
What’s in a pathway frame?

Go to arginine biosynthesis I (from ArgD)

Intermediates and reactions

Can toggle level of detail

Feedback regulation can be shown

Locations of mapped genes

Genetic regulation schematic

Note presence of comments, citations, class hierarchy
SRI International
Bioinformatics
Reaction Mode Commands
 Search
by reaction name
 Search
by E.C. #
 Search
by class (another E.C. interface)
 Search
by pathway
 Search
by substrates
SRI International
Bioinformatics
What’s in a reaction frame?
 Search
by EC for 2.6.1.11 (pick one)
 Picture
of reaction with clickable compounds
 Pathways
 Place
the reaction is involved in
in class hierarchy
 Enzymes
carrying out reaction (note schematic)
SRI International
Bioinformatics
Protein Mode Commands
 Search
by protein name
 Search by substring
 Search by pathway
 Search by organism (MetaCyc)
 Search by UniProt Acc
 Search by GO term
 Search by MultiFun term
 Search by Weight, pI
 Search by modulation of activity
SRI International
Bioinformatics
What’s in a protein frame?
 Sample
frame (ArgD)
 Synonyms,
general features, comments
 Unification
links, gene-reaction schematic
 GO
terms
reaction frames – how this protein
carries out that reaction (bridging the two)
 Enzymatic
 Evidence
codes
Gene Mode Commands
SRI International
Bioinformatics
 Search
by gene name (can also put in TU IDs)
 Search
by substring
 Get
gene by class
 Basically
the same for RNAs
SRI International
Bioinformatics
What’s in a gene frame?
 Sample
frame (argC)
 Synonyms,
classification (GO), link to browser
 Unification
links, gene-reaction schematic
 Regulation
schematic
 Gene
local context and TUs
What’s in a TU frame?
 Sample
 Genes
frame (argCBH)
in context, with TFs
 Promoter
 TF
SRI International
Bioinformatics
with start site and citations
binding sites, with citations
 Regulatory
interactions (ilvL attenuator in TU524)
SRI International
Bioinformatics
Compound Mode Commands
 Search
compound by name
 Search
compound by substring
 Search
by SMILES (structure)
 Search
by class
 Advanced
search
The SMILES Language
 Simplified
SRI International
Bioinformatics
Molecular Input Line Entry System
 Formal language for describing chemical
structures
 Used within the Pathway Tools in a substructure
search
 Case is significant (lowercase for aromatic rings)
 Examples:
formate C(=O)O
malate OC(=O)CC(O)C(O)=O
 For more information, see the Help facility
SRI International
Bioinformatics
What’s in a compound frame?
 Sample
(N-acetylglutamyl-phosphate)
 Synonyms,
 Structure
 SMILES
empirical formula, MW, links
(you can add these in editors)
code
 Pathways
and reactions involving this compound
SRI International
Bioinformatics
Miscellaneous Commands
 History
commands
 Answer-List commands
 Clone window command
 Fix window and unfix window commands
 Other commands:
 Print to file (makes a postscript)
 Help
 Preferences
 Exit
User Preferences
 Color
 Layout
 Compound
window
 Reaction window
 Pathway window
 History/Answer list
 Reverting and saving user preferences
SRI International
Bioinformatics
Lab Exercises – Desktop
Version
Lab Exercises
 Set
SRI International
Bioinformatics
up personal preferences for:
Color
Layout (set number of windows to 2)
 Save new preferences
 Play with settings for Compound, Reaction,
Pathway, and Overview windows.
 Choose settings for History/Answer List
preferences
Desktop Lab Exercises
 Retrieve
SRI International
Bioinformatics
compounds containing a formate group
 Retrieve compounds adenine and uracil using
class query
 Retrieve reaction with EC# 5.3.1.9
 Retrieve all reactions in the class
sulfurtransferases
 Retrieve all reactions involved in proline
biosynthesis
 Retrieve all reactions where glutamate appears on
left side
 genes coding for enzymes involved in the
degradation of short-chain fatty acids
Desktop Lab Exercises
 Retrieve
SRI International
Bioinformatics
all enzymes involved in purine
biosynthesis
 Retrieve all kinases
 Display region spanning from 10 % - 20 % of E.
coli chromosome
 Display chromosomal region around gene aroA
 Display a map showing all chaperone genes
Desktop Lab Exercises
 Retrieve
SRI International
Bioinformatics
all chaperone genes
 Retrieve gene aroA
 Find the glutamine biosynthesis pathway by
issuing each of the three types of queries in
Pathway mode.
Desktop Lab Exercises
 Clone
SRI International
Bioinformatics
window
 Navigate in the cloned window
 Set preferences so Navigator displays 2 windows
 Navigate by clicking on live objects
 Fix Window
 Navigate in unfixed window
 Fix second window and then click on live object
Web Version Lab Exercises
Web Lab Exercises
 In
SRI International
Bioinformatics
EcoCyc:
 Retrieve
gene aroA
 Retrieve all kinases
 Retrieve all chaperone genes using two different
approaches
 Retrieve all membrane proteins that cite an article
by Gross
 Retrieve all genes whose product interacts with
ppGpp as a regulator or cofactor
Web Lab Exercises
 In
SRI International
Bioinformatics
MetaCyc:
 Retrieve
reaction with EC# 5.3.1.9
 Retrieve compounds adenine and uracil using
ontology search
 Retrieve all reactions involving glutamate as a
substrate
 Find genes coding for enzymes involved in the
degradation of short-chain fatty acids
 Find all pathways involved in degradation of
arginine
Download