LeadMarker™ – an integrated suite for managing chemical and

advertisement
LeadMarker™ – an integrated
suite for managing chemical and
biological information
Tim Dudgeon
Informatics Matters Ltd
Oxford, UK
Bob Bamford
Managed Ventures LLC
California, USA
Chris Palmer
Kcc Tech Pty Ltd
Queensland, Australia
About LeadMarker™

A Modular suite of apps based on Java
Enterprise Edition (J2EE)




Management of core chemical and
biological data for Life Science companies
Allow deployment of different
configurations to different customers
Built around the objects you deal with and
the workflows you operate with
Designed around high performance and
scaleability
LeadMarker™ Features

chemMarker




sampleMarker



Chemical registration
Chemical searching
Manages compounds, batches, libraries
Plate and/or vial sample management
Liquid handling and transformation operations
assayMarker



Manages biological results
Simplified data acquistion from instruments
Data processing to generate biological results



%INH
IC50
Tabular and graphical reporting
LeadMarker Features (2)

structShare



a chemical spreadsheet,
including its SDLoader plug-in
that facilitates a simple
mechanism for library
registration into chemMarker.
structShare has the ability to
handle over 100,000
structures simultaneously
runLoader


A tool to read raw instrument
data files, associate biological
data with samples, and store
the data in assayMarker.
runLoader requires valid
containers, registered via
sampleMarker, to associate the
data.
Goals for LeadMaker

Increase software flexibility by


A single code base from which we can push out
multiple “branded versions,” each with its own
configuration
A working system that we can support



Not a CD in an envelope
Integration hotspots allowing extension and
customisation
Make the scientist's life easy by


LeadMarker™ should correspond to real world objects
Should support scientific workflows


Adaptable
Configurable
Technologies employed
LeadMarker™ architecture
Web apps
Marvin
Java Web Start apps
LeadMarker™ Web application
chemMarker
module
sampleMarker
module
structShare
runLoader
assayMarker
module
SDLoader
externalQuery
overlapAnalyser
JBoss
Service tier
chemService
sampleService
assayService
accountService
Domain tier
cpds
batches
libraries
suppliers
samples
plates
assays
runs
targets
results
JChem
Standardizer
Database tier
chemistry schema
sample schema
assay schema
admin schema
Oracle 9i
Oracle 10g
Key benefits of ChemAxon to
LeadMarker

Single set of Java tools

Designed to work in harmony



Flexibility





All tools have been developed by ChemAxon since
their inception
No mergers of other party’s code
Database vendor independence
Molecule format neutral
Configurable components e.g. standardizer
Performance
Support & documentation
Marvin Applets in web applications


Use for display and editing structures
Structures


Sketching of structures



before and after standardization
for registration
for searching
onMouseOver previews
Use of JChem in LeadMarker

JChem databases for compound and batch
structures

Choose not to use cartridge





Allow other dbms vendors
Maximise performance
Keep cost to a minimum
JChem Base provides sufficient functionality
Two stage standardization

Provides highly flexible definition of chemical
business rules
Salts, stereochemistry and mixtures

Typically this is the biggest headache for a chemical
registration system

Problem with standard “salt stripping” approaches




Problems with stereochemistry




Loss of information and/or data in multiple places
Typically need to define salts up front
Difficult to change your view of what is a salt and what
isn’t
MDL chiral flag
Absolute and relative stereochemistry
Uncertainty
Symptoms


Changes in rules or data means (biological) data has to be
moved around
Database administrators and technical staff are kept busy
fixing errors
LeadMarker™ approach to salts
and mixtures

Register what you have, salts and all




Salts do not get stripped
Compounds that are composite are broken
into their parts and their components also
registered
Parent child relationships within composite
compounds are maintained
A user can flag any component as a salt and
name it

No data needs to be relocated
Batches, compounds, solvents
and salts
New Batch
Batch
Original structure
Batch
standardizer
Standardized structure
Compound
Original structure
Standardized structure
Fragmentation
Compound
standardizer
Typical standardizer configs
Batch standardizer – remove common solvents
<StandardizerConfiguration Version ="0.1">
<Actions>
<Reaction ID="Water" Structure="O>>" Exact="true"/>
<Reaction ID="DMSO" Structure="CS(C)=O>>" Exact="true"/>
<Reaction ID="Methanol" Structure="CO>>" Exact="true"/>
<Reaction ID="Acetonitrile" Structure="CC#N>>" Exact="true"/>
</Actions>
</StandardizerConfiguration>
Compound standardizer – common representation
<StandardizerConfiguration Version ="0.1">
<Actions>
<Aromatize/>
<Reaction ID="PlusMinus" Structure="[*+:1][*-:2]>>[*:1]=[*:2]"/>
<Reaction ID="PlusMinusDouble" Structure="[*+:1]=[*-:2]>>[*:1]#[*:2]"/>
<Reaction ID="nitro" Structure="[O-:2][N+:1]=O>>[O:2]=[N:1]=O"/>
<Reaction ID="azide" Structure="N=[N:1]#[N:2]>>N=[N+:1]=[N-:2]"/>
<Reaction ID="enamine" Structure="[H:4][N:3][C:1]=[C:2]>>[H:4][C:2][C:1]=[N:3]"/>
<Reaction ID="enol" Structure="[H:4][O:3][C:1]=[C:2]>>[H:4][C:2][C:1]=[O:3]"/>
<Reaction ID="ammoniumhalide" Structure="[F,Cl,Br,I;-:3].[H:2][N+:1][#6]>>[F,Cl,Br,I:3][H:2].[N:1][#6]"/>
<Reaction ID="sulphate" Structure="[C,c:1][S:2](=[O:3])(=[O:4])[O;H1:5]>>[C,c:1][S:2](=[O:3])(=[O:4])[O-:5]"/>
<Reaction ID="carboxylate" Structure="[C,c:1][C:2](=[O:3])[O;H1:4]>>[C,c:1][C:2](=[O:3])[O-:4]"/>
<ImplH ID="implh" Charged="true" Radical="true" Mapped="true"/>
<Sgroups ID="ungroup" Act="Ungroup"/>
<Clean ID="cleanIfNeeded"/>
</Actions>
</StandardizerConfiguration>
Salts example
ID
1
ID
2
3
4
Name
LM-000001
Name
LM-000003, HCl
LM-000003
HCl
Parent
1
Type
CPD
Parent
2
2
2
TYPE
CPD
CPD
SALT
B
B
chemMarker in action

Demonstration….
structShare – chemical spreadsheet

Alternative to form builders and office
spreadsheets



Neither of these are good solutions
Office software limits performance and scaleability
Installation and support involves



structShare designed to address this





Office suite licensing
Platform specific libraries (windows DLLs)
Large datasets: > 100,000 structures managed easily
Rapid sorting and filtering
Workbench for analysing and processing data
Easy deployment through Java Web Start
Provides workbench for LeadMarker functionality


High volume data input or output
Ability to persist files locally and collaborate informally
structShare features

Embedded SQL database






Fast sorting and filtering of large datasets
Manage relational data
Future use of JChem searching
Memory only or disk based modes
SDfile and text file import and export
Plug-in architecture and API allows
extension

sdLoader for registering libraries into

OverlapAnayzer for identifying structural

LeadMarker™
overlap between your sample structures and
what is already registered in LeadMarker™
ExternalQuery plug-in for querying
LeadMarker™ and other SQL databases (under
development)
Use of ChemAxon tools within
structShare

Marvin for molecule rendering




MViewPane used as TableCellRenderer
MarvinSketch as molecule editor
MolImporter/MolExporter for
reading/writing SDF
JChem in ovelap analysis - classify
according to




exact match
match to unsalted form
no match
Future



use of JChem to allow structure filtering
integration of Chemical Terms
integration of Reactor for library enumeration
structShare in action

Demonstration….
LeadMarker™ Team


LeadMarker™ has been developed by a
global team that provides regional
customisation and expertise
Organisation

LeadMarker™ Licensor


North America & Japan – Managed Ventures LLC
Regional Agents


Europe – Informatics Matters Ltd
Australasia – KCC Tech Pty Ltd
Consortium History




Managed Ventures Incorporated in 1999
Initially focused on custom informatics
consulting
 Syrrx, Kalypsys, Triad, Anadys, Salk
Institute, Merck, Medsn
 Genomics Institute of Novartis
 Eli Lilly and Company
European expansion with Informatics
Matters collaboration
Recent expansion into Australasia with Kcc
Tech agency
Several custom chemical registration and screening software
implementations led to the development of the LeadMarker™
integrated suite
►To be formally released at World Pharmaceutical Congress in Philadelphia
Contributors - Contacts
●
Informatics Matters
Yew Tree Farm, High Street
Charlton on Otmoor
Kidlington, Oxon OX5 2UQ
●
Managed Ventures
6 Venture, Suite 100
Irvine, CA 92618 USA
–
Tim Dudgeon
–
Jack Crawford
–
Jon North
–
Bob Bamford
–
Petr Hamernik
–
Nadeem Bitar
–
Svata Dedic
–
Bob Thomas
–
Milos Kleint
–
Chris Novak
–
Matt Houser
–
Emma Davila
–
Brad Ziola
–
Arthur Clements
●
KCC Tech Pty
2 Alpha Avenue
Currumbin 4223
Queensland, Australia
–
Chris Palmer
Download