Data Normalization Milestones - Hongfang Liu, Ph.D.

advertisement
Data Normalization
Milestones
Data Normalization
 Goals
– To conduct the science for realizing semantic
interoperability and integration of diverse data
sources
– To develop tools and resources enabling the
generation of normalized EMR data for
secondary uses
Data Normalization

Target
Value Sets
Information
Models
Normalization
Targets
Tooling
Raw EMR
Data
Normalized
EMR Data
Normalization
Process
SHARPn Data
Normalization Team
PIs: Dr. Chris Chute; Dr Stan Huff
Program Management Team
– Manager: Lacey Hart; Assistant: Erin Martin
Members
Hongfang Liu (Lead)
Calvin Beebe (co-Lead)
Tom Oniki (co-Lead; Modeling Lead)
Harold Solbrig (CTS2 Lead)
Vinod Kaggal (Implementation Lead)
Kyle Marchant (CEM DB Lead)
Joey Coyle (Modeling)
Teresa Conway (Modeling)
Nathan David (Modeling)
Edinardo Potich (Modeling)
Cyndalynn Tilley (Modeling)
Les Westberg (CEM DB)
Kevin Peterson (CTS2)
Siddhartha Jonnalagadda
(Pipeline)
Dingcheng Li (Pipeline)
Modeling -Accomplishments
 CORE and Secondary Use
Models/terminology patient, meds, labs,
administrative dx and procedure,
disease/disorder
 Definition of value sets in terms of
standards where possible
Modeling - Accomplishments
Request site (in pilot)
CEM Browser
Modeling - Future work
Request site full launch
Browser launch
Browser enhancements (terminology
integration, knowledge repository
integration)
Better documentation
Model enhancements
Tooling – explore collaborations
Value Sets- Accomplishments
 SHARP Value Sets published as CTS2
Resolved Value Sets
– ICD-9
– RxNorm
– LOINC
– ECIS
– SNOMED-CT
Value Sets- Accomplishments
 CTS2 Services Created For
– SNOMED-CT Simple Refsets and Dynamic
Value Set Definition
– ECIS Value Sets and Maps
 CTS2 Participation in CIMI effort
Value Sets - Future work
 CTS2 Service for RxNorm and UMLS
access
– Allow creation of new value sets
– Links to concept descriptions and
relationships
 Integrate BioPortal Value Set Services
– Bridge Bioinformatics Ontology / Clinical
Terminology
Value Sets – Future Work
 Complete ECIS Server
– Will aid in CEM to ADL migration
 CTS2 Services available to CIMI project
 Integration
– Create Pipeline specific services coupled directly to
CTS2
– Tighter coupling of Concept Domains and Value Sets
– Ajax Widgets for viewing, integrating and Authoring
Pipeline - Accomplishments
Architecture revision to make pipeline
highly configurable
– Enable seamlessly pipeline integration of
components from different data sources (e.g.,
NLP)
Generic pipeline components
– XML Initializer
Pipeline - Accomplishments
Highly configurable syntactic and semantic
mappings
– Allow four kinds of syntactic mapping –
constant, 1:1, 1:n, and inference
Two options to process data
– XML tree-based (achieve semantic mapping
through UIMA annotations)
– Template based (achieve translation directly
in CEM annotations)
Pipeline – Future work
 Integrate NLP normalization pipeline into
data normalization
 GUIs to simplify the configurations of the
pipeline
– syntactic mapping
– Semantic mapping
Integrate the two options provided to
process data into a single project
CEM to DB - Accomplishments
Completed the following CEM to DB Mirth
Channel Development
– Patient CEM Channel (Add and Update)
– NotedDrug CEM Channel (Add)
– AdministrativeDiagnosis CEM Channel (Add)
– Lab (Quantitative, Narrative, Ordinal, and
Titer) CEM Channel (Add)
CEM DB - Accomplishments
Completed CEM DB Design and
Development
– Patient CEM Tables
 Demographics including Names, Addresses,
Telecom, and Language
 Patient Cross Reference and External IDs
– Clinical CEM Tables
 Index Data
 Source Data
 Patient Data
HL7 to XDR Channel Accomplishments
Completed the following HL7 to XDR
channels for data exchange
– HL7 AdminDX to XDR Channel (Sender)
– HL7 Lab to XDR Channel (Sender)
– HL7 Noted Drug to XDR Channel (Sender)
– HL7 Message Separator (Sender)
– HL7 Receive Message Separator (Receiver)
– ReceiveXDR and Extract HL7 (Receiver)
Next Steps for Channels and CEM
DB
Next Steps
– Complete the Update feature for Clinical CEM
Channels
– Enhance the Error Handling and resending of
error Message for channels
– Additional Relational field data storage for
Clinical CEM Messages
– Support CEM changes and new data types
In summary
 Continue effort in tooling for
– Modeling (through CEMBrowser and Request
Site)
– Value sets (through CTS2)
– Pipeline set up (GUIs for configuration and
tools for generating semantics mapping)
– CEM XML -> CEM DB, CEM RDF, etc
Download