TGB thesis outline

advertisement
1
Inadequate requirements to analytical keys may
compromise standardised analysis of SEND data – with
focus on laboratory data (LB)
- Why SEND++ should be considered
Master’s thesis
Thomas Gade Bjerregaard
(MI07121)
2
Background – the current data flow to FDA
Local analysis
& statistics
GLP
Plan &
collect data
GLP
Report data
and analysis
FDA review
3
Background – the new defined data flow to FDA
Local analysis
& statistics
GLP
Report data
and analysis
FDA review
GLP
Plan &
collect data
Exchange
format
FDA Analysis
4
Background – the new defined data flow in
generic terms and converted to eData use cases
Local analysis
& statistics
GLP
Report data
and analysis
GLP
Plan &
collect data
Exchange
data
Global
analysis
5
Objective
• Investigate and substantiate scenarios where inadequate
requirements may compromise standardised analysis
• Scope restriction: Laboratory data
 Investigate analytical prerequisites for SEND eData using
laboratory data as example
6
Thesis outline
Presentation outline
• Two core concepts
• Core concept
• Analytical keys
• Laboratory data
• Three Use cases
• GLP
• Exchange
• Analysis
• Two conflicts
• When
• What & How
• Analytical keys
• Two conflicts
• What & How
• Core concept/GLP
• Exchange
• Basic analysis conflict
• When
• GLP
• Exchange
8
Core concept: Analytical keys outline
appropriate/inappropriate comparisons - and caveats
• When
• Planned timing
• Who
• Within study analysis
• Study design ~ treatment groups
• What
• The endpoint under
investigation
• How
• The method of assessment
• Between study analysis
• Animal factors
• Species, strain, age, sex, etc.
• Study factors
• Target, drug, laboratory, husbandry,
etc.
9
Conflict on What & How:
- Characterisation of laboratory measurements
NCoA: Nested Context of an Analyte
Primary

category
Secondary
Category

Analyte

Sample matrix

Method
Analyte
-
-
Eryth
X
-
-
Endpoint
Haematology
Haemogram
Eryth
-
-
-
-
(What?)
X
Urinalysis
X-
X
Eryth
-
-
Eryth
Whole blood
Flow cytometry
-
-
X
Eryth
X
Urine
X
Dipstick
-
-
Eryth
Urine
Microscopy
Study specific
Haematology
Haemogram
Eryth
Whole blood
Flow cytometry
implementation
Urinalysis
X
X-
Eryth
X
Urine
X
Dipstick
X
of endpoint
Urinalysis
-
Eryth
Urine
Microscopy
Test (How?)
10
Conflict on What & How:
- Characterisation of laboratory measurements
Primary

category
Secondary
Category

Analyte

Sample matrix

Method
Analyte
-
-
X
Eryth
-
-
Endpoint
Haematology
Haemogram
Eryth
-
-
-
-
(What?)
X
Urinalysis
X-
X
Eryth
-
-
Eryth
Whole blood
Flow cytometry
-
-
Eryth
X
X
Urine
Dipstick
X
-
-
Eryth
Urine
Microscopy
Study specific
Haematology
Haemogram
Eryth
Whole blood
Flow cytometry
implementation
Urinalysis
X
X-
Eryth
X
Urine
X
Dipstick
X
of endpoint
Urinalysis
-
Eryth
Urine
Microscopy
Test (How?)
Too much information – without control
Scientific
umbrella
Primary
category

Secondary
Category

Analyte

Sample
matrix

Method
LIMS
governance
Presentation title
Impossible to specify analytical criteria
Date
12
The basic conflict
- detachment of information accounting for incompleteness and inconsistency
SDRG
SDRG
SDRG
SDRG
SDRG
SDRG
SDRG
Define
File
Define
File
Define
File
Define
File
Define
File
Define
File
Define
File
SEND
data
SEND
data
SEND
data
SEND
data
SEND
data
SEND
data
SEND
data
Presentation title
Compromised analytical performance
General confusion regarding the components of the NCoA
+
Excess information from the GLP Use Case relative to the 5 variables in SENDIG
+
3 (or 4) of 5 variables corresponding to NCoA are without controlled terminology
+
null values allowed in ‘Expected’ and ‘Permissible’ variables
=
compromised analytical performance in any multi-study environment
Date
14
15
For analysis, CATegory variables should be controlled
NCoA
Primary
Secondary
terminology
category
category
LOINC
terminology
SSIE defining
SEND variables
Option A
Option B
LBCAT
LBSCAT
New
New
NCI list
NCI list
New
New
NCI list
NCI list
Analyte name
Sample
matrix
Method
Component
System
Method
LBTESTCD
LBSPEC
LBMETHOD
LBTESTCD
SPEC
LBTESTCD
SPEC
‘test’
longname /
shortname
LBLOINC
New
NCI list
?
LOINC term
16
For analysis, LBLOINC can substitute control of LBMETHOD
NCoA
Primary
Secondary
terminology
category
category
LOINC
terminology
SSIE defining
SEND variables
Option A
Option B
LBCAT
LBSCAT
New
New
NCI list
NCI list
New
New
NCI list
NCI list
Analyte name
Sample
matrix
Method
Component
System
Method
LBTESTCD
LBSPEC
LBMETHOD
LBTESTCD
SPEC
LBTESTCD
SPEC
‘test’
longname /
shortname
LBLOINC
New
NCI list
?
LOINC term
17
Thesis outline
Presentation outline
• Two core concepts
• Core concept
• Analytical keys
• Laboratory data
• Three Use cases
• GLP
• Exchange
• Analysis
• Two conflicts
• When
• What & How
• Analytical keys
• Two conflicts &
• What & How
• Core concept/GLP
• Exchange
• Basic analysis conflict
• When
• GLP
• Exchange
18
When: ‘Global timing’ or ‘Local timing’
Reference time point
Session
Dosing
Start
of
study
0h
2h
4h
8h
24h
Local timing
0h
2h
4h
8h
24h
Local timing
D 28
D -3
Pre-treatment
Global timing
Treatment
19
The SEND timing variables
SENDIG time
planning
variable
Abbreviated label and CDISC notes from
SENDIG
VISITDY
Planned study day of assessment
LBTPT
Text label for sampling time
LBTPTNUM
Numeric version of LBTPT to aid in sorting
of records
LBELTM
Planned elapsed time from reference to a
time point
LBTPTREF
Text label for reference time point
LBRTPTNM
Numeric version of LBTPTREF to aid in
two layer sorting of records
Corresponding SEND variable for actual time
point
Terminology of Section 5.1.3
LBDY
Actual study day – directly collected, or
calculated from LBDTC and RFSTDTC in the
Demographics data set
LBDTC
The GLP date/time of verification of the
sample collection
N/A
LBRFTDTC
The GLP date/time of verification of the
reference time point
Scheduled date
Session label
N/A
Distance
Reference time point
N/A
LBDY
VISITDY
Record
N:1 synonym
LBDTC
LBTPT
1:1
assumption
LBELTM
relative
N:1 relation
Reference
record
LBTPTNUM
LBREFDTC
LBTPTREF
LBTPTRNM
Identical
1:1 synonym
1:1 synonym
--DTC
--TPT
N:1 synonym
synonym
N:1
--DY
1:1
assumption
Two timing
key sets
--TPTNUM
--ELTM
N:1 relation
--TPTREF
‘VISITDY’
ref-record
22
For analysis, only one key set should be allowed!
- VISITDY cannot be more granular – but session key set can account for grace days
No need for grace days
Absolute
VISITDY key set
Maximum one session per TEST per day
or
Need for grace days
Absolute
VISITDY key set
Relative
Session key set
Two or more sessions per TEST per day (session timed
relative to reference time point)
Relative
Session key set
N/A
24
A study consists of consecutive VISITDY intervals
1
24:00
8:00
2
16:00
24:00
8:00
3
16:00
24:00
8:00
4
16:00
24:00
8:00
5
16:00
24:00
8:00
6
16:00
24:00
8:00
7
16:00
24:00
8:00
8
16:00
24:00
8:00
16:00
24:00
25
VISITDY = 1
--TPT = ‘6 hours post dose’
--TPTNUM = 1
-example
--ELTM
= ‘P6H’ with timed
--TPTREF = ‘Day 1 Dose’
--TPTRNM = 1
VISITDY = 1
Planning relative
to dosing do not adhere to VISITDY
--TPT = ‘12 hours post dose’
1
24:00
8:00
2
24:00
16:00
8:00
session
dosing with three days intervals
--TPTNUM after
=2
--ELTM = ‘P12H’
--TPTREF = ‘Day 1 Dose’
--TPTRNM = 1
3
16:00
24:00
8:00
16:00
24:00
VISITDY = 2
--TPT = ‘24 hours post dose’
--TPTNUM = 3
--ELTM = ‘P24H’
--TPTREF = ‘Day 1 Dose’
--TPTRNM = 1
4
8:00
5
16:00
24:00
8:00
16:00
24:00
8:00
6
7
Dose 2
Dose 1
VISITDY = 4
--TPT = ‘72 hours post dose’
--TPTNUM = 4
--ELTM
24:00
8:00
16:00
24:00 = ‘P72H’
8:00
16:00
--TPTREF = ‘Day 1 Dose’
--TPTRNM = 1
8
16:00
P6H
P6H
P12H
P12H
P24H
P24H
P72H
VISITDY = 4
--TPT = ‘6 hours post dose’
--TPTNUM = 1
--ELTM = ‘P6H’
--TPTREF = ‘Day 4 Dose’
--TPTRNM = 2
P72H
VISITDY = 4
--TPT = ‘12 hours post dose’
--TPTNUM = 2
--ELTM = ‘P12H’
--TPTREF = ‘Day 4 Dose’
--TPTRNM = 2
VISITDY = 5
--TPT = ‘24 hours post dose’
--TPTNUM = 3
--ELTM = ‘P24H’
--TPTREF = ‘Day 4 Dose’
--TPTRNM = 2
VISITDY = 7
--TPT = ‘72 hours post dose’
--TPTNUM = 4
--ELTM = ‘P72H’
--TPTREF = ‘Day 4 Dose’
--TPTRNM = 2
24:00
26
Dose-independent sessions are planned to occur on a specific day
- example with morning/afternoon session on all days
1
24:00
8:00
2
16:00
Day 1 sessions
24:00
8:00
3
16:00
Day 2 sessions
24:00
8:00
4
16:00
Day 3 sessions
24:00
8:00
5
16:00
Day 4 sessions
24:00
8:00
6
16:00
Day 5 sessions
24:00
8:00
7
16:00
Day 6 sessions
24:00
8:00
8
16:00
Day 7 sessions
24:00
8:00
16:00
Day 8 sessions
24:00
27
Suggested convention for --ELTM with semantic overlap to VISITDY
- based om pseudo-intervals for VISITDY
Shift in visual
representation
1
2
3
VISITDY = 1
VISITDY = 2
VISITDY = 3
Day 1
00:00  24:00
Day 2
00:00  24:00
Day 3
00:00  24:00
28
Suggested convention for --ELTM with semantic overlap to VISITDY
- based om pseudo-intervals for VISITDY
VISITDY = -1
VISITDY = 1
VISITDY = 2
VISITDY = 3
Day -1
00:00  24:00
Day 1
00:00  24:00
Day 2
00:00  24:00
Day 3
00:00  24:00
Pseudo-interval
specifications for VISITDY
(Hours)
--STINT = -P24H
--ENINT = (-)P0H
--STINT = P0H
--ENINT = P24H
--STINT = P24H
--ENINT = P48H
--STINT = P48H
--ENINT = P72H
Pseudo-interval
specifications for VISITDY
(Days)
--STINT = -P1D
--ENINT = (-)P0D
--STINT = P0D
--ENINT = P1D
--STINT = P1D
--ENINT = P2D
--STINT = P2D
--ENINT = P3D
Suggested convention
for --ELTM
--ELTM = -P1D
--ELTM = P1D
--ELTM = P2D
--ELTM = P3D
Time zero
29
Suggested convention for --ELTM with semantic overlap to VISITDY
- based om pseudo-intervals for VISITDY
VISITDY = -1
VISITDY = 1
VISITDY = 2
VISITDY = 3
Day -1
00:00  24:00
Day 1
00:00  24:00
Day 2
00:00  24:00
Day 3
00:00  24:00
Pseudo-interval
specifications for VISITDY
(Hours)
--STINT = -P24H
--ENINT = (-)P0H
--STINT = P0H
--ENINT = P24H
--STINT = P24H
--ENINT = P48H
--STINT = P48H
--ENINT = P72H
Pseudo-interval
specifications for VISITDY
(Days)
--STINT = -P1D
--ENINT = (-)P0D
--STINT = P0D
--ENINT = P1D
--STINT = P1D
--ENINT = P2D
--STINT = P2D
--ENINT = P3D
Suggested convention
for –ELTM
(non- ISO8601)
--ELTM =
-P1D/P0D
--ELTM =
P0D/P1D
--ELTM =
P1D/P2D
--ELTM =
P2D/P3D
Suggested convention
for --ELTM
--ELTM = -P1D
--ELTM = P1D
--ELTM = P2D
--ELTM = P3D
Time zero
30
Suggested convention for --ELTM accounting for grace days
- based om pseudo-intervals for VISITDY
VISITDY = -1
VISITDY = 1
VISITDY = 2
VISITDY = 29
VISITDY = 30
Day -1
00:00  24:00
Day 1
00:00  24:00
Day 2
00:00  24:00
Day 29
00:00  24:00
Day 30
00:00  24:00
Pseudo-interval
specifications for VISITDY
(Hours)
--STINT = -P24H
--ENINT = (-)P0H
--STINT = P0H
--ENINT = P24H
--STINT = P24H
--ENINT = P48H
Pseudo-interval
specifications for VISITDY
(Days)
--STINT = -P1D
--ENINT = (-)P0D
--STINT = P0D
--ENINT = P1D
--STINT = P1D
--ENINT = P2D
--STINT = P28D
--ENINT = P30D
Suggested convention
for --ELTM
--ELTM = -P1D
--ELTM = P1D
--ELTM = P2D
--ELTM = P5W
Time zero
31
VISITDY = 1
--TPT = ‘Day 1 morning’
--TPTNUM = 1.1
- example with morning/afternoon
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
Dose-independent sessions are planned relative to study start
1
24:00
8:00
2
16:00
session on selected days
3
VISITDY = 1
--TPT = ‘Day 1 afternoon’
--TPTNUM = 1.2
24:00
8:00
16:00
24:00
8:00
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
16:00
4
24:00
8:00
5
16:00
24:00
8:00
6
16:00
24:00
8:00
7
16:00
24:00
8
VISITDY = 6
24:00
8:00
16:00
8:00 = 16:00
--TPT
‘Day 6
morning’
--TPTNUM = 6.1
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
24:00
Start of Study
Day 1 sessions
Day 6 sessions
P1D
P6D
VISITDY = 6
--TPT = ‘Day 6 afternoon’
--TPTNUM = 6.2
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
32
VISITDY = 1
--TPT = ‘Day 1 morning’
--TPTNUM = 1.1
- example with morning/afternoon
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
Dose-independent sessions are planned relative to study start
1
24:00
8:00
2
16:00
session on selected days
3
VISITDY = 1
--TPT = ‘Day 1 afternoon’
--TPTNUM = 1.2
24:00
8:00
16:00
24:00
8:00
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
16:00
4
24:00
8:00
5
16:00
24:00
8:00
6
16:00
24:00
8:00
7
16:00
24:00
8
VISITDY = 6
24:00
8:00
16:00
8:00 = 16:00
--TPT
‘Day 6
morning’
--TPTNUM = 6.1
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
24:00
Start of Study
Day 1 sessions
Day 6 sessions
P1D
P6D
VISITDY = 6
--TPT = ‘Day 6 afternoon’
--TPTNUM = 6.2
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
33
VISITDY = 1
--TPT = ‘Day 1 morning’
--TPTNUM = 1.1
- example with morning/afternoon
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
Dose-independent sessions are planned relative to study start
1
24:00
8:00
2
16:00
session on selected days
3
VISITDY = 1
--TPT = ‘Day 1 afternoon’
--TPTNUM = 1.2
24:00
8:00
16:00
24:00
8:00
--ELTM = ‘P1D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
16:00
4
24:00
8:00
5
16:00
24:00
8:00
6
16:00
24:00
8:00
7
16:00
24:00
8
VISITDY = 6
24:00
8:00
16:00
8:00 = 16:00
--TPT
‘Day 6
morning’
--TPTNUM = 6.1
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
24:00
Start of Study
Day 1 sessions
Day 6 sessions
P1D
P6D
VISITDY = 6
--TPT = ‘Day 6 afternoon’
--TPTNUM = 6.2
--ELTM = ‘P6D’
--TPTREF = ‘Start of study’
--TPTRNM = 0
34
Dose-independent sessions are planned relative to study start
- example with grace days on ophthalmoscopy
…
1
24:00
8:00
16:00
24:00
22
8:00
16:00
23
24:00
8:00
16:00
24
24:00
8:00
16:00
25
24:00
8:00
16:00
26
24:00
8:00
16:00
27
24:00
8:00
16:00
28
24:00
8:00
16:00
Start of Study
VISITDY = 22
--TPT = ‘Week 4 ophthalmoscopy’
--TPTNUM = 22
--ELTM = ‘P4W’
P72H
--TPTREF = ‘Start of study’
--TPTRNM = 0
Week 4 ophthalmoscopy
P4W
24:00
35
Session key set describing all three planning paradigms
- Absolute day/week number should shift between TPT/TPTREF
…
1
24:00
8:00
16:00
24:00
22
8:00
16:00
23
24:00
24
VISITDY = 22
--TPT = ‘Day 22 afternoon’
--TPTNUM = 22.2
--ELTM = ‘P22D’
--TPTREF = ‘Start of study’
8:00
16:00
8:00
16:00
--TPTRNM
= 024:00
25
24:00
8:00
26
24:00
16:00
27
VISITDY = 26
--TPT = ‘24 hours post dose’
--TPTNUM = 3
--ELTM
= ‘P24H’
8:00
16:00
24:00
8:00
16:00
--TPTREF = ‘Day 25 Dose’
--TPTRNM = 9
28
24:00
8:00
16:00
Dose 9
Start of Study
P6H
P12H
Day 22
sessions
P24H
Week 4 ophthalmoscopy
P4W
VISITDY = 22
--TPT = ‘Week 4 ophthalmoscopy’
--TPTNUM = 22
--ELTM = ‘P4W’
P72H
--TPTREF = ‘Start of study’
--TPTRNM = 0
24:00
36
Findings summary
• What & How
• Expected perceived as optional
 Incompleteness, mixture of intended/unintended causes
• Not adequately standardised
 Inconsistency
• When
• Permissible perceived as optional
 Incompleteness, mixture of intended/unintended causes
• Complex and scenario dependent relations between variables in timing key sets and
incomplete variables landscape
 Inconsistency unless balanced approach is applied
Legacy
SEND-light
SEND+
SEND++
No eData – only paper/pdf
- eData
- as-is LIMS export or manual entry from paper/pdf report
-
eData
Standardised variable structure  SENDIG
Completeness  feasible LIMS export and post processing
Consistency  only formats and NCI code lists
Gap handling
Curation
- eData
- Standardised variable structure beyond SENDIG
- Completeness  all that can be derived
(i.e. also from report or protocol)
- Consistency  only formats and NCI code lists
Curation
- eData
- Standardised variable structure beyond SENDIG
- Completeness  all that can be derived
(i.e. also from report or protocol)
- Consistency  Full standardisation of analytical keys
Direct
Standardised
analysis based
on the correct
analytical keys
Presentation title
Date
38
Complicated stake holder landscape – no easy fix
- every body have their own interpretation of the standards
• Standard developing
organisations
• CDISC
• Controlled terminology
• National Cancer Institute
• Validation rules
• Independent – OpenCDISC
• FDA
• Regulatory agencies
• FDA
• PMDA?
• IT vendors
• LIMS
• Data repositories
• Data processing
•
•
Data management
Data analysis
• Discussion fora
• PhUSE
39
Thank you!
Presentation title
Backup slides
Date
40
Presentation title
Intention must be to mimic local analysis
Local analysis
& statistics
GLP
Report data
and analysis
GLP
Plan &
collect data
Exchange
data
Translations between
data base structures
Global
analysis
Date
41
Presentation title
Real GLP Use Case
Local analysis
& statistics
Collect
Plan
Report
Date
42
Presentation title
The shortcoming of the real GLP Use Case
- Completeness compromised
Local analysis
& statistics
GLP
Report
GLP
LIMS setup
and data
capture
Plan
Who
Analyte
(When)
(What)
(How)
Exchange
data
Global
analysis
When
What
How
Date
43
Curated
data repository
Automated data
repository
Pre-storage
gap handling
&
standardisation
No format
requirements
Data for
storage
Storage type
Standard
analysis
No format
requirements
Consistently standardised
and complete data sets
Curation
into DB
Automated
load to DB
Data repository (DB)
(including derived summary statistics data layers)
DB key based
analysis
Direct storage of data sets (as-is)
File repository
(DB)
File storage
Analysis based onKey
correct
based
keys in
consistently populated
analysis
data sets
Custom
analysis
Analytical
outcome
Standard
file share
Current level of SEND data sets
Source data
Storage process
Automated
file repository
Case by case
customised analysis
Automated/standardised visualisation, analysis,
summary statistics etc.
?
Download