ERozet_V02 - ORBi

advertisement
MEASURING VARIABILITY SOURCES IN NMR METABOLOMIC STUDIES
E. Rozet1,*, P. De Tullio2, Ph. Hubert1, B. Govaerts3
1.
Analytical Chemistry Laboratory, CIRM, Université de Liège, Liège, Belgium.
Medicinal Chemistry, CIRM, Université de Liège, Liège, Belgium.
3.
Institut de Statistique, Université Catholique de Louvain, Louvain-la-Neuve, Belgium.
*
Eric.Rozet@ulg.ac.be
2.
Due to the huge amount of information available in NMR spectra obtained from the analysis
of metabolomic experiments, multivariate analysis such as Principal Component Analysis
(PCA) are required to understand the influence of treatments over the metabolites [1].
However, many experiments in metabolomics studies have more complexes variability
structures than simply comparing several treatments: they may include time effects, biological
effects such as diet or hormonal status, and other blocking factors or variability sources:
samples stability, age of the individuals, pH of a buffer, days of acquisition, and so on.
Metabolomic data analysis needs to extract from the spectral data matrix the variations linked
to a change indicated in the factor of interest. However other sources of variability may
impair this objective. This stresses the importance to discover the sources of variability of the
spectral metabolomic data using appropriate methodology.
Classically, to analyze such data analysis of variance (ANOVA) or multivariate ANOVA
(MANOVA) [2] is used. However direct application of these methodologies to NMR spectra
obtained from structured metabolomics studies is inappropriate or impossible. More complex
data analyses methodologies are required to understand the importance of the various factors
implied in the experiments and to provide a measure of their variance components. Three
related methodologies have been proposed to achieve this: ASCA [3], ANOVA-PCA [4] and
AComDim [5]. The ASCA and ANOVA-PCA methodologies combine first an analysis of
variance step (ANOVA) and then a PCA step. The AComDim one adds to the output of the
ANOVA-PCA step a multi-block analysis.
In this presentation, the usefulness and applicability of these advanced techniques to data
analysis of NMR metabolomic spectra are provided to highlight the increase of knowledge
gained and the estimation of main sources of variability arising in an experimental setup.
Two NMR databases will be used [6]. The first one concerns human serum analyzed by 1HNMR where three random factors are present: day of measurement (3 days), sample (2
samples per individual) and replication of analyses as well as two fixed controlled factors,
time of measurement after thawing (2 times) and two protein suppression methods for the
spectral pre-treatment. The second database is about the 1H-NMR analyses of rats’ urine
where two different concentrations of citrate and of hippurate were deliberately added and
three other sources of variability are present: urine pool diluted or not diluted, repetitions of
analyses, days of analyses (three days), as well as two different spectral pre-treatment
procedures.
[1] Metabolomics in Practice, Successful strategies to generate and analyze metabolomic data, M. Lämmerhofer
and W. Weckwerth (Ed), Wiley-VCH Verlag, 2013, 415 p.
[2] L. Stahl, et al., Chemom. Intell. Lab. Syst. 9 (1990) 127-141.
[3] J.J. Jansen, et al., J. Chemom., 19 (2005) 469-481.
[4] P.B. Harrington, et al. Anal. Chim. Acta 544 (2005) 118-127.
[5] D.J.R. Bouveresses, et al., Chemom. Intell. Lab. Syst. 106 (2011) 173-182.
[6] R. Rousseau, Statistical Contribution to the analysis of metabonomic data in 1H-NMR spectroscopy,
Doctoral Thesis, Université Catholique de Louvain(UCL), Louvain-la-Neuve, 2011.
Download