Metabolomics Software Tools Xiuxia Du, Paul Benton, Stephen Barnes Outline ! • Introduction ! ! ! ! • Software Tools for LC-MS metabolomics ! ! ! ! • !Software ! ! Tools for GC-MS metabolomics ! • !Software Tools for Statistical Analysis !! !! !! !! ! !! !! 2 Introduction • !LC-MS data analysis workflow ! ! ! ! ! ! ! data pre! ! processing ! ! statistical analysis compound identification pathway analysis ! ! ! ! ! • !GC-MS data analysis workflow ! !! feature ! extraction ! ! deconvolution statistical analysis compound identification pathway analysis 3 Software Tools for LC-MS • !Commercial • !MassHunter — Agilent ! • MetQuest — Thermo Scientific ! • MetWorks — Thermo Scientific ! • Multiple Mass Defect Filter — Thermo Scientific ! • Progenesis QI — Waters ! • XCMSplus — Sciex ! ! ! ! ! • and more …… ! ! ! ! !! !! ! ! ! 4 Software Tools for LC-MS • !Free • !Insilicos ! Viewer ! ! • ProteoWizard ! • SeeMS: interactive viewer for mass spec data files (Windows only) ! • ! ! ! ! ! ! ! MSConvert: convert between various file formats ! • MZmine 2: LC-MS data processing • ! MetaboSearch: perform mass-based metabolite search simultaneously against four major metabolite databases ! • MetaboAnalyst: a web server for metabolomics data analysis ! ! ! • and more …… 5 Data format • Proprietary data formats: need vendor library functions to access ! Company File extension ! Agilent .D ! Sciex .WIFF ! Theomo .RAW ! Waters .RAW ! ! …… ! ! • Open data formats: free library functions to access • • • • netCDF mzData mzXML mzML 6 InsilicosViewer 7 Insilicos Viewer Raw data viewer ! ! ! total ion chromatogram ! ! ! ! ! ! ! MS and MS/MS ! ! ! ! ! 8 ProteoWizard 9 ProteoWizard: SeeMS • A data viewer ! ! ! ! • Can read these open data formats • mzML • mzXML ! ! ! ! ! ! ! 10 ProteoWizard: SeeMS Raw data viewer ! ! ! ! ! total ion chromatogram ! ! ! ! ! spectrum ! ! ! ! list of scans ! 11 ProteoWizard: MSConvert Data format converter 12 ProteoWizard: MSConvert • Supported data formats • Read: open formats, vendor formats • Write: open formats ! • Filters and transformation • • • • • • • • msLevel Peak picking Zero samples ETD filter Threshold peak filter Charge state predictor Activation Subset ! ! ! 13 MZmine 2 14 MZmine 2 ! metabolomics data processing, analysis, and visualization • LC-MS ! ! ! ! ! ! ! ! ! ! ! ! • !Supported open data formats ! • NetCDF ! • mzData ! • mzML ! • mzXML ! ! ! ! ! 15 Raw data import list of scans in raw files • MS scans in blue • MS/MS scans in red • # sequential number • @ retention time • MS level • type of spectrum • p = profile • c = centroid • t = thresholded • polarity of ionization • + = positive • - = negative • ? = unknown 16 Raw data visualization brings up Right click on the file name 17 Raw data visualization double click TIC total ion chromatogram brings up spectrum 18 Raw data visualization ! ! ! ! 2D view ! ! ! ! ! ! ! ! ! ! ! 19 Raw data visualization rotate to find the best perspective 3D view 20 Data pocessing • Workflow • Raw data import • (optional) Raw data methods / Filtering • Peak detection • Isotopic peak grouping • (optional) Identification of fragments, adducts, and peak complexes • (optional) Normalization of retention time • Alignment • (optional) Gap filling • (optional) Normalization of peak heights / areas • (optional) Identification using database search, formula prediction, etc. • Data analysis and export ! ! ! ! 21 Peak detection • Mass detection • • • • • Centroid Exact mass Local maxima Recursive threshold Wavelet transform ! • Chromatogram building ! • Peak deconvolution ! ! ! ! ! ! 22 Peak detection Peak detection procedure ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 23 Mass detection Mass detection options ! ! ! ! ow ! in d ! er w ! ar am et ! ep ! up th ! br in g ! ! re t he k cl ic ! ! o ! 24 Mass detection Set ! mass detection parameter ! ! ! ! ! ! ! ! ! ! ! ! ! ! 25 Mass detection Set ! mass detection parameter ! ! ! ! ! ! ! ! ! ! ! ! ! ! 26 Chromatogram building Chromatogram builder ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 27 Chromatogram building Chromatogram builder — set parameters ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 28 Chromatogram building After chromatograms are built ! ! ! ! ! ! ! ! ! peak list • # ID • m/z value • @ retention time ! ! ! ! ! ! 29 Chromatogram building To display extracted ion chromatograms brings up Right click on the file name 30 Chromatogram building Show chromatogram information ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 31 Chromatogram building Show chromatogram information ! ! ! ! ! ! ! a particular extracted ion chromatogram (EIC) ! ! ! ! ! ! ! ! 32 Peak deconvolution 33 Peak deconvolution Algorithms ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 34 Peak deconvolution Set parameters click here to bring up the parameter window 35 Peak deconvolution ! ! ! ! ! ! ! ! de i f n o i t u l o v n co d e h nis ! ! ! ! ! ! ! 36 Peak deconvolution Results ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 37 Alignment Retention time normalization ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 38 Alignment Retention time normalization: set parameters ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! Alignment Join aligner Alignment Join aligner: set parameters Alignment Get aligned peak list Alignment Information on aligned peak list Green: present Red: absent Peak identification Options 44 Peak identification By searching custom databases an example custom database 45 Peak identification By adduct search: set parameters 46 Peak identification Adduct search results 47 Peak identification By online database search 48 Peak identification Online database search results 49 Gap filling Options and parameters 50 Gap filling Results and visualization options 51 Gap filling Set visualization parameters 52 Gap filling Visualization 53 Data analysis Options 54 Data export Options 55 Data export Parameters 56 Data export csv format 57 MetaboSearch 58 MetaboSearch Input data format 59 http://omics.georgetown.edu/metabosearch.html#ug MetaboSearch GUI 60 MetaboSearch Steps 61 Software tools for GC-MS Metabolomics 62 Software Tools for GC-MS • !Commercial • !ChromaTOF® — LECO ! • MassHunter — Agilent ! ! ! ! ! ! ! ! ! ! ! • !Free ! • AMDIS: Automated Mass Spectral Deconvolution and Identification System NIST MS Search !• ! • Tagfinder ! ! ! ! 63 AMDIS Raw data visualization ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 64 AMDIS Result visualization ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 65 NIST MS Search GUI ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! 66 Software tools for Statistical Analysis 67 Stats and Machine Learning • !Commercial • !SIMCA ! • Mass Profiler Professional (MPP) — Agilent ! ! ! • and more …… ! ! ! ! ! ! ! ! ! ! • !Free ! • MetaboAnalyst !• R ! ! ! ! ! 68 MetaboAnalyst 69 Other software tools • !COMSPARI • !COMparison of SPectral And Retention Information ! ! • !MathDAMP ! • Mathematica package for Differential Analysis of Metabolite Profiles ! ! • !MSFACTs ! • Metabolomics Spectral Formatting, Alignment and Conversion Tools ! ! ! ! ! many more …… and ! !! ! ! ! 70 Thank You!