First demonstration on NSE biomarker of a computational environment dedicated to lab-on-chip based cancer diagnosis Pierre GRANGEAT1, Laurent GERFAULT1, Caroline PAULUS1, Vangelis KRITSOTAKIS2, Manolis TSIKNAKIS2, Frédérique LISACEK3, Pierre-Alain BINZ4, Manuel M PEREZ-PEREZ5, Mathieu TRAUCHESSEC6, Virginie BRUN6. CEA, LETI, MINATEC, DTBS, Laboratoire d’Electronique et de Systèmes pour la Santé, 17 rue des Martyrs, F-38054 Grenoble cedex 9, France. 2 FORTH, Foundation for Research and Technology –ICS, Vassilika Vouton, Heraklion 71110, Greece 3 Proteome Informatics Group, Swiss Institute of Bioinformatics, Michel-Servet 1, CH-1211 Geneva 4, Switzerland 4 Geneva Bioinformatics (GeneBio) SA, Av de Champel 25, CH-1206 Geneva, Switzerland 5 ATOS Research and Innovation, Biotechnology and Health Unit,C/ Albarracin 25, Madrid 28035, Spain 6 CEA/INSERM/UJF U880, Laboratoire d’Etude de la Dynamique des Protéomes, 17 rue des Martyrs, F-38054 Grenoble cedex 9, France. 1 Introduction : CEA-LETI is designing new silicon-based digestion and nano-chromatography components for mass spectrometry analytical chains. EDyP laboratory has developed an expertise dedicated to the absolute and accurate quantification of proteins in biofluids [1]. On the LOCCANDIA European project (www.loccandia.eu) an innovative integrated clinicoproteomics computational environment has been developed to support the LOCCANDIA integrated lab-on-chip based diagnostic device [2]. It combines several modules dedicated to web-based data and parameter management, profile reconstruction for protein quantification, data visualisation, peptide identification, and decision support based on classification. In this communication, we propose the first demonstration of this software environment on an experimental data set. Method : This LIMS computational environment integrates several innovative contributions. It is a web-based application, responsible for the storage, examination and manipulation of clinical and proteomic data. The three-tier architecture allows any upgrade as requirements or technology change. The profile reconstruction module computes the protein quantification from LC-MS spectrograms. The new version implements the inversion of a nano-liquid chromatography dynamical model [3] that allows including non-linear isotherms as it is the case for nano-LC chip. The visualization module is the MSight software platform. It provides a sophisticated way of visualizing mass spectrometer raw data. The identification module from LC/MS/MS data is the Phenyx software[4]. The classification module is based on Logistic regression and Support Vector Machines with nine different kernels. Preliminary data : Neuron Specific Enolase (NSE), recognized as a lung and brain cancer biomarker, was chosen as model. Using the PSAQ (Protein Standard Absolute Quantification) strategy [5], EDyP laboratory has developed a LC-MS-based assay enabling the absolute quantification of NSE in serum samples. The analytical process includes an immunoenrichment step followed by trypsin digestion and LC-MS analysis on an Orbitrap mass spectrometer. Quantification is performed by spiking into the samples an isotopically-labeled version of NSE as internal standard. To generate the data set, 4 defined amounts of unlabeled NSE (ranging from 0 to 200 ng/mL) were artificially spiked into 1 mL serum samples. NSE was accurately quantified down to its physiological level (50ng/mL). The data have been uploaded into the LIMS via the web-based access and the relevant parameters have been filled. The analysis has been carried on 3 peptides (IEEELGDEAR, GNPTVEVDLYTAK, IVIGMDVAASEFYR). The identification module has identified these peptides and transferred the information to the visualisation module for annotation purposes. The visualisation module clearly demonstrates the presence of these peaks in the light and heavy forms. The reconstruction module has been applied independently on each peptide. Calibration is performed to estimate the common model parameters. Regions of interest around processed peptides are specified. Then, unsupervised adaptive computation of protein concentration is performed. For the 27 computed protein quantities, the maximum relative error is 20% with a mean value of 10%. Despite 3% of retention time variation and 35% of gain variation, the mean coefficient of variation on quantification is 10%. [1] Brun et al. (2009), J. Proteomics 72.5, 740-749 [2] Jordan et al., (2008), 5th pHealth Workshop, Valencia, Spain. [3] Paulus et al. (2009), SYSID 2009, 251-256, Saint Malo, France. [4] Colinge et al. (2004) Proteomics 7, 1977-84. [5] Brun et al. (2007), Mol. and Cell. Proteomics 6.12, 2139-2149. Novel aspect: Protein quantification computational environment, web-based application, nano liquid chromatography dynamical model, unsupervised adaptive computation, NSE quantification in serum samples.