References

advertisement
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
EEG/ERP Analytic Tools Management System
Jan Štěbeták, Tomáš Řondík, Roman Mouček, Department of Computer Science and
Engineering, Faculty of Applied Science, University of West Bohemia, Pilsen, Czech
Republic.
Correspondence:
Jan Štěbeták
University of West Bohemia
Department of Computer Science and Engineering
Univerzitní 22,
Pilsen, 30614, Czech Republic
stebjan@kiv.zcu.cz
Abstract:
Neuroinfomatics laboratories produce a lot of experimental data, which has to be stored and
further processed. This paper introduces a management system for signal processing and
sharing analytic tools. Although several systems for signal processing exist, their sharing is
not satisfactorily solved. Authors present a server-side approach, which provides processing
signal via analytic tools. Since the implemented methods are accessible via the SOAP Web
Service, integration with system providing data storage is implemented. Integration and
sharing signal processing methods with other systems is available. The set of methods is
implemented and presented. The design of workflows is also presented.
Keywords: Electroencephalography; Event-related potential; Analytic tools, Web Services;
Integration; Workflows
1 Introduction
In our research group we specialize of research of a brain activity. During our experiments we
widely use the methods of Electroencephalography (EEG) with its subset Event-Related
Potentials (ERP). Experiments are performed in a neuropsychological laboratory including
recording devices or a car simulator. When experiments are performed experimental data and
metadata are collected for future processing. Since neuroscience community is facing
problems with the long-term storage of data and metadata, raw data analysis, or sharing data
and analytic methods, International Neuroinformatics Coordinating Facility (INCF, 2012)
released recommendations (Van Pelt and van Horn, 2007) for handling neurophysiologic data.
As members of the Czech National Node of International Neuroinformatics Coordinating
Facility (INCF) we cooperated one the definition of data and metadata format for
electrophysiology research. Our efforts resulted in a custom solution – the EEG/ERP Portal
(Jezek and Moucek, 2010). Data are stored within the EEG/ERP Portal and are accessible via
a web browser.
Because data from stored experiments are usually further processed using various signal
processing methods, the EEG Data Processor (Jezek and Moucek, 2013) is presented.
This paper describes methods for obtaining ERP components from EEG records and analytic
tools for detection ERPs. Since scientists appreciate the possibility to have experimental data
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
and processing methods on the same place, we presented integration of analytic tools to the
EEG/ERP Portal. The last part of this paper deals with a design of workflows.
2 Materials and methods
This section briefly describes the development of neuroinformatics infrastructures, methods of
detection ERP components, development of methods for EEG/ERP signal analysis, and
system integration.
2.1 Neuroinformatics infrastructures
The neuroinformatics infrastructure is being built in several INCF national nodes in parallel.
The INCF portal includes e.g. a software center for easy storage and sharing of
neuroinformatics software tools, a content management system for national nodes presentation
and provides access to supercomputing resources for the neuroinformatics community.
Modular Toolkit for Data Processing (MDP) (Zito et al., 2008) is a data processing framework
written in Python. MDP is a modular framework that Python programmers can extend by
additional modules. Common users can call implemented modules locally.
The British CARMEN system (Watson et al., 2007) has been designed to allow neuroscientists
to share data and programs (services) from neurophysiological experiments. The ‘Portal’ is a
web interface onto the CARMEN system, and provides end‐users with access to the computer
and data storage resources which are at the core of the system.
The portal provides following set of features
 Search across archived data set
 Upload, annotate, and store experimental data
 Run processes and routines on the stored data on the CARMEN computers
The analysis services are implemented as ‘web‐services’ on the CARMEN system.
2.2 ERP components and methods
P300 component (Fig. 1) is a component obtained from an EEG signal which correlates with
the specific brain activity. This component appears as a brain response to an unexpected
external stimulus. The normal delay of the occurrence is approximately 300 ms after the
appearance of the stimulus.
Fig. 1
Example of P300 component
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
The methods (Luck 2005) for analyzing a pure EEG signal are used especially in medicine for
detecting epileptic waveforms and sleep disorders, or for indicating brain death. The cluster
analysis and adaptable neural network are widely used. The fundamental method for obtaining
ERP components from the EEG signal is averaging.
A stimulus is repeated during an experiment and brain responses are averaged into one
response. Many subjects can be measured using one experimental scenario and these subjects
can be associated into subject groups depending on their age, gender, or disease. The grand
average is one representative brain response to an external stimulus in the whole group
2.3
Levels of System Integration
2.3.1 Data Layer Integration
Data integration involves combining data residing in different data repositories and providing
business users with a unified view of this data. The top batch-oriented technique that
companies utilize is known as ETL (Extract – Transform – Loading, Fig. 2) (ETL, 2011).
Fig. 2
Principle of ETL
ETL enables physical movement of data from source to target data repository. The first step,
extraction, is to collect or grab data from its source(s). The second step, transformation, is to
convert, reformat, cleanse data into format that can be used be the target database. Finally the
last step, loading, is import the transformed data into a target database, data warehouse, or a
data mart.
ETL tool providers:
Commercial ETL Tools:
 IBM Infosphere DataStage
 Informatica PowerCenter
 SAP Business Objects Data Integrator (BODI)
 SAP Business Objects Data Services
 Oracle Warehouse Builder (OWB)
 Oracle Data Integrator (ODI)
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172






Data Integration Studio
Microsoft SQL Server Integration Services (SSIS)
Ab Initio
SyncSort DMExpress
iWay DataMigrator
Pervasive Data Integrator
Freeware, Open Source ETL tools:
 Pentaho Data Integration (Kettle)
 Talend Integrator Suite
 CloverETL
 Jasper ETL
2.3.2 Application Layer Integration
Remote Procedure Call (RPC) is widely used for constructing distributed, client-server based
applications. A client application calls a remote procedure (method), transfers data to a server
application, and waits for a result. For web applications there is used Web Services
technology (Jie Liu et al., 2006).
Web Services uses XML messages and HTTP protocol (HyperText Transfer Protocol). Web
Services use XML Schema for data type definition and XML Namespaces for objects
identification. Web Services includes three cores:
 SOAP (Simple Object Access Protocol)
 WSDL (Web Services Definition Language)
 UDDI (Universal Description Discovery and Integration)
SOAP is a simple XML-based protocol that allows to communicate applications information
over HTTP without the dependency of OS platform. SOAP uses HTTP and XML as the
mechanisms for information exchange.
Current free and commercial implementations available for Web Services:
 Apache SOAP, Axis 1 and Axis 2. SOAP and Axis 1 are now obsolete; use Axis 2
instead.
 JAX-WS Reference Implementation
 JAX-RS Reference Implementation
 Metro (includes the JAX-WS reference implementation)
 Apache CXF (formerly called XFire)
 MS.NET
 Java 6 includes the JAX-WS reference implementation (and a minimal server for it)
Cloud computing is the use of computing resources (hardware and software) that are delivered
as a service over a network (typically the Internet). The name comes from the use of a cloudshaped symbol as an abstraction for the complex infrastructure it contains in system diagrams.
Cloud computing entrusts remote services with a user's data, software and computation.
2.3.3 Presentation Layer Integration
Web portals (Ingyin, 2006) are a means for presentation level integration of enterprise
application and services. Web portals allow the users to select the links they would like on
their personal page. A portal is built upon layers of services and component modules. Portlets
are used by portals as pluggable user interface components that provide a presentation layer to
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
information systems. Portal provides Web based personalizable and integrated system to
access internal and external application which support user process and implement front-end
integration.
3 EEG/ERP Portal
Our research group has developed an EEG/ERP Portal (fig. 3) because of a need for storage of
EEG/ERP experiments gained from encephalography research. This portal enables research
groups to store, manage and download their experimental data and metadata. Portal is
developed as a standalone product running on servers in our department. The usage of the
Portal does not require any special software installation, only a web browser.
The portal application is based on Spring MVC (Spring, 2011) technology (Model – View –
Controller). Access to the EEG database is ensured by framework Hibernate (Hibernate,
2011). This framework is used for object-relational mapping. It also protects our portal
against SQL injection attack among other things. The portal uses JSP technology (Java Server
Pages) including JSTL for displaying HTML pages into user´s browser. The Portal provides:
 Management of EEG/ERP data and metadata
 Management of EEG/ERP experimental design (experimental scenarios)
 Management of data related to tested subjects
 Sharing of knowledge and working within groups
 Content management system
 Full-text and advanced search
The data are protected by the system of user accounts with defined user roles. Individual users
are grouped into self-managed groups. On the basis of activities that the user can perform four
user roles are recognized (Reader, Experimenter, Group Administrator, and Supervisor). The
user who wants to upload or download experiments has to create an account and to become a
member of at least one group.
Fig. 3
EEG/ERP Portal frontend
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
4
Analytic Tools
This section describes methods that we use for obtaining ERP components, removing
artifacts, and detecting the P3 component. Averaging is the fundamental method for obtaining
ERP components and removing artifacts. We use the Fast Fourier Transform and Fast ICA
algorithm for detecting and removing artifacts. For detecting the P3 component there are used
following set of methods: Matching Pursuit, Discrete and Continuous Wavelet Transform, and
Hilbert-Huang Transform.
Fourier transform (FT) is the oldest method of signal analysis. This method allows you to
convert the signal from time domain to the frequency, which means that it can detect all
frequency components occurring in the signal, and level of their representation [9]. Its discrete
variant is often used in computer science. We use FFT for detecting frequency in EEG/ERP
signal during experiments with steady-state potentials. FFT is also used in implementation of
matching pursuit algorithm according to [820]. We implemented two methods for signal
spectrum estimation as well – BlackmanTukey.
The matching pursuit (MP) (Vareka, 2012) algorithm decomposes any signal to the sum of socalled atoms, which are selected from the dictionary. The atom that best approximates the
input signal is chosen in each iteration. This atom is subtracted from the input signal and the
residue enters the next iteration of the algorithm. The total sum of atoms selected successively
in algorithm iterations is an approximation of the original signal - more iteration we do, more
accurate approximation we get. The algorithm was originally described in [MP]. We decided
to implement this method according to [820] where the Gabor dictionary is used.
For displaying results we implemented the time-frequency transformation known as WignerVille transformation. The input of this transformation is the set of chosen atoms. Energy of
atoms shows the occurrence of P3 waveform.
Wavelet Transform (WT) (Ciniburk et al., 2010) is a suitable method for analyzing and
processing non-stationary signals such as EEG. For EEG signal processing it is possible to
use continuous wavelet transform (CWT) or discrete wavelet transform (DWT). Both CWT
and DWT were tested during our research focused on automatic ERP detection. WT is a
suitable method for ERP detection because it has a good time and frequency localization.
DWT is common in computer science because of high performance caused by its algorithmic
complexity. We’ve implemented a lot of wavelets which can be used in DWT. The reason is
that for our research in automatic ERPs detection is necessary to have a wavelet which
corresponds to detected ERP as much as possible. And there are many ERPs with different
waveforms.
Available wavelets: Coiflet6, Coiflet12, Coiflet18, Coiflet24, Coiflet30, Daubechies4,
Daubechies4, Daubechies6, Daubechies8, Daubechies10, Daubechies12, Daubechies14,
Daubechies16, Daubechies18, Daubechies20, Haar, Symmlet4, Symmlet6, Symmlet8.
CWT is often replaced in computer science by its discrete form because of its algorithmic
complexity. However, we decided to make this method accessible for signal processing in
EEG Portal. The reason is size of step by shifting a wavelet on original signal and is described
in [Souhyho diplomka] in detail.
Available wavelets: ComplexGaussian, ComplexMorlet, Gaussian, MexicanHat, Morlet.
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
Independent Component Analysis (ICA) (Hyvärinen et al., 2001) is a well-known method for
blind signal separation and signal deconvolution. In the EEG/ERP domain, ICA can be used
for artifact removal, ERPs detection, and – generally speaking – for detection and separation
of every signal which is independent on EEG activity.
We implemented four common filter types: low pass, high pass, band pass, and band reject,
which can be parameterized to get a desirable filter. These filters allow us to get rid of
undesirable frequencies from input signal. This is typically first step in EEG/ERP processing
because we know which frequencies are common in this kind of signal – any other frequency
is regarded as noise.
5
Results
5.1 EEG Data Processor
EEG Data Processor is a web based application, which offers analytic tools for EEG/ERP
signals. This application does not need any installation, only web browser. The frontend
design is on figure 4.
Fig. 4
EEG Data Processor frontend
This application is written in Java language. We use the Spring Framework and AJAX
technology for processing HTTP requests. For connection to the database we use the
Hibernate framework. To the database there are stored registered users, which are permission
to use the EEG Data Processor. The authentication process is ensured by Spring Security. The
application design is on figure 5.
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
Fig. 5
EEG Data Processor architecture
The internal structure consist of several components:
 EEG Binary Loading – It loads data from binary files obtained from an analoguedigital converter. We currently suppord data obtained from the Brain Vision Recorder.
 Processong Resource Pool – Since performance capacity of the hosted server is
limited, we have implemented a pool of available resources. The systém can be
configured to manage a number of requests simultaneously.
 EEG Processing Algorithms – This module manages a running of installed plug-ins. It
has access to the list of installed plug-ins and calls a method invoker.
 External Method Invoker – Is responsible for execution of a requested method. It
parses the method parameters an takes the method result.
5.1.1 Supported Data Format
Since there are not standardized format for EEG/ERP domain, we use data format, which is
provided by Brain Vision Recorder. This format includes binary data file .eeg where signal
values are encoded. This data format encoding is described in a header file .vhdr. An example
of a header file is given below.
[Common Infos]
Codepage=UTF-8
DataFile=rab.eeg
MarkerFile=rab.vmrk
DataFormat=BINARY
MULTIPLEXED=ch1,pt1,ch2,pt1…
DataOrientation=MULTIPLEXED
NumberOfChannels=21
; Sampling interval in microseconds
SamplingInterval=1000
[Binary Infos]
BinaryFormat=INT_16
[Channel Infos]
Ch1=Fp1,,0.1,μV
Ch2=Fp2,,0.1,μV
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
Ch3=F3,,0.1,µV
Ch4=F4,,0.1,µV
According to the header file, we have implemented the reader of the binary file. The output
of this reader is a field of double numbers (signal values).
The last type of file produced by Brain Vision Recorder is a marker file .vmrk. This file
contains positions of stimuli in the EEG signal. An example of a marker file is given below.
[Common Infos]
Codepage=UTF-8
DataFile=rab.eeg
[Marker Infos]
Mk1=New Segment,,1,1,0,20100223094409773314
Mk2=Stimulus,S 1,32661,1,0
Mk3=Stimulus,S 2,34481,1,0
Mk4=Stimulus,S 3,36301,1,0
Mk5=Stimulus,S 4,38121,1,0
Mk6=Stimulus,S 5,39941,1,0
The EEG Data Processor is fully prepared for adding support of a new data format.
5.1.2 Adding New Methods
Currently, we design a plug-in system for registering and adding new analytic tools. This
system allows adding new methods automatically. Nevertheless, we are focused only for
methods developed in Java language. These JAR libraries have to follow the criteria:
 The initialization method must be void type
 XML description of the output. Because output of individual methods can be different,
the XML format ensures its easier future representation
 To the JAR libraries must be attached a setting file, where the initialization Method
have to be described
An example of the setting file is given bellow.
algorithmName=Matching pursuit
author=Anonymous
main.class=pilsner.matchingPursuit.present.RunnerMatchingPursuit
main.method=runMatchingPursuit
The next challenge leads in providing a wrapper for a set of most often used programming
languages in signal processing such as Python, C/C++, Pearl, or Matlab.
5.2
Integration of Applications
5.2.1 Purpose of Integration
The purpose of integration is to enable users of EEG/ERP Portal analyzing their data using
analytic methods mentioned in Chapter 4. The second goal is to allow sharing analytic tools
using this application as a third party service provider.
Since analytic methods are time-consuming and computationally demanding, EEG Data
Processor is running on another server than EEG/ERP Portal. This separation ensures that
users of EEG/ERP Portal are not influent by running analytic methods.
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
Because of sharing methods, the data layer integration has been rejected. This approach only
allows sharing data between both applications. However, data stored in EEG/ERP Portal
database would be available for EEG Data Processor; users of EEG/ERP Portal would have to
open EEG Data Processor in their browser and then analyze their data. We decided for the
business layer integration, which is based on remote procedure call. This approach allows
using analytic tools via EEG/ERP Portal. It also provides sharing these tools, because remote
methods can be called from many applications.
5.2.2 Implementation
The technology used for remote procedure call is known as Web Services technology. This
technology is suitable for point to point integration for web based applications. Currently, we
have two web based applications (EEG/ERP Portal and EEG Data Processor). Both
applications are based on Spring framework. Apache CXF is frequently used implementation
of Web Services technology for such applications.
Apache CXF is an open source services framework that makes web service development easy,
simplified, and standard based. This technology is used for client-server based integration of
web applications. The client application will prepare an order and send it to the server
application through a business method call. The server application will contain a web service
that will process the order. For implementation we use code-first approach. It includes three
following steps:
 Create a Service Endpoint Interface (SEI) and define a business method to be used
with the web service.
 Create the implementation class and annotate it as a web service.
 Create an xml configuration of the service class and a Spring bean using JAX-WS
frontend.
The interface below is the Service Endpoint Interface defying methods, which EEG Data
Processor provides.
@WebService
@Secured("ROLE_USER")
public interface ProcessService{
/**
* Returns number of currently available processing units.
* @return available processing units
*/
public int availableProcessingUnits();
/**
* Getter of parameters necessary for method to run.
* @param fileFormat supported file format
* @param methodName name of desired process method
* @return array of parameters
*/
public MethodParameters[] getMethodParameters(SupportedFormat fileFormat,
String methodName);
/**
* Returns byte array of processed data
* (will be replaced by output format in time)
* @param data files to be processed
* @param fileFormat one of supported file formats
* @param algorithmName name of processing algorithm
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
* @param params other parameters
* @return bytes of processed data
*/
public byte[] processData(DataFile[] data, SupportedFormat fileFormat,
String algorithmName, String[] params);
/**
* Getter of available algorithm names.
* @return algorithm names
*/
public String[] getAvailableMethods();
}
The interface and the implementation have been created. It is necessary to configure an
endpoint in an xml file. In EEG Data Processor the webservice.xml file is used. Configuration
is given below.
<bean id="processService"
class="cz.zcu.kiv.eegprocessor.webservice.ProcessServiceImpl"/>
<jaxws:endpoint
id="personService"
implementor="#processService"
address="/webservice/processService">
<jaxws:properties>
<entry key="mtom-enabled" value="true"/>
</jaxws:properties>
</jaxws:endpoint>
</beans>
Id specifies a unique identifier for a bean, implementor specifies the actual web service
implementation class, and address specifies the URL address where the endpoint is published.
The URL address must be relative to the web context.
5.3 Design of Workflows
Since data processing often requires usage of more methods sequentially, development of
specific workflows is required. Apart from sharing the methods described above, we will
provide an opportunity to create, use, and share workflows in the EEG/ERP Portal.
Workflows will simplify the work with data and methods and offer more comfort to users.
Now we are working on design and technological aspects of workflows. We need to extend
the database by the tables for workflows defined by users. There are two types of analytic
methods:
 Signal processing methods such as Matching Pursuit or Wavelet Transformation
 Signal preprocessing methods such as averaging or filtering
Technically, workflow means to put analytic methods into a pipe, where an output from the
previous method becomes an input to the next method. The signal processing methods modify
the signal; their input and output have the same format. These methods are compatible and
could be put anywhere into the pipe. Signal processing methods provide result in other
format.
We consider several approaches, how to deal with the incompatibility of provided output
formats. The first idea consisted in putting signal processing methods at the end of the pipe
because of the other output format. This approach would allow modifying an EEG/ERP signal
using signal preprocessing methods and analyze it as one task. Nevertheless, this approach
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
would not allow further processing of analytic result. There are also methods for visualization
of this result. These methods should be put into the pipe after signal processing methods.
Considering this point, we are developing a workflow management system. This system will
check the compatibility of input and output parameters. An example of parameters type and
explaining of the compatibility is given below.
 Output parameter type from averaging method is an array of double values (EEG/ERP
signal). Input parameter type of Wavelet Transformation is the array of double values
as well. These parameters are compatible.
 Output parameter type from Matching Pursuit method is a two-dimensional array
(values of atoms). Input parameter type of scalogram, which is used for visualization
of result is a one-dimensional array. Parameters of these methods are not compatible.
Scalogram is used for visualization of Wavelet Transformation result.
This management system will solve the issue about further processing of analytic result.
However, parameters, which are syntactically compatible, do not have to be semantically
compatible as well. It means, one-dimensional arrays match syntactically but in case of
averaging, output is an EEG/ERP signal. Output format of Wavelet Transformation is a onedimensional array as well, but in this case it returns computed coefficients.
As a solution of this issue we will extend our database. It will provide a list of available
methods, input and output parameters type, and description of these parameters. This
information will be provided to users, who will create their own workflows. A user will
decide which method he/she put into the pipe and he/she also decides the order of these
methods. The graphic user interface will be developed, which allows creating and saving
user´s own workflows.
Workflows are organized as a tree structure (Fig. 6), where each branch of the tree has the
same meaning as a pipe in Linux; an output of the method serves as an input of the next
method.
Fig. 6
Example of a workflow tree
5.4 Tests
We did some performance tests to be sure that our encapsulation of discrete signal processing
methods and their integration into EEG portal doesn’t have a negative impact on its
performance.
5.4.1 Methodology
We run four test routines for each discrete signal processing method. Each routine with
different number of samples in input signal: 256, 512, 1024, and 2048 (we used real
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
EEG/ERP signal, but there is no difference in performance which depends on input signal
characteristic). Each routine was repeated one thousand times to minimize measuring error
caused by third parties (CPU task planning, asynchronous events, etc.). Our effort was keep
the same conditions for all methods as much as possible – we used the same configuration
(Win 7 Professional SP1 x86 with Java Runtime Environment 7 on Intel Core 2 Duo 2 GHz,
1066 MHz FSB, 3 MB L2 cache, 3 GB DDR2 RAM – no multithreading was used) and only
one application - which was started by user - was our test routine started from terminal.
5.4.2





Setup
DWT: Haar wavelet, all possible wavelet strengths were computed
CWT: Mexican hat wavelet, step 1, wavelet strength set to 60
MP: Gabor base was used, decomposition to 10 atoms
FastICA: decomposition to 10 independent signals
FFT: there is no parameter to be set up
5.4.3 Test result
We obtain following results (Table 1):
Table 1. Test results
Average time in milliseconds for each discrete signal processing method
DWT
CWT
MP
FastICA
FFT
Number of
samples
256
512
1024
2048
542
543
544
0.044
0.058
0.078
0.153
10.002
55.685
230.468
709.730
0.225
0.552
0.621
1.317
0.136
0.219
0.410
0.797
Results are also displayed in graphs (Fig. 7):
DWT, FastICA, FFT
800
1.4
700
1.2
Time in milliseconds
Time in milliseconds
CWT
600
500
400
300
200
100
0
1
0.8
DWT
0.6
FastICA
0.4
FFT
0.2
0
256 512 1024 2048
Number of samples
545
1225.988
4941.827
21320.228
89471.612
256
512
1024
Number of samples
2048
MP
Time in seconds
100
80
60
40
20
0
256
512
1024
2048
Number of samples
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
Fig. 7
Test of signal processing methods
The results look like we expected. Performance of DWT, FastICA, and FFT is in normal and
we expect that there is no reason to do some performance optimization in future. CWT is
implemented as brute-force algorithm so its performance is not a surprise. The performance of
matching pursuit algorithm is worse.
6 Discussion and conclusion
Although several systems that implement signal processing methods exist, their sharing and
remote procedure calling is not satisfactorily solved. We have developed a neuroinformatics
infrastructure. This infrastructure is composed of two web based application. Since
neuroinformatics research produce lot of data, the EEG/ERP Portal was developed. The
EEG/ERP Portal enables research groups to store, manage, and download their experimental
data and metadata. The second application, EEG Data Processor enables to analyze
experimental data.
Remote procedure call is widely used for business layer integration of application. For web
based applications there is used Web service technology. Since both applications are
developed using Spring framework, we use Apache CXF implementation of Web services.
This integration allows users of EEG/ERP Portal analyzing their data using methods
implemented in EEG Data Processor. Since sharing data and tools is very helpful for scientific
community, used technology also provides analytic methods of EEG Data Processor to a
third-party application. But using of this technology for sharing analytic tools has one
disadvantage. A developer of a third-party application, who wants to integrate analytic tools
provided by EEG Data Processor, is forced to modify his/her application. It is necessary to
implement the Web services client into this application.
Since data processing often requires signal preprocessing before using signal processing
method, the purpose of workflows is presented. According to this design we need to extend
our database and then we will develop a workflows management system and graphic user
interface for creating and running workflows via web browser.
We plan to investigate possibilities in the area of Cloud Computing. The suitable cloud should
help us to improve management of system resources. We also plan to investigate and develop
plug-in system, which allows users to add new analytic methods written in various
programming language, not only in Java.
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
7
Acknowledgment
References
Ciniburk, J., Mautner, P., Moucek, R., Rondik T., (2010).
ERP components detection using wavelet transform and matching pursuit algorithm, DCII,
Prague
ETL (2011), Data Integration Techniques
http://bi-insider.com/portfolio/data-integration-techniques-etl-and-data-federation [online]
Hibernate (2011), http://www.hibernate.org/ [online]
Hyvärinen, A., Karhunen, J., and Oja, E. (2001). Independent Component Analysis. Adaptive
and Learning Systems for Signal Processing, Communications and Control. J. Wiley.
INCF (2012), International Neuroinformatic Coordinating Facility,
http://www.incf.org/about/what-is-neuroinformacics [online]
Ingyin, O., (2006), Presentation Level Integration of Portal Personalization Architecture,
Information and Communication Technologies, 2006. ICTTA '06. 2nd, pages 564-565.
Jezek, P., Moucek, R., (2010). System for storage and management of EEG/ERP experiments
- generation of ontology. In ICEIS 2010, vol. 1 , Databases and Information Systems
Integration. Madeira: SciTePress.
Jezek, P., Moucek, R., (2013). Electroencephalography Data Processor. HEALTHINF 2013 International Conference on Health Informatics, Barcelona, pages 357-361.
Jie Liu, Er-peng Zhang, Jin-fen Xiong, and Zhi-yong Lv, (2006). Deployment of Web
Services for Enterprise Application Integration (EAI) System, Berlin.
Luck, J. Steven, 2005, An Introduction to the Event-Related Potential Technique.
Cambridge, Mass.: The MIT Press.
Vareka, L. (2012). Matching Pursuit for P300-based Brain Computer Interfaces, Prague.
Spring technology (2011), http://www.springsource.org/ [online]
Van Pelt, J. and van Horn, J. (2007). 1st incf workshop on sustainability of neuroscience
Databases. Workshop, Stockholm.
Watson, P., Jackson, T., Pitsilis, G., Gibson, F., Austin, J., Fletcher, M., Liang, B., and Lord,
P. (2007). The CARMEN Neuroscience Server. Proceedings of the UK e-Science All hands
Meeting, pages 135-141.
Zito, T., Wilbert, N., Wiskott, L., and Berkes, P. (2008). Modular Toolkit for Data Processing
(MDP): A Python Data Processing Framework. Frontiers in neuroinformatics, 2.
Download