iRODS performance test and SRB system at KEK Yoshimi Iida @ KEK

Building data grids with iRODS
27 May 2008
Outline

Performance measurement
  Transfer test between CC-IN2P3 and KEK
  Scaling test for ICAT
  Concurrent test for ICAT
  Comparison of iRODS and SRB
SRB/iRODS system at KEK

Transfer between CC-IN2P3 and KEK

1 GB data transfer between CC-IN2P3 (FR) and KEK (JP)
Comparison of iRODS and bbcp
The network route goes through the USA (NY)
The RTT is about 285 ms

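With an RTT of about 285 ms, a single TCP stream can send at most one window of data per round trip, so the window size puts a ceiling on throughput. The following is a minimal sketch of that arithmetic, not part of the original measurements. The 4 MB window and 16 parallel streams are the values quoted for the tests in this talk; the function name is illustrative, "4 MB" is assumed to mean 4 MiB, and the estimate ignores packet loss, slow start and the real link capacity.

```python
def window_limited_throughput(window_bytes: float, rtt_seconds: float, streams: int = 1) -> float:
    """Upper bound in bytes/sec: each stream sends at most one window per round trip."""
    return streams * window_bytes / rtt_seconds


RTT = 0.285                 # seconds, as quoted on the slide
WINDOW = 4 * 1024 * 1024    # 4 MB window, assumed here to mean 4 MiB

per_stream = window_limited_throughput(WINDOW, RTT)
aggregate = window_limited_throughput(WINDOW, RTT, streams=16)

print(f"one stream : {per_stream / 1e6:.1f} MB/s")
print(f"16 streams : {aggregate / 1e6:.1f} MB/s")
```

Under these assumptions one stream is capped at roughly 15 MB/s, which is why parallel streams matter on such a long route.
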
System used

iRODS system at CC-IN2P3
  ICAT-enabled iRODS server on Solaris 10
    Thumpers Sun x4500 (AMD processors)
    Oracle 10g runs on a cluster of dedicated machines
    Linux file resource on a local disk system
    iRODS 0.9
iRODS system at KEK
  iRODS server on RHEL3
    Intel Xeon 3.0 GHz ×4
    Linux file resource on a local disk system
    iRODS 0.9

From KEK to CC-IN2P3

1 GB data transfer, measured over 24 hours
[Figure: two panels, iput and bbcp; vertical axis: Speed [MB/sec], 0 to 50]
Settings: window size 4 MB, 16 parallel streams
bbcp often fails to connect

From CC-IN2P3 to KEK

1 GB data transfer, measured over 12 hours
[Figure: two panels, iput and bbcp; vertical axis: Speed [MB/sec], 0 to 40]
Settings: window size 4 MB, 16 parallel streams
iput is better than bbcp

Scaling test

Data
  Using the same directory on the client machine
  1000 files of 1000 bytes each
Measurement
  Ingesting the directory and listing the collection
  Performance measured for every directory operation
    1 collection and 1000 files

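The procedure above can be sketched in Python as follows. This is an illustration under stated assumptions, not the script used for the talk: the file names, the `coll-N` collection names and the helper functions are hypothetical, and `scaling_run` assumes the `iput` and `ils` icommands are on the PATH with an initialised client session.

```python
import os
import subprocess
import tempfile
import time


def make_test_directory(path: str, n_files: int = 1000, size: int = 1000) -> None:
    """Create n_files files of `size` bytes each (the contents are arbitrary)."""
    os.makedirs(path, exist_ok=True)
    for i in range(n_files):
        with open(os.path.join(path, f"file{i:04d}.dat"), "wb") as f:
            f.write(b"x" * size)


def timed(cmd: list) -> float:
    """Run a command and return its wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True, stdout=subprocess.DEVNULL)
    return time.perf_counter() - start


def scaling_run(local_dir: str, n_collections: int) -> list:
    """Ingest the same directory repeatedly, timing each iput and the ils that follows."""
    results = []
    for k in range(n_collections):
        coll = f"coll-{k}"                            # hypothetical collection name
        t_put = timed(["iput", "-r", local_dir, coll])  # recursive ingest of the directory
        t_ls = timed(["ils", coll])                     # list the new collection
        results.append((k, t_put, t_ls))
    return results


# Build the 1000 x 1000-byte test directory; no iRODS command is run here.
workdir = tempfile.mkdtemp()
test_dir = os.path.join(workdir, "testdata")
make_test_directory(test_dir)
```

Calling `scaling_run(test_dir, 1000)` would then ingest 1000 collections of 1000 files, that is 1 million files, and return one (index, iput time, ils time) tuple per directory operation.
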
System used

ICAT based on Oracle: system at CC-IN2P3
  ICAT-enabled iRODS server on Solaris 10
    Thumpers Sun x4500 (AMD processors)
    Oracle 10g runs on a cluster of dedicated machines
  iRODS resource and client on SL4
    Dual AMD Opteron Processor 848
    Linux file resource on a local disk system
ICAT based on PostgreSQL: system at KEK
  ICAT-enabled iRODS server on RHEL3
    Dual Intel Xeon 2.8 GHz
    PostgreSQL 8.2.5 runs on the same machine
  iRODS resource and client on RHEL3
    Dual Intel Xeon 2.8 GHz
    Linux file resource on a local disk system

Ingesting up to 1 million files

[Figure: four panels: iput - Oracle, iput - PostgreSQL, ils - Oracle, ils - PostgreSQL; horizontal axis: number of files [x 1,000], 0 to 1000; vertical axis: time in seconds, 0 to 100 for iput and 0 to 5 for ils]
Annotation on the plots: running the other process

Nested collection test

Data
  Registering the same directory on the client machine
  100 files of 100 bytes each
Measurement
  Ingesting the directory and listing the collection
  Making a nested collection every 10 collections
  Performance measured for every directory operation
    1 collection and 100 files

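One way to read "a nested collection every 10 collections" is that the target path gains one extra level for every 10 directories ingested, while the flat case keeps every collection under the same parent. The sketch below generates target paths under that reading. The `nest-N` and `coll-N` names and the base path are assumptions made for illustration; the actual layout used in the tests is not given on the slide.

```python
def collection_path(base: str, index: int, per_level: int = 10, nested: bool = True) -> str:
    """Return the target collection for the index-th ingested directory.

    flat  : every collection sits directly under `base`
    nested: one more nesting level is added for every `per_level` collections
    """
    depth = index // per_level if nested else 0
    parts = [base] + [f"nest-{d}" for d in range(depth)] + [f"coll-{index}"]
    return "/".join(parts)


base = "/tempZone/home/rods"   # hypothetical base collection

print(collection_path(base, 3))
print(collection_path(base, 25))
print(collection_path(base, 25, nested=False))

# 200,000 files at 100 files per collection means 2000 collections.
deepest = collection_path(base, 1999)
print(deepest.count("/nest-"), "nesting levels,", len(deepest), "characters")
```

In this layout the path length grows steadily with the number of ingested files, so a nested run can run into a path-length limit that a flat run never approaches.
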
Nested collection test

[Figure: four panels: iput - Oracle, ireg - PostgreSQL, ils - Oracle, ils - PostgreSQL; each compares nest and flat; horizontal axis: number of files [x 1,000], 0 to 200; vertical axis: time in seconds]
Errors noted on the plots:
  coll-1220: USER_PATH_EXCEEDS_MAX
  nest-121: OCI_ERROR

Concurrent test

Data
  Ingesting the same directory on the client machine
  1000 files of 1000 bytes each
Measurement
  Running multiple processes at the same time
    read operation: ils and iget
    write operation: iput and ireg
    mixed operation: iput, ireg, ils and iget
  Performance measured for every directory operation
DB setting
  Set the maximum number of connections to 200

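Running many client processes at the same time and timing each one can be sketched with a thread pool that launches one subprocess per client. This is an illustrative harness, not the one used for the talk: the function names are hypothetical, and a harmless placeholder command stands in for the icommands so that the sketch runs without an iRODS installation.

```python
import subprocess
import sys
import time
from concurrent.futures import ThreadPoolExecutor


def run_client(cmd: list) -> float:
    """Run one client command and return its wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=False,
                   stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    return time.perf_counter() - start


def run_concurrently(cmd: list, n_clients: int) -> list:
    """Start n_clients copies of cmd at roughly the same time and time each one."""
    with ThreadPoolExecutor(max_workers=n_clients) as pool:
        return list(pool.map(run_client, [cmd] * n_clients))


# Placeholder command. For a read test one would pass something like
# ["ils", "coll-0"], where "coll-0" is a hypothetical collection name.
placeholder = [sys.executable, "-c", "pass"]
times = run_concurrently(placeholder, n_clients=10)
print(f"{len(times)} clients, slowest took {max(times):.3f} s")
```

Each thread only waits on its own subprocess, so the clients genuinely overlap; repeating the call with 10, 100 and 200 clients gives one timing distribution per client count.
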
Concurrent test for reading metadata

[Figure: four panels: ils - PostgreSQL ICAT, ils - Oracle ICAT, iget - PostgreSQL ICAT, iget - Oracle ICAT; horizontal axis: Number of clients, 0 to 300; vertical axis: Time in seconds, logarithmic; series: 10, 100, 200 and 300 clients]
Error noted on the plots: connectToRhost failed

Concurrent test for writing metadata

[Figure: panels: iput - PostgreSQL ICAT, ireg - PostgreSQL ICAT, ireg - Oracle ICAT; horizontal axis: Number of clients, 0 to 200; vertical axis: Time in seconds, logarithmic; series: 10, 100 and 200 clients]
iput - Oracle ICAT: because of the limit on inodes, we could not put any more files

Concurrent test for mix - PostgreSQL

[Figure: four panels: iput, ils, ireg, iget; horizontal axis: Number of clients, 0 to 200; vertical axis: Time in seconds, logarithmic; series: 10, 100 and 200 clients]

Concurrent test for mix - Oracle

[Figure: panels: ils, ireg, iget; horizontal axis: Number of clients, 0 to 200; vertical axis: Time in seconds, logarithmic; series: 10, 100 and 200 clients]
iput - Oracle ICAT: because of the limit on inodes, we could not put any more files

Comparison of iRODS and SRB

For the scaling test
  Data
    Using the same directory on the client machine
    1000 files of 1000 bytes each
  Measurement
    Ingesting the directory and listing the collection
    Performance measured for every directory operation
For the nested collection test
  Data
    Registering the same directory on the client machine
    100 files of 100 bytes each
  Measurement
    Ingesting the directory and listing the collection
    Making a nested collection every 10 collections
    Performance measured for every directory operation

System used

iRODS system at KEK
  ICAT-enabled iRODS server on RHEL3
    Dual Intel Xeon 2.8 GHz
    PostgreSQL 8.2.5 runs on the same machine
    iRODS 1.0
  iRODS resource and client on RHEL3
    Dual Intel Xeon 2.8 GHz
    Linux file resource on a local disk system
SRB system at KEK
  MCAT-enabled SRB server on RHEL3
    Dual Intel Xeon 2.8 GHz
    PostgreSQL 8.2.5 runs on the same machine
    SRB 3.5.0
  SRB resource and client on RHEL3
    Dual Intel Xeon 2.8 GHz
    Linux file resource on a local disk system

Scaling test - iRODS and SRB

[Figure: four panels: iput, Sput, ils, Sls; horizontal axis: number of files [x 1,000], 0 to 1000 for the iRODS panels and 0 to 50 for the SRB panels; vertical axis: time in seconds]

Nested collection test - iRODS and SRB

[Figure: four panels: ireg, Sregister, ils, Sls; each compares nest and flat; horizontal axis: number of files [x 1,000], 0 to 200 for the iRODS panels and 0 to 50 for the SRB panels; vertical axis: time in seconds]
Errors noted on the plots:
  coll-1220: USER_PATH_EXCEEDS_MAX
  nest-47: Error Problem running command

SRB system for Belle at KEK

[Diagram: network layout of the SRB system. Labels: LCG user, Melbourne, KU, CYFRONET, ASGC, NCU, Nagoya, APAN, GEANT2, SINET, KEK-DMZ, KEK FW, KEK-LAN, KEK-1, KEK-2, Belle Net, GridFTP, SRB, SRB-DSI, dedicated inside, Pluggable Extension, Enhanced GridFTP service, SRB server, SRB client, MCAT, NFS, HSM 3.5 PB, LSF Computing Farm, Belle analysis user]
Both protocols are authorized by GSI
Setting up on the GridFTP server
  grid-mapfile for SRB users
  SRB configuration file for server and resource
  Register LCG user DN on MCAT
Not yet integrated with the Grid

Plan for iRODS system at KEK

Data transfer for the J-PARC project
  A huge amount of imaging data will be generated at Tokai
  The data is stored at Tokai first, then copied to KEK and distributed to collaborators
  About 1 PB of data per year in total
  Storage at Tokai is recycled
[Diagram: Tokai and KEK, 60 km apart]
Bandwidth between the 2 sites will be 10 Gbps

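The two figures on this slide, about 1 PB per year and a 10 Gbps link, can be related with a short calculation. The sketch below is only an average-rate estimate under simple assumptions: 1 PB is taken as 10^15 bytes, data is assumed to arrive evenly over a 365-day year, and protocol overhead is ignored.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600


def average_rate_bytes_per_sec(bytes_per_year: float) -> float:
    """Sustained rate needed to move a yearly volume with no backlog."""
    return bytes_per_year / SECONDS_PER_YEAR


yearly_volume = 1e15        # 1 PB, taken as 10**15 bytes
link_bits_per_sec = 10e9    # 10 Gbps planned between Tokai and KEK

rate = average_rate_bytes_per_sec(yearly_volume)
link_share = rate * 8 / link_bits_per_sec

print(f"average rate : {rate / 1e6:.1f} MB/s")
print(f"share of link: {link_share:.1%}")
```

On average this is roughly 32 MB/s, about 2.5% of the planned link. Data taking is unlikely to be evenly spread, so peak rates would be higher than this average.
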
Acknowledgements

Special thanks to Jean-Yves Nief from CC-IN2P3 for his help in setting up the iRODS system and for his support at CC-IN2P3
Thanks to Adil Hasan from RAL for his help with these tests

Back up
Bandwidth Lyon-KEK

[Figure: Speed [MBytes/sec], 0 to 120, versus Time [min], 0 to 60; series: Lyon to KEK, KEK to Lyon]
iperf with some options:
  -w 4M : TCP window size [Bytes]
  -P 16 : the number of parallel threads
  -i 5 : periodic bandwidth reports [sec]

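For reference, the options listed above can be assembled into a client command line as in the sketch below. Only `-w`, `-P` and `-i` come from the slide. The `-c` (client mode) and `-t` (duration in seconds) flags are standard classic iperf options added here as assumptions, the server name is a placeholder, and the one-hour duration is merely suggested by the 60-minute time axis of the plot. The command is built but not executed.

```python
def iperf_client_command(server: str, window: str = "4M", streams: int = 16,
                         interval: int = 5, duration=None) -> list:
    """Build an iperf client command line from the options quoted on the slide."""
    cmd = ["iperf", "-c", server,     # -c: run as client against `server` (assumed)
           "-w", window,              # TCP window size
           "-P", str(streams),        # number of parallel threads
           "-i", str(interval)]       # seconds between periodic reports
    if duration is not None:
        cmd += ["-t", str(duration)]  # -t: test length in seconds (assumed)
    return cmd


# "iperf.example.org" is a placeholder host name.
cmd = iperf_client_command("iperf.example.org", duration=3600)
print(" ".join(cmd))
```

The returned list can be passed to `subprocess.run` on a machine where iperf is installed and a matching server is listening.
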
Summary of iRODS Performance

Transfer from KEK to CC-IN2P3 is not stable, but iput is better than simple transfer software (bbcp)
iRODS can stably manage 1 million files
With very deeply nested collections, the PostgreSQL ICAT takes a long time to register data
Better performance than SRB
Oracle ICAT can handle more than 300 clients at the same time, but the time taken grows as the number of clients increases