KCAF for CDF - The Center for High Energy Physics

The International Workshop on HEP Data Grid
A Design for KCAF
for CDF Experiment
Kihyeon Cho
(CHEP, Kyungpook National University)
and
Jysoo Lee
(KISTI, Supercomputing Center)
November 8-9, 2002
Contents
 CHEP and Fermilab CDF
 CDF Data/Analysis Flow
 CDF Grid Road Map (bottom-up method)
□ Step 1. CAF (Central Analysis Farm)
□ Step 2. DCAF (DeCentralized Analysis Farm)
□ Step 3. Grid
 KCAF (DeCentralized Analysis Farm in Korea)
 Future Plan
CENTER FOR HIGH ENERGY PHYSICS
KIHYEON CHO
CHEP and Fermilab CDF Grid
[Figure labels: Fermilab in USA; KCAF (DCAF in Korea); Korea; CHEP]
Network between CHEP and overseas
[Figure: network diagram. Recoverable labels:]
– Overseas sites: FNAL, CERN, KEK
– Wide-area links: TEIN 10 Mbps; APII 45 Mbps; KOREN; APII 2 x 1 Gbps
– Local network: C6509; L3 switch (group 3); Gigabit switch (CHEP); IBM 8271; hub; Gigabit Ethernet / Fast Ethernet
– Storage and servers: disk storage (NAS) 100 GB; tape library (0.6 TB); DB server; servers; Linux clusters; cluster servers; PCs
Overview of Fermilab
[Figure: the Fermilab accelerator complex. Labels: CDF; D0; fixed-target experiment; Booster; p source; Main Injector and Recycler]
CDF Collaboration
[Figure: map of collaborating institutions by region (North America, Europe, Asia). The per-country counts survive without their country labels: 3 Natl. Labs and 25 Universities; 1 Research Lab and 6 Universities; 1 University; 2 Universities; 1 Research Lab and 4 Universities; 1 University; 4 Universities; 2 Research Labs; 1 University; 1 University]
Totals
 11 countries
 52 institutions
 525 physicists
KOREA
Center for High Energy Physics:
– Kyungpook National University
– Seoul National University
– SungKyunKwan University
Production Farm
CDF Computer Resource CAF Stage 1
FCC (Feynman Computing Center)
 Current configuration (Stage 1):
– 63 dual worker nodes.
– Roughly 160 GHz total, compared to 38 GHz for fcdfsgi2 and 8 GHz for cdfsga (Run I).
– 7 fileservers for physics (~50% full), 5 for DH, 2 for development, 2 for user scratch.
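The Stage 1 figures can be put side by side with a few lines of arithmetic. The sketch below is illustrative only: it uses the numbers quoted on the slide, and the function names are hypothetical. It assumes "dual" means two CPUs per worker node and that aggregate GHz is a fair proxy for capacity.

```python
# Figures quoted on the slide for CAF Stage 1.
WORKER_NODES = 63
CPUS_PER_NODE = 2          # ASSUMPTION: "dual" means two CPUs per node
CAF_TOTAL_GHZ = 160.0
FCDFSGI2_GHZ = 38.0
CDFSGA_GHZ = 8.0           # Run I machine


def per_cpu_ghz(total_ghz, nodes, cpus_per_node):
    """Average clock rate per CPU implied by the aggregate figure."""
    return total_ghz / (nodes * cpus_per_node)


def capacity_ratio(new_ghz, old_ghz):
    """How many times larger the new aggregate clock rate is."""
    return new_ghz / old_ghz


print(f"average per CPU: {per_cpu_ghz(CAF_TOTAL_GHZ, WORKER_NODES, CPUS_PER_NODE):.2f} GHz")
print(f"CAF vs fcdfsgi2: {capacity_ratio(CAF_TOTAL_GHZ, FCDFSGI2_GHZ):.1f}x")
print(f"CAF vs cdfsga:   {capacity_ratio(CAF_TOTAL_GHZ, CDFSGA_GHZ):.1f}x")
```

Under these assumptions the farm averages a little under 1.3 GHz per CPU and offers roughly 4 times the aggregate clock rate of fcdfsgi2 and 20 times that of cdfsga.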
CAF Design
 CAF design considerations:
– Submit jobs from 'anywhere'
– Job output can be sent directly to the desktop, or stored on CAF for later retrieval or as input to a subsequent job
DCAF/Grid at CDF Experiment
 Requirement in 2005
– 200 simultaneous users will analyze 10^7 events of a secondary data set in a day.
– Need ~700 TB of disk and ~5 THz of CPU by the end of FY'05
 The current CAF (Central Analysis Farm) is not enough:
– Limited resources and space at FCC
– A network problem at FCC would put all analysis at risk.
 We need DCAF (DeCentralized Analysis Farm) and Grid
 DCAF (DeCentralized Analysis Farm)
– Serves users in a region and/or around the world
– Korea, Toronto, Karlsruhe, …
– We call the DCAF in Korea "KCAF"
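To get a feel for the 2005 requirement, the stated numbers can be turned into a sustained event rate and a CPU budget per event. This is a rough sketch, not a figure from the talk. The slide does not say whether the 10^7 events are per user or in total, so the code assumes per user and keeps that choice as a parameter. The function names are hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def events_per_second(users, events_per_user_per_day):
    """Aggregate event rate the farm must sustain over a full day."""
    return users * events_per_user_per_day / SECONDS_PER_DAY


def cycles_per_event(total_cpu_hz, rate_events_per_s):
    """CPU cycles available per event if the whole capacity is kept busy."""
    return total_cpu_hz / rate_events_per_s


# Slide figures: 200 users, 10^7 events in a day, ~5 THz of CPU.
# ASSUMPTION: each user processes 10^7 events per day.
rate = events_per_second(200, 1e7)
budget = cycles_per_event(5e12, rate)
print(f"{rate:,.0f} events/s, {budget:.2e} cycles per event")
```

Under the per-user reading this is about 23,000 events per second across the farm, with a budget on the order of 2 x 10^8 CPU cycles per event. If the 10^7 events were a total across all users, the rate would be 200 times lower.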
DCAF for CDF Experiment
[Figure labels:]
– APII (Asia Pacific Information Infrastructure)
– CAF in USA
– DCAF in Toronto
– DCAF in Karlsruhe
– KCAF (DeCentralized Analysis Farm in Korea): Center for HEP (CHEP), Kyungpook National U., Daegu, Korea
Proposed Schemes for Final Goal
[Figure label: CAF or DCAF in Korea (KCAF)]
The Road Map for Goal
 Step 1. Build an MC production farm using KCAF
– First, we are constructing a 12-CPU test bed for KCAF.
– Once CHEP has set its internal policy (another test bed is needed for EDG and iVDGL), we will decide how many of this year's planned 140 CPUs will go to the actual MC generation farm.
 Step 2. Handle real data
– Extend KCAF into a real data handling system using SAM (Sequential data Access via Meta-data), GridFTP, etc., once the real data handling system has settled down.
 Step 3. Final goal of CDF Grid
– Gridification of KCAF in connection with EDG, iVDGL and CDF Grid
The Road Map in Detail
1. Software
• CDF Software
• FBSNG for batch jobs
2. Kerberos
• Client: login from Korea to Fermilab
• Server: login from Fermilab to Korea
• KDC: Client & Server Trust
3. Data Handling System
• Glasgow, Italy and Yale use the Data Handling System.
• A SAM station needs to be installed on 1 PC worker node.
4. OS: Fermilab Red Hat Linux 7.3 with kernel 2.4.18
5. Fundamental license problem for DB
A Design of KCAF for CDF (Fermilab) Grid
[Figure: KCAF design diagram. Recoverable labels, in source order:]
– FCC (CAF): STKen; fcdfsam; stager; dCache; Calibration Data; CDFen (Raw Data); rcp; dCache; Technical Request; 1 TB buffer (FCC)
– Transfer tools between FCC and CHEP: rcp, GridFTP, bbftp
– CHEP: User Desktop (KCAF); KCAF Head Node; CAFGUI; KCAF Cluster; stager; rcp; ICAFGUI; cp; SAM Station (KCAF) with Smaster, FSS, Stager; Remote Desktop; ICAF FTP server
Future Plan (This year)
 Linux clusters with about 140 CPUs will be built at CHEP by the end of this year.
 We are constructing KCAF for CDF at CHEP, Kyungpook National University.
 We will contribute 1 TB of hard disk (now) + 4 TB (December) to CDF as a network buffer between CHEP and Fermilab.
 The DCAF demonstration will be shown at SC2002 on November 16, 2002.
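As a rough sense of scale for the network buffer, the sketch below estimates how long it would take to move a full buffer across a link. It is illustrative only. The 10 Mbps and 45 Mbps rates are the link labels from the network slide; the talk does not state which link carries CHEP to Fermilab traffic. The code assumes decimal units (1 TB = 10^12 bytes) and ignores protocol overhead and link sharing, and the function name is hypothetical.

```python
def transfer_days(size_tb, link_mbps):
    """Days needed to move size_tb terabytes over a link_mbps link.

    ASSUMPTIONS: decimal units (1 TB = 1e12 bytes, 1 Mbps = 1e6 bit/s),
    the link is fully available, and protocol overhead is ignored.
    """
    bits = size_tb * 1e12 * 8
    seconds = bits / (link_mbps * 1e6)
    return seconds / 86400


# Buffer sizes from the plan: 1 TB now, 5 TB after the December addition.
for size_tb in (1, 5):
    for mbps in (10, 45):
        print(f"{size_tb} TB at {mbps} Mbps: {transfer_days(size_tb, mbps):.1f} days")
```

Even under these optimistic assumptions, filling or draining 1 TB takes about two days at 45 Mbps.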