4/30/2014 1 University of California, San Diego • Prism@UCSD – Science DMZ – PI: P. Papadopulos, co-PI: L. Smarr – 01/01/2013 to 12/31/2014 • CHERuB – 100G campus gateway – PI: M. Norman, co-PI: T. Hutton, V. Polichar – 01/01/2014 to 12/31/2015 UCSD and its environment General Atomics Stem Cell Institute Salk Institute SDSC CalIT2 Physics NCMIR Skaggs Medical School Scripps Institute of Oceanography 4/30/2014 In addition to the 3 main UCSD units: - General Campus - Medical School - Scripps I.O. there are many other research organizations on and around campus. Venter Institute 3 Connecting YOU on UCSD Campus with the World By Creating a Big Data Freeway System CHERuB NSF CC-NIE Has Awarded Prism@UCSD Optical Switch Phil Papadopoulos, SDSC, Calit2, PI Prism@UCSD: A Researcher Defined 10 and 40Gbit/s Campus Scale Data Carrier Project in Brief • high-bandwidth end-to-end optical connections • routed by next generation Arista switches (7504) • connects lab “data producers” with SDSC data-intensive computing & storage resources • 10 Terabit/s of aggregate bandwidth, has full bisection similar to inmachine room clusters, but is deployed at a campus scale • builds upon and upgrades the Quartzite "campus-scale network laboratory" NSF MRI (awarded 2006) • adds IPv6 and OpenFlow • existing optical fiber connection to the SDSC is being expanded to 120Gbps as a high-bandwidth bridge to cloud/parallel storage and NSF XSEDE resources PRISM Puts SDSC’s Big Data Gordon Supercomputer and Data Oasis Storage Into Your Lab 12 PRISM is Connecting CERN’s CMS Experiment To Our Physics Department 80 Gbps PRISM Connection Has Been Made UCSD is a Tier-2 LHC Data Center: CMS Flow into UCSD Physics Dept. Peaks at 2.4 Gbps Source: Frank Wuerthwein, Physics UCSD Planning for climate change in California substantial shifts on top of already high climate variability SIO Campus Climate Researchers Need to Download Results from Remote Supercomputer Simulations to Make Regional Climate Change Forecasts Dan Cayan USGS Water Resources Discipline Scripps Institution of Oceanography, UC San Diego much support from Mary Tyree, Mike Dettinger, Guido Franco and other colleagues Sponsors: California Energy Commission NOAA RISA program California DWR, DOE, NSF average average summer summer afternoon afternoon temperature temperature GFDL A2 1km downscaled to 1km Hugo Hidalgo Tapash Das Mike Dettinger 10 Ultra High Resolution Microscopy Images Created at the National Center for Microscopy Imaging NIH National Center for Microscopy & Imaging Research Integrated Infrastructure of Shared Resources Shared Infrastructure Scientific Instruments Local SOM Infrastructure End User FIONA Workstation Source: Steve Peltier, Mark Ellisman, NCMIR PRISM Links Calit2’s VROOM to NCMIR to Explore Confocal Light Microscope Images of Rat Brains Protein Data Bank (PDB) Needs Bandwidth to Connect Resources and Users • Archive of experimentally determined 3D structures of proteins, nucleic acids, complex assemblies • One of the largest scientific resources in life sciences Virus Hemoglobin Source: Phil Bourne and Andreas Prlić, PDB PDB Usage Is Growing Over Time • • • • More than 300,000 Unique Visitors per Month Up to 300 Concurrent Users ~10 Structures are Downloaded per Second 7/24/365 Increasingly Popular Web Services Traffic Source: Phil Bourne and Andreas Prlić, PDB 2010 FTP Traffic RCSB PDB PDBe PDBj 159 million entry downloads 34 million entry downloads 16 million entry downloads Source: Phil Bourne and Andreas Prlić, PDB PDB Plans to Establish Global Load Balancing • Why is it Important? – Enables PDB to Better Serve Its Users by Providing Increased Reliability and Quicker Results • How Will it be Done? – By More Evenly Allocating PDB Resources at Rutgers and UCSD – By Directing Users to the Closest Site • Need High Bandwidth Between Rutgers & UCSD Facilities Source: Phil Bourne and Andreas Prlić, PDB PRISM Will Link Computational Mass Spectrometry and Genome Sequencing Cores to the Big Data Freeway Source: proteomics.ucsd.edu ProteoSAFe: Compute-intensive discovery MS at the click of a button MassIVE: repository and identification platform for all MS data in the world http://cherub.ucsd.edu SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO CHERuB*: SDSC-ACT partner to bring 100Gbps connectivity to UCSD Production late 2014 UWisc Madison - OSG FNAL Tier-1 LHC UNL - OSG LBL - CMMAP NERSC POLARBEAR, CAIDA NICS CMMAP UCSB UCR UCSD/SDSC New 100G path Austin/TACC Pink line – New CENIC 100G Blue lines – Existing/planned ANI 100G Green lines – Existing PacWave 100G Maroon lines – XSEDE 10G network Thin lines – Other existing 10G or lower SAN DIEGO SUPERCOMPUTER CENTER the UNIVERSITY OF CALIFORNIA; SAN DIEGO *Configurable, High-speed,atExtensible Research Bandwidth The Plumbing (ask Tom Hutton) 818 W. 7th, Los Angeles, CA 10100 Hopkins Drive, La Jolla, CA SDSC NAP Equinix/L3/CENIC POP DWDM 100G transponders existing CENIC fiber up to 3 add'l 100G transponders can be attached DWDM 100G transponders Nx10G up to 3 add'l 100G transponders can be attached 100G Existing ESnet SD router UCSD/SDSC Gateway Juniper MX960 "MX0" New 2x100G/8x10G line card + optics New 40G line card + optics SDSC Juniper MX960 "Medusa" PacWave, CENIC, Internet2, NLR, ESnet, StarLight, XSEDE & other R&E networks 100G Dual Arista 7508 "Oasis" 256x10G New 100G card/ optics 2x40G UCSD DYNES 4x10G add'l 10G card/optics Other SDSC resources mult. 40G connections UCSD Primary Node Cisco 6509 "Node B" DataOasis/ 128x10G Pink/black SDSC Cloud existing UCSD infrastructure GORDON SAN DIEGO SUPERCOMPUTER CENTER compute Green/dashed lines cluster SDSC new component/ DYNES equipment in proposal mult. 10G connections UCSD Production users PRISM@UCSD Arista 7504 mult. 40G+ connections Key: NEW 10G UCSD/SDSC Cisco 6509 100G to CENIC/ PacWave switch L2 UCSD 10G PRISM@UCSD - many UCSD big data users at the UNIVERSITY OF CALIFORNIA; SAN DIEGO CENIC/ESnet 100G Connection enables Big Data science collaborations between NERSC and SDSC UWisc Madison - OSG FNAL Tier-1 LHC UNL - OSG LBL - CMMAP NERSC POLARBEAR, CAIDA NICS CMMAP UCSB UCR UCSD/SDSC New 100G path Austin/TACC Pink line – New CENIC 100G Blue lines – Existing/planned ANI 100G Green lines – Existing PacWave 100G Maroon lines – XSEDE 10G network Thin lines – Other existing 10G or lower SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO A Unique, Powerful, Data-Intensive Testbed for Scientific Discovery EDISON HPC SYSTEM 2 PF, 434 TB RAM GORDON HPD SYSTEM 0.3 PF, 364 TB RAM+SSD 150 GB/s 100 GB/s ESnet/CENIC 6 PB DTN DTN 4.5 PB 100 Gb/s SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO POLARBEAR Cosmology Telescope UC Berkeley/NERSC-UCSD/SDSC • Goal: Measure B-mode polarization in the CMB from inflation era • Data path: Chile (obs)UCB/NERSC (analysis)UCSD/SDSC (analysis) • Data acquisition rates: • 22 GB/mo. (current) • 3 TB/mo. (2014-2016) • Map making data analysis NERSC & SDSC • 100 MC realizations of 100 TB data = 10 PB Atacama Desert, Chile SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO Next Generation Network Measurement CAIDA (SDSC)-NERSC • CAIDA operates the UCSD Network Telescope, which collects Internet Background Radiation • Data paths: global internet, ESnet • Data rates: 3-4 TB/mo • Using NERSC tape archive to replicate 100 TB historical data • Other projects: network measurement tools, Future Internet Architecture unassigned IPv4 addresses 100’s TB archival data SDSC/NERSC SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO High Energy Physics LHC/US-CMS UCSD Tier-2—US-CMS collaboration • Goals: Higgs boson, supersymmetry, BSM • Data Paths: CERNFNAL (Tier 1)-UCSD (Tier 2) via ESnet and CENIC/I2 • Peak Bandwidths: • Current: 10+5 Gbps • 2015: 40 Gbps when LHC operates @ 14 Tev SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO Education & Training: UCSD Telemedicine Center SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO CHERuB Implementation Status • January, 2014 • Project funded, equipment on order • February, 2014 • Equipment received • Production network switch upgraded • March, 2014 • Campus gateway upgraded, connected to regional 100G feed • Successful border-to-regional test @100Gbps • Next steps (April/May): • Connect Prism switch, test @2x40Gbps • Connect SDSC infrastructure, test @100Gbps • Connect production switch, test @4x10Gbps • Production Goal: September 2014 SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO Comet is a ~2000TeraFLOP System Architected for the “Long Tail of Science” NSF Track 2 award to SDSC $12M NSF award to acquire $3M/yr x 4 yrs to operate Production early 2015