An Introduction to National High Performance Computing Environment
Zhimin Tang, tang@ict.ac.cn
Institute of Computing Technology
Chinese Academy of Sciences
Supported by State High-Tech Program
http://www.grid.org.cn

Outline
• Electric Power Grid and Internet
• Computational Grid
• Information Grid
• Developing Information Grid in China
• Using China-built High Performance Computers and Gigabit Networks
• Applications
• Future Work

The S Curve of Technology Impact on Economy and Society
[Figure: S curves of impact over time. Labels: Internet, Web, Information Grid, ?]

Characteristics of Electric Power Grid
[Figure labels: Power Plant, Power Plant, Lighting, Office, Factory]

Electric Power Grid vs Internet
Electric Power Grid | Internet
Electric Energy | Information
Power Plant | Computing Center
Power Grid | Data Network
Power Generator | High Performance Computers, Servers
Electric power scheduling | Middleware
Water, wind, nuclear, fire (thermal), solar, etc. | Data Source
Electric appliances | Network Terminals

Electric Power Grid vs Internet
Electric Power Grid | Internet
Ubiquitous | Limited Existence
Uniform | Isolated Islands
Automatic Load Balance | No (or Limited)
Resource Sharing | Limited (Need to Specify)
Generated Automatically | Produced Manually
Pure Resource | Lots of Garbage
Intelligent | Less Intelligence

Migrate from Internet to Information Grid
• Supply computing power and organized information/knowledge to end-users
  – Just like what the power grid does
  – Users need not know where the computing power or electric power comes from
• Much work needed to fill the gap
  – Uniform Interface for Various Users
  – Uniform Representation for Resources
  – Automatic load balance and scheduling

High Performance Computing Environment
[Figure labels: Node, Node, Node, Node, Expensive Equipment, Database, Gigabit IP Network, Internet, HPC, PC, Workstation, Notebook, Mobile Phone, Telephone, TV, DVD, Digital Camera, MP3 Player, Game Player]

Current Status and Trends
• United States
  – NSF: Computational Grid
  – DOE: ASCI Grid
  – NASA: Information Power Grid (IPG)
  – DOD: Global Information Grid (GIG)
• Europe
  – Grid Forum: eGrid
  – CERN LHC Grid
• China: National HPC Environment
  – www.grid.org.cn

Grid vs Web
Compared on: Data Storage, Data Types, Data Content, Propagation, Resources
• Grid: Uniform, Merged, Single System Image
• Web: Scattered, Heterogeneous, Web Pages, Discrete Sites

Essential Problems (1)
• Useful: Access to a large amount of resources
• Ease of Use: Chinese Interface
  – Everybody can use it
• Single System Image: Location Independent
  – User Interface
  – Data, File System, Memory
  – Authentication, Attribute Checking
  – Management

Essential Problems (2)
• Wide-area Directory and Registration
• Wide-area Caching (or Buffering)
  – cache, mirror, duplication, partition
• Resource Management
  – Both wide-area and intra-node
• Grid and Node Reconfiguration
• QoS in Grid
• Wide-area Operating System

National HPC Environment
[Figure: layered architecture. Labels: Applications, Grid Applications, Programming Environment, Grid Operating System, Grid System Software, Grid Hardware, Grid Node … Grid Node, Gigabit IP Backbone, Visualization Equipment, Expensive Instrument and Equipment, Database, Information Library]

Another View of Grid
[Figure: Logical Layer with Meteorology Grid, Biology Grid, Petroleum Grid, ...; Grid System Software; Physical Layer with Grid Nodes on a Gigabit IP Network; Information Grid]

High Performance Computing Centers
[Map: centers in Beijing, Xi’an, Nanjing, Chengdu, Wuhan, Changsha, Hefei, and Shanghai, linked by a Gigabit Backbone]

High Performance Computers
• Dawning 2000/3000 SuperServer (Cluster)
  – Developed by National Research Center for Intelligent Computing Systems (NCIC)
  – Installed at 7 grid nodes, with configurations ranging from 32 to 160 CPUs and 20 to 400 Gflops
  – CPU: PowerPC or Power II
  – O/S: IBM AIX
  – Database: DB2, Oracle
  – Language: C, C++, F77, MPI, PVM, ...
High Performance Computers
• Galaxy 3 (MPP)
  – 20 Gflops
• Sunway 1 (MPP)
  – 460 Gflops
• Tsinghua Tongfang PC-Cluster
  – 16 CPUs, 8 Gflops

Gigabit Networks
• National Cable TV Backbone
  – 1 Gbps channel for intra-Grid connection
  – Available early next year
• CERNET (China Education and Research Network)
  – Currently 155 Mbps, shared media
  – Upgrade to 622 Mbps, 2.5 Gbps soon
• Other High Bandwidth Networks
  – e.g., NSFCnet in Beijing area
  – When available

Grid System Software
[Figure: Users and Applications on top; service labels: Learning, Security, User Mng., Resource, Batch Job, Directory, Management, Online Help; Common Interface Layer below, with Interface1: Sunway 1, Interface2: Galaxy 3, Interface3: Tongfang108, Interface4: Dawning 2000]

User Management

Job Management (submit)
• No. of CPUs
• Working Directory
• Program Name
• STDIN
• STDOUT
• Execution Time
• Host Type
• Host Name

Resource Management (Find a User)
Search by
• Host Name
• User Name
• User Type
• User Account
• E-mail
• Phone

Grid System Monitor

Grid Utilities
• Simulate Command Line Interface

Grid Applications
• Domestic Numerical Weather Forecast
• Petroleum Reservoir Simulation
• Numerical Wind Tunnel Simulation
• Automobile Collision Simulation
• Structural Design of Ships
• Bio-Informatics Database and App.
• National Scientific Database and App.
• Digital Library

Bio-Informatics Database
• In cooperation with Peking University
• Provide gene and protein database for 100,000 scientists and researchers in biology and related areas
  – By the end of this year, there will be more than 500 GB of data, including mirrors of international bio-databases and China’s own bio-information
• Develop applications based on the bioinformatics database

Numerical Weather Forecast
• Collective Forecast
  – Run 12 different forecasting models on the same data, and combine the results into a more convincing forecast
• Local Area Weather Forecast
  – Give a 36-hour weather forecast focused on a city or small area of about 15-45 km
• Provide remote weather forecast service
  – to provinces or cities without an HPC engine
  – become a production application

Petroleum Reservoir Simulation
• Simulate complex fluid (petroleum and natural gas) flows inside the oil field
  – large system of sparse linear equations
• Fine simulation of a large oil field involves at least 1 million mesh nodes
• Understand current status of the oil/gas field, and select the most cost-effective exploitation plan
  – In cooperation with Da Qing Oil Field

Numerical Wind Tunnel
• Solving large fluid dynamics equations
  – Millions of mesh nodes
• Simulate aerodynamic processes of space vehicles, aircraft and high speed trains
• Calculate aerodynamic force and heat
• To improve shape design, reduce resistance, and avoid surface burn

Automobile and Ship Design
• Design of Complex Structures
• Simulation of Extreme Situations
  – Automobile Collision Experiment
  – Combustion Process within an engine
  – Design of various components with different and complex shapes
• In cooperation with Industry Partners in Shanghai

Scientific Database
• More than 10 large databases
  – 1.5 terabytes of data in total
• Environment and Resource Database
• Chemistry Toxicity Database
• Animal and Plant Database
• Microorganism Database
• Science popularization database
  – www.kepu.com.cn

Digital Library
• In cooperation with Ministry of Culture
  – China’s national library and other libraries
• Develop a demonstration system
  – with HPC as Servers, and
  – Grid as backbone
• Essential Techniques
  – Character Recognition and Calibration
  – Automatic Abstracting from Text
  – Digital Copyright Protection

Future Work
• More Grid Nodes
  – 2 National Centers (1-10 Tflops)
  – 15 Local Centers (100-500 Gflops)
• More Powerful Grid Software
  – Efficient Resource Management
  – Support Grid-level Heterogeneous Computing
• More Elaborate Applications
  – Help Industry Improve Design Capability
  – Provide powerful Tools for Scientists
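
Illustration: job submission through a common interface layer

The Grid System Software slides list the fields of a job submission (No. of CPUs, Working Directory, Program Name, STDIN, STDOUT, Execution Time, Host Type, Host Name) and show a Common Interface Layer sitting above four machine-specific interfaces. The sketch below is one possible way to picture that arrangement, not the actual grid software. The class name `JobRequest`, the function `submit_job`, the adapter table, the minutes unit for execution time, and the text each adapter returns are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class JobRequest:
    """Fields copied from the Job Management (submit) slide."""
    num_cpus: int
    working_directory: str
    program_name: str
    stdin: str
    stdout: str
    execution_time_minutes: int  # assumption: the slide gives no unit
    host_type: str
    host_name: str


def _make_adapter(label: str) -> Callable[[JobRequest], str]:
    """Build a hypothetical machine-specific adapter (stand-in for Interface1..4)."""
    def submit(job: JobRequest) -> str:
        # A real adapter would talk to the machine's own batch system;
        # here it only returns a description of what would be run.
        return (f"[{label}] run {job.program_name} on {job.host_name} "
                f"with {job.num_cpus} CPUs in {job.working_directory}")
    return submit


# Host types and interface numbers follow the Grid System Software slide.
ADAPTERS: Dict[str, Callable[[JobRequest], str]] = {
    "Sunway 1": _make_adapter("Interface1"),
    "Galaxy 3": _make_adapter("Interface2"),
    "Tongfang108": _make_adapter("Interface3"),
    "Dawning 2000": _make_adapter("Interface4"),
}


def submit_job(job: JobRequest) -> str:
    """Common interface layer: validate the request, then route it by host type."""
    if job.num_cpus < 1:
        raise ValueError("a job needs at least one CPU")
    adapter = ADAPTERS.get(job.host_type)
    if adapter is None:
        raise ValueError(f"unknown host type: {job.host_type}")
    return adapter(job)


if __name__ == "__main__":
    job = JobRequest(8, "/home/demo/run1", "forecast", "input.dat",
                     "output.log", 60, "Dawning 2000", "node-beijing")
    print(submit_job(job))
```

The point of the design is that users fill in the same fields whichever machine runs the job, and only the adapter behind the common layer differs per machine.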