High Performance Computing Environment (Grid) in China Qian Depei Xian Jiaotong University Email: depeiq@xjtu.edu.cn Outline Purpose of Building a Grid Grid vs Internet and Web Overview of China’s Grid Project Major issues in Developing a Grid Current Status of the Project Future Work Purpose of Building a Grid Needs for greater computing capability Science & Engineering computation High performance Information services Social and economical development Research & education Purpose of Building a Grid (con.) Needs for resource sharing Exchange and sharing of Data, information, and knowledge Sharing Expensive instruments Purpose of Building a Grid (con.) Needs for collaborated work Concurrent engineering Distributed simulation Purpose of Building a Grid (con.) Utilizing idle or underused computing power Uneven distributed computing resources Fluctuation of computer usage Grid: the next wave the first wave (1960-present) the second wave (1980-present) Internet Web Interconnection of computers telnet://… ftp://… mailto://… Interconnection of the third wave (1993- ) Grid Interconnection of Web pages resources http://www… grid://… Resoures: Computing Storage Information Knowledge Grid vs WEB Grid System Resources Sharing and Collaboration Information & Knowledge Aquisition Web Single Image Discrete Sites System Enabled Explicit User Efforts Certain Intelligence Mainly Manual Overview of China’s Grid Project supported by the National High-Tech R&D Program (the 863 program) from 1999 to 2000 about 20 sub-projects were supported four major tasks Develop a super server for major grid sites Establish several high performance computing centers across the country to form the grid infrastructure Develop a system software for the Grid system Develop demo applications on the Grid Major issues in developing a Grid Heterogeneity Scalability Dynamic adaptation Security Ease of use Key techniques Single System Image Uniform resource management Uniform User management Efficient task management and scheduling Security Authentication Authorization Encryption Secure node operation system Key techniques (con.) User-friendly interface GUI Web-based technology Chinese language support Achievements of the Project Developd the Dawning 3000 super server 280 processors/70 nodes, 160GB memory, 3TB disks Worm-hole and SAN interconnection network 400Gflops peak performance O/S: IBM AIX Database: DB2, Oracle Language: C, C++, F77, MPI, PVM, ... Achievements of the Project (con.) Established 9 high performance computing centers across the country Interconnected the centers with available networks (Cernet & CSTNet) Equipped the nodes with Dawning 2000/3000 Galaxy 3, 20Gflops Sunway 1, 460Gflops Tsinghua Tongfang PC-Cluster, 8Gflops High Performance Computing Centers Beijing Gigabit Backbone Gigabit Backbone Xi’an Nanjing Chengdu Wuhan Changsha Hefei Shanghai Achievements of the Project (con.) Developed a system software for the Grid A function layer above the node operating system Managing the resources, users, and tasks Authentication & Authorization Hardware enhanced encryption Web-based user interface technology Grid System Software Users and Applications Learning Security User Mng. Resource Batch Job Directory Management Online Help Common Interface Layer Interface1 SunWay 1 Interface2 Galaxy 3 Interface3 Tongfang108 Interface4 Dawning 2000 User Management Job Management (submit) No. of CPUs Working Directory Program Name STDIN STDOUT Execution Time Host Type Host Name Resource Management (Find a User) Search by •Host Name •User Name •User Type •User Account •E-mail •Phone Grid System Monitor Grid Utilities Simulate Command Line Interface Achievements of the Project (con.) Developed Several applications on the Grid Weather forecasting Petroleum Reservoir simulation Bio-information database and appl. Numerical wind tunnel simulation Automobile Collision Simulation Ship structure analysis National scientific databases add appl. Digital library Bio-Informatics Database Provide gene and protein database for 100,000 scientists and researchers in biology and related areas more than 500GB, including mirrors of international bio-database and China’s own bio-information Develop applications based on bioinformatics database Numerical Weather Forecast Collective Forecast Local Area Weather Forecast Run 12 different forecasting Models on the same data, and summarize to get a more convinced result Give 36 hours’ weather forecast, focusing on a city or small area about 15-45KM Provide remote weather forecast service to provinces or cities without HPC engine become a productive application Petroleum Reservoir Simulation Simulate complex fluid (petroleum and natural gas) flows inside the oil field large system of sparse linear equations Fine simulation of a large oil field involves at least 1 million mesh nodes Understand current status of the oil/gas field, and select a most cost effective exploitation plan Incorporation with Da Qing Oil Field Numerical Wind Tunnel Solving large fluid dynamics equations Millions of mesh nodes Simulate aerodynamic process of space vehicles, aircraft and high speed trains Calculate aerodynamic force and heat To improve outline design, reduce resistance, and avoid surface burn Automobile and Ship Design Design of Complex Structures Simulation of Extreme Situations Automobile Collision Experiment Combustion Process within the engine Design of various components with different and complex shapes Incorporation with Industry Partners in Shanghai Scientific Database More than 10 large databases Totally 1.5 Tera Bytes of Data Environment and Resource Database Chemistry Toxicity Database Animal and Plant Database Microorganism Database Science popularization database www.kepu.com.cn Digital Library Incorporation with Ministry of Culture Develop a demonstration system China’s national library and other libraries with HPC as Servers, and Grid as platform Essential Techniques Character Recognition and Calibration Automatic Abstracting from Text Digital Copyright Protection Applications Grid Applications Programming Environment Grid Operating System Visualization Equipment Grid … Grid Node Node Gigabit IP Backbone Expensive Instrument and Equipment Database Information Library Grid System Software Grid Hardware Another View of Grid Meteoro logy Grid Biology Grid Logical Layer iii Petroleum Grid Grid System Software Grid Node iii Grid Node iii Physical Layer Gigabit IP Network Grid Node Information Grid On going research efforts on Grid Continuing support from the national 863 program Initiation of a key project in 863 program with a 100 mil. government fund To attract investment from application and industry 2 to 3 times greater than government fund A 4-year plan Major tasks of the project Developing a high performance computer Building the HPC environment (Grid) Developing a grid software Developing several application grids The High Performance Computer Grid-oriented architecture Efficient support for both scientific & engineering computation and data-intensive applications 4Tflops peak peformance, 2TB memory, 20TB disks Single system image High performance interconnection High I/O capacity Security User, resource, and job management Parallel program development support Building the Grid 1-2 super node: 1Tflops+ 4-6 ordinary node: 100Gflops+ 5-7 Tflops aggregate computing power 30TB disk storage capacity Based on the available network infrastructure (CSTNet, Cernet…) Criteria of node selection: application oriented Grid Software Development System software for the grid Uniform resource management Uniform job management Uniform user management Security Grid Software Development Grid program development environment Compilation Debugging Performance monitoring and analysis Grid user interface Grid browser (GTTP?) Grid portal Web-based technology Appl. Grid: Scientific research grid Scientific data bases (Data Grid) Bioinformatics (BioGrid) Bioinformation databases New drug design Scientific computing (Computing Grid) Appl. Grid: Environment and resource grid Nation-wide geological survey Mass data processing and sharing Collaborative work China Resource Grid Appl. Grid: Manufacturing grid Concurrent engineering Coordinated product design Product simulation Design data and document management CAD software sharing Appl. Grid: Service grid Information service Distributed web caching Content delivery network Transportation management Entertainment International Cooperation Joint research supported by both China and European country Projects funded by both party Common interests Resource sharing Computing power Database Software tools Application software International Cooperation Academic Exchange Conferences on special interests http://www.grid.org.cn Thank you!