An Introduction to National High Performance Computing Environment Zhimin Tang,

advertisement
An Introduction to
National High Performance
Computing Environment
Zhimin Tang, tang@ict.ac.cn
Institute of Computing Technology
Chinese Academy of Sciences
Supported by State High-Tech Program
http://www.grid.org.cn
Outline
•
•
•
•
•
Electric Power Grid and Internet
Computational Grid
Information Grid
Developing Information Grid in China
Using China-build High Performance
Computers and Gigabit Networks
• Applications
• Future Work
The S Curve of Technology
Impact to Economy
and Society
Internet
Information Grid
?
Internet
Web
Time
Characteristics of
Electric Power Grid
Power
Plant
Power
Plant
Lighting
Office
Factory
Electric Power Grid vs Internet
Electric Power Grid
Internet
Electric Energy
Power Plant
Information
High
Performance
Computers, Servers
Computing Center
Power Grid
Data Network
Power Generator
Middleware
Electric power scheduling
Water, wind, nuclear, fire,
Data Source
Solar, etc.
Network Terminals
Electric appliances
Electric Power Grid vs Internet
Electric Power Grid
Internet
Ubiquitous
Limited Existence
Uniform
Isolated Islands
Automatic Load Balance No( or Limited)
Limited (Need to Specify)
Resource Sharing
Generated Automatically Produced Manually
Pure Resource
Intelligent
Lots of Grabage
Less Intelligence
Migrate form Internet to
Information Grid
• Supply computing power and organized
information/knowledge to end-users
– Just like what the power grid does
– Users need not know where the computing
power or electric power comes
• Much work needed to fill the gap
– Uniform Interface for Various Users
– Uniform Representation for Resources
– Automatic load balance and scheduling
High Performance Computing Environment
Node
Node
Node
Node
Expensive
Equipment
Database
Gigabit IP Network
MP3 Player
Internet
Digital
Camera
Telephone
DVD
Mobile
Phone
TV
PC,
Workstation
Notebook
HPC
Game
Player
Current Status and Trends
• United States
– NSF:Computational Grid
– DOE:ASCI Grid
– NASA:Information Power Grid (IPG)
– DOD:Global Information Grid (GIG)
• Europe Grid Forum: eGrid
– CERN LHC Grid
• China:National HPC Environment
– www.grid.org.cn
Grid vs WEB
Grid
Data Storage Uniform
Merged
Data Types
Data
Content
Propagation
Single
System
Image
Resources
Web
Scattered
Heterogeneous
Web Pages
Discrete Sites
Essential Problems (1)
• Useful: Get large amount of resource
• Ease of Use: Chinese Interface,
– Everybody can use it
• Single System Image: Location
Independent
– User Interface
– Data, File System, Memory
– Authentication, Attribute Checking
– Management
Essential Problems (2)
• Wide-area Directory and Registration
• Wide-area Caching (or Buffering)
– cache, mirror, duplication, partition
• Resource Management
– Both wide-area and intra-node
• Grid and Node Reconfiguration
• QoS in Grid
• Wide-area Operating System
National HPC Environment
Applications
Grid Applications
Programming Environment
Grid Operating System
Visualization
Equipment
Grid … Grid
Node
Node
Gigabit IP
Backbone
Expensive
Instrument
and
Equipment
Database
Information
Library
Grid
System
Software
Grid
Hardware
Another View of Grid
Meteoro
logy
Grid
Biology
Grid
Logical
Layer
iii
Petroleum
Grid
Grid System Software
Grid
Node
iii
Grid
Node
iii
Physical
Layer
Gigabit IP Network
Grid
Node
Information
Grid
High Performance Computing Centers
Beijing
Gigabit Backbone
Gigabit Backbone
Xi’an
Nanjing
Chengdu
Wuhan
Changsha
Hefei
Shanghai
High Performance Computers
• Dawning 2000/3000 SuperServer (Cluster)
– Developed by National Research Center for
Intelligent Computing Systems (NCIC)
– Equipped 7 grid nodes, with configurations
ranging from 32-160 CPUs, 20-400 Gflops
– CPU: PowerPC or Power II
– O/S: IBM AIX
– Database: DB2, Oracle
– Language: C, C++, F77, MPI, PVM, ...
High Performance Computers
• Galaxy 3 (MPP)
– 20 Gflops
• Sunway 1 (MPP)
– 460 Gflops
• Tsinghua Tongfang PC-Cluster
– 16 CPUs, 8 Gflops
Gigabit Networks
• National Cable TV Backbone
– 1 Gbps channel for intra-Grid connection
– Available in early next year
• CERNET (China Education Research)
– Currently 155Mbps, shared media
– Upgrade to 622Mbps, 2.5Gbps soon
• Other High Bandwidth Networks
– e.g., NSFCnet in Beijing area
– When available
Grid System Software
Users and Applications
Learning
Security User Mng. Resource Batch Job
Directory Management Online Help
Common Interface Layer
Interface1
SunWay 1
Interface2
Galaxy 3
Interface3
Tongfang108
Interface4
Dawning 2000
User Management
Job Management (submit)
No. of CPUs
Working Directory
Program Name
STDIN
STDOUT
Execution Time
Host Type
Host Name
Resource Management
(Find a User)
Search by
•Host Name
•User Name
•User Type
•User Account
•E-mail
•Phone
Grid System Monitor
Grid Utilities
Simulate
Command
Line Interface
Grid Applications
•
•
•
•
•
•
•
•
Domestic Numerical Weather Forecast
Petroleum Reservoir Simulation
Numerical Wind Tunnel Simulation
Automobile Collision Simulation
Structural Design of Ships
Bio-Informatics Database and App.
National Scientific Database and App.
Digital Library
Bio-Informatics Database
• Incorporation with Peking University
• Provide gene and protein database for
100,000 scientists and researchers in biology
and related areas
– By the end of this year, there will be more than
500GB, including mirrors of international biodatabase and China’s own bio-information
• Develop applications based on bioinformatics database
Numerical Weather Forecast
• Collective Forecast
– Run 12 different forecasting Models on the
same data, and summarize to get a more
convinced result
• Local Area Weather Forecast
– Give 36 hours’ weather forecast, focusing
on a city or small area about 15-45KM
• Provide remote weather forecast service
– to provinces or cities without HPC engine
– become a productive application
Petroleum Reservoir Simulation
• Simulate complex fluid (petroleum and
natural gas) flows inside the oil field
– large system of sparse linear equations
• Fine simulation of a large oil field
involves at least 1 million mesh nodes
• Understand current status of the oil/gas
field, and select a most cost effective
exploitation plan
– Incorporation with Da Qing Oil Field
Numerical Wind Tunnel
• Solving large fluid dynamics equations
– Millions of mesh nodes
• Simulate aerodynamic process of space
vehicles, aircraft and high speed trains
• Calculate aerodynamic force and heat
• To improve outline design, reduce
resistance, and avoid surface burn
Automobile and Ship Design
• Design of Complex Structures
• Simulation of Extreme Situations
– Automobile Collision Experiment
– Combustion Process within an engine
– Design of various components with
different and complex shapes
• Incorporation with Industry Partners in
Shanghai
Scientific Database
• More than 10 large databases
– Totally 1.5 Tera Bytes of Data
•
•
•
•
•
Environment and Resource Database
Chemistry Toxicity Database
Animal and Plant Database
Microorganism Database
Science popularization database
– www.kepu.com.cn
Digital Library
• Incorporation with Ministry of Culture
– China’s national library and other libraries
• Develop a demonstration system
– with HPC as Servers, and
– Grid as backbone
• Essential Techniques
– Character Recognition and Calibration
– Automatic Abstracting from Text
– Digital Copyright Protection
Future Work
• More Grid Nodes
– 2 National Centers (1-10 Tflops)
– 15 Local Centers (100-500 Gflops)
• More Powerful Grid Software
– Efficient Resource Management
– Support Grid-level Heterogeneous Computing
• More Elaborate Applications
– Help Industry Improve Design Capability
– Provide powerful Tools for Scientists
Download