中国科学院新疆天文台 China-VO and Astroinformatics 2013 2013.11.13-17 雅安 澳洲工作介绍 2013.5.20-2013.11.10 报告人:张海龙 单 位:中国科学院新疆天文台 二零一三年十一月 中国科学院新疆天文台 Outline: ICRAR,MRO,MWA,IVEC NGAS GAVO Our DC 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 ICRAR The International Centre for Radio Astronomy Research is a collaborative centre based in Perth, Western Australia. Already achieving research excellence in astronomical science and engineering, ICRAR has grown steadily since its launch in September 2009. MRO The Murchison Radio-astronomy Observatory (MRO) is the home of the Australian SKA Pathfinder (ASKAP) and the Murchison Widefield Array (MWA). It will shortly also be the home of a significant part of the Square Kilometre Array (SKA) itself (South Africa will also host part of the SKA). It is located in Western Australia’s Mid West, approximately 300km north east of the coastal city of Geraldton, within the Shire of Murchison. This outback region is roughly the size of the Netherlands with a resident population of around 140 people and a population density of around one person for every 350 sq.km. The centre of the 70 km radius circle defining the MRO is at S26o42’15”, E 116o39’32”. The MRO and the surrounding landscape are protected through federal legislation from any activities that might interfere with the operation of radio telescopes within 150km of the core site. The establishment of this ‘Radio Quiet Zone’ (RQZ) combined with a very low and sparse local population means the MRO is the perfect place for constructing and operating the most sensitive radio telescopes the world has ever seen. 中国科学院新疆天文台 MWA The MWA consists of 2048 dual-polarization dipole antennas optimized for the 80-300 MHz frequency range, arranged as 128 "tiles", each a 4x4 array of dipoles. The array has no moving parts, and all telescope functions including pointing are performed by electronic manipulation of dipole signals, each of which contains information from nearly four steradians of sky centered on the zenith. Each tile performs an analog beamforming operation, narrowing the field of view to a fully steerable 25 degrees at 150 MHz. The majority of the tiles (112) are scattered across a roughly 1.5 km core region, forming an array with very high imaging quality, and a field of view of several hundred square degrees at a resolution of several arcminutes. The remaining 16 tiles are placed at locations outside the core, yielding baseline distances of about 3 km to allow higher angular resolution for solar burst measurements. 中国科学院新疆天文台 中国科学院新疆天文台 ASKAP ASKAP is designed to be a high speed survey instrument with high dynamic range. This requires an extremely wide field-of-view interferometer with Phased Array Feeds (PAFs). The PAF has a total of 188 elements, 94 in each of two polarizations. The specification for ASKAP is: A total collecting area of approximately 4,000 square metres, from 36 antennas each 12 metres in diameter System temperature less than 50 K Frequency range from 700 MHz to 1.8 GHz 300 MHz instantaneous bandwidth 36 independent beams, each of about 1 square degree, yield overlapping to a 30 square degree field-of-view at 1.4 GHz Maximum baseline of approximately 6 km Full cross-correlation of all antennas Possible remote array station capability located in NSW, approximately 3,000 km from the core site. 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 IVEC iVEC is an unincorporated joint venture between CSIRO, Curtin University, Edith Cowan University, Murdoch University and the University of Western Australia and is supported by the Western Australian Government. Fornax The SGI supercomputer comprises 96 nodes, each containing two 6-core Intel Xeon X5650 CPUs, an NVIDIA Tesla C2075 GPU and 72 GB RAM, resulting in a system containing 1152 cores and 96 GPUs. Storage Data Stores @ iVEC also includes the Petabyte Data Store (PB Store) which is a scalable storage solution based on Sun’s SAM-QFS (Storage and Archive Manager File System & Quick File System) software, and consists of a 70TB of disk cache and a second 100TB file system. The SAM-QFS software automates the migration and retrieval of files across multiple tiers of storage from the online disk cache, to near-line tape storage, to offline tape archival. This system allows researchers to see their complete directory hierarchy and have seamless access to their files within a single file system. Each file that is placed on the system has a copy written to two separate tapes. LTO4 and StorageTek SL8500 中国科学院新疆天文台 中国科学院新疆天文台 NGAS (Next Generation Archiving System) ESO’s answer to the data explosion challenge. It is an integrated system consisting of hardware and software. Andreas wicenec andreas.wicenec@uwa.edu.au Research Winthrop Professor UWA data intensive research expert. Specialisation: Data flow and storage design and HPC integration 中国科学院新疆天文台 NGAS, NG/AMS, NGAST???? • • • • Next Generation Archive System: ESO’s answer to the data explosion challenge. It is an integrated system consisting of hardware and software. NG/AMS: Next Generation Archive Management System; this is the NGAS software. NGAST: The name of the ESO project leading to the NGAS. Started in 2000. First prototype system installed on La Silla in 2001. Now there are >100 NGAS servers in the ESO Archive. Never call something NG? True, but the intention is to keep this at the forefront of technology. 14 中国科学院新疆天文台 Scope of NGAS Secure, scalable, distributed and affordable file store. Secure: Do not lose files. Scalable: Be prepared for peta-bytes of data. Distributed: Operate transparently across physically and logically separated locations. Affordable: TCO at least comparable to DVDs in juke-boxes. Lowest component of the archive. 15 中国科学院新疆天文台 NGAS concepts Distributed storage, but not global file system. Almost complete separation of storage and business logic: NGAS stores many files on many media, which may be distributed and moved around the globe. Knowledge and functionality to extract the contents, do groupings and associations is somewhere else. NGAS deals only with file names, IDs, sizes, locations, media IDs, capacity…. And that information is stored in a data base. 16 中国科学院新疆天文台 NGAS concepts No user knowledge required about the physical location of a given file. In an operational environment there is one central access point (master), which passes requests to the correct NGAS server using HTTP proxy or HTTP redirect. Any server can be used as a master. Ensure that there are always at least two copies on two different media of any given file in the system. » Special commands for file and media removal. Active health check of media and files. Active consistency checks of media contents vs. DB. NG/AMS uses modular plug-in architecture. » Very flexible and easy adjustments to hardware and environment changes. 17 中国科学院新疆天文台 Advanced NGAS features • Subscription service: –One NGAS server can subscribe to another. Data flow NGAS Archiving Subscription Parallel cluster mirroring. • Disk and file cloning. • Processing plug-in. • Power saving mode: NGAS • NGAS NGAS NGAS –Automatic sleep and wakeup system built in. 18 中国科学院新疆天文台 NG/AMS Interfaces All user I/F and APIs are based on HTTP. Management information is kept in a DB. Interfaces for five DBs. Interface to syslog E-mail notification services. Message and config-file format is XML. 19 中国科学院新疆天文台 NGAS as an Embedded System NGAS is a tightly integrated system in order to be ready for operations: Minimal boot time Automated Hardware monitoring Integration Media boot and shutdown procedures of raid controller verification using access to on-disk BIOS Cluster configuration VirtualEnv 20 中国科学院新疆天文台 Implementation 100% Python about 25,000 lines + 15,000 lines of in-line documentation advanced system integration tests two person team, plus some help from system admin people. additional code added for mirroring Advanced fully automatic installation procedure. 中国科学院新疆天文台 Data Intensive Research Pathfinder Hierarchical Storage Manager 中国科学院新疆天文台 What’s Unique? Transparent global distribution of archival data. Global mirroring service over slow networks Fully traceable disk based data transfer Very robust and flexible (plugins) Full control of code Used to store VLT, ALMA, MWA, VLA, eVLA and VLBI data. Supports processing inside archive 中国科学院新疆天文台 Where Does it Fit? 40T MRO MWA IVEC-DC Raw Visibilities 1Gb/s 80T Raw Visibilities 10Gb/s MRO MWA IVEC-DC 中国科学院新疆天文台 GAVO (The German Astrophysical Virtual Observatory (GAVO) Dr. Markus Demleitner Astronomisches Rechen-Institut in Heidelberg. Woring on the GAVO project, an effort to contribute to Virtual Observatory, e.g., through the operation of the GAVO data center and developing the DaCHS multiprotocol server suite. http://www.tfiu.de/homepage/ gavo@ari.uni-heidelberg.de 中国科学院新疆天文台 GAVO (The German Astrophysical Virtual Observatory (GAVO) GAVO is the German contribution to the IVOA (International Virtual Observatory Association), the international effort to create and expand the Virtual Observatory (VO). ) The goals of the Virtual Observatory are to allow or improve access to astronomical data of all kinds (astrometry, photometry, spectroscopy, time series,...), from everywhere, in well-defined protocols, let astronomers easily discover, access and use data relevant to them, ensure that data does not simply disappear, that it is properly described and can be accessed and understood in the future, provide software to help astronomers to use all this. 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 中国科学院新疆天文台 Tier 0 Online processing Online archive 25M Correlator Nan Shan PFB Receiver 10Gb/s Data Capture NGAS Client 10Gb/s Tier 2 Tier 1 1G/10Gbps Mirrored Archive (All Data) Publish Data 1Gbps Dedicated Network Qi Tai Site Nan Shan Other Places MS Backup Raw Data NGAS Client Long-term archive R S Q A VO M S VO Based WEB Interface Catalog Data Query Web XAO Headquarters (Storage Arrays) R V local storage Computing resources Processed Data Scripting UI Image, Time sequence HPC XAO’s Future Data Archiving, Processing and Query System Zhanghailong 中国科学院新疆天文台 ICRAR NGAS MRO GAVO MWA OUR DC Question? 38