NASA Center for Computational Sciences
iRODS Prototype Update
NCCS Advanced Technology Team
16 March 2009

Change Log

Version  Date           Author         Change
1.0      2 March 2009   Hoot Thompson
1.1      2 March 2009   Daniel Duffy   Changed background; NCCS architecture tie-in and concept of operations.
1.2      3 March 2009   Hoot Thompson  General clean-up
1.3      10 March 2009  Hoot Thompson  Added security-related information

3/14/2016 NCCS Data Management. 2

Outline
• What is iRODS?
• iRODS Commands
• Rules and Micro-services
• NCCS Prototype
• Prototype Tests
• Web Browser and HDF5 Viewer
• NCCS Architecture and Data Management
• What Next
• Backup Slides
  – Additional iRODS Information
  – Performance Testing

What Is iRODS
• Integrated Rule-Oriented Data System
  – Data grid software system developed by the Data Intensive Cyber Environments (DICE) group (developers of the SRB, the Storage Resource Broker), and collaborators.
• Or it is everything and/or nothing

Basic iRODS Components
[Diagram labels: iRODS Installation(s), Federation, icommands, guis/apis, user(s), admin(s), icat, Metadata, Collection(s), resource(s)]

icommands – Unix Like
• iinit: Initialize. Store your password in a scrambled form for automatic use by other icommands.
• iput: Store a file
• iget: Get a file
• imkdir: Like mkdir, make an iRODS collection (similar to a directory or Windows folder)
• ichmod: Like chmod, allow (or later restrict) access to your data objects by other users.
• icp: Like cp or rcp, copy an iRODS data object
• irm: Like rm, remove an iRODS data object
• ils: Like ls, list iRODS data objects (files) and collections (directories)
• ipwd: Like pwd, print the iRODS current working directory
• icd: Like cd, change the iRODS current working directory
• irepl: Replicate data objects.
• iexit: Log out (use 'iexit full' to remove your scrambled password from the disk)
• ipasswd: Change your iRODS password.
• ichksum: Checksum one or more data objects or collections from iRODS space.
• imv: Move or rename an iRODS data object or collection.
• iphymv: Physically move files in iRODS to another storage resource.
• ireg: Register a file, or a directory of files and subdirectories, into iRODS.
• irmtrash: Remove one or more data objects or collections from an iRODS trash bin.
• irsync: Synchronize the data between a local copy and the copy stored in iRODS, or between two iRODS copies.
• itrim: Trim down the number of replicas of a file in iRODS by deleting some replicas.
• iexecmd: Remotely execute (fork and exec) a command on the server.
• imcoll: Manage (mount, unmount, synchronize and purge of cache) mounted iRODS collections and the associated cache.
• ibun: Upload and download structured (e.g. tar) files.

icommands - Metadata
• imeta: Add, remove, list, or query user-defined Attribute-Value-Unit (AVU) triplet metadata
• isysmeta: Show or modify system metadata
• iquest: Query (pose a question to) the ICAT, via a SQL-like interface

icommands - Informational
• ienv: Show current iRODS environment
• ilsresc: List resources
• iuserinfo: List users
• imiscsvrinfo: Get basic server information; test communication
• irule: Submit a user-defined rule to be executed by an iRODS server.
• iqstat: Show pending iRODS rule executions.
• iqdel: Remove delayed rules from the queue.
• iqmod: Modify delayed rules in the queue.
Rules
• The Rule Engine is a critical and fundamental component of the iRODS system, and is involved in many iRODS operations.
• The core set of rules is defined in the "core.irb" text file in the release.
• The names that begin with "msi" in the rules are Micro-Service Interface routines. These are 'C' functions that the Rules call and that may then call other iRODS functions.
• Rules format
  – actionDef | condition | workflow-chain | recovery-chain
  – Example:
    • acCreateUser||msiCreateUser##acCreateDefaultCollections##msiCommit|msiRollback##msiRollback##nop

Micro-service
• Small, well-defined procedures/functions that perform a certain task.
• Developed and made available by system programmers and application programmers and compiled into the iRODS server code.
• Users and administrators can chain these micro-services to implement a larger macro-level functionality (actions) that they want to use or provide for others.

Adding a Micro-service
• Develop module – collection of specialized micro-services
  – Conform to directory structure
  – Write micro-services 'C' code (hdf5 example printout)
• Enable module
• Make module
• Rebuild action tables

msiDataObjRepl Micro-service Example
/**
 * \fn msiDataObjRepl
 * \module core
 * \author Mike Wan
 * \date 2007
 * \brief replicate an existing data object
 * \param[in STR_MS_T or DataObjInp_MS_T] dataObjName: Path name of data object
 * \param[in STR_MS_T] rsrcName: optional
 * \param[out INT_MS_T] status: status of the operation
 * \DolVarDependence none
 * \DolVarModified none
 * \iCatAtrDependence none
 * \iCatAttrModified none
 * \sideeffect none
 * \return integer
 * \retval 0 on success
 * \bug no known bugs
 **/
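The four-field rule format from the Rules slide can be illustrated with a small parser. This is a minimal sketch in Python, not iRODS code: `parse_rule` and the `Rule` tuple are hypothetical names, and the sketch assumes only what the slide states, namely that fields are separated by `|` and chained steps by `##`. It does not handle a `|` appearing inside a field.

```python
from typing import List, NamedTuple


class Rule(NamedTuple):
    """Hypothetical container for one core.irb-style rule line."""
    action: str
    condition: str
    workflow: List[str]
    recovery: List[str]


def parse_rule(line: str) -> Rule:
    """Split 'actionDef|condition|workflow-chain|recovery-chain'.

    Assumes '|' never appears inside a field; the sketch does not check this.
    """
    fields = line.strip().split("|")
    if len(fields) != 4:
        raise ValueError(f"expected 4 '|'-separated fields, got {len(fields)}")
    action, condition, workflow, recovery = fields
    return Rule(
        action=action,
        condition=condition,
        workflow=workflow.split("##"),
        recovery=recovery.split("##"),
    )


if __name__ == "__main__":
    example = ("acCreateUser||msiCreateUser##acCreateDefaultCollections"
               "##msiCommit|msiRollback##msiRollback##nop")
    rule = parse_rule(example)
    # The slide's example has one recovery entry per workflow step,
    # so the sketch pairs them by position.
    for step, undo in zip(rule.workflow, rule.recovery):
        print(f"{step:28s} recover with: {undo}")
```

In the slide's example the condition field is empty, so the rule applies unconditionally, and each of the three workflow steps lines up with one recovery entry.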
iRODS Prototype
[Network diagram labels: Extreme Networks Summit 400-48T (W220 E100); VLAN (172.16.230.xxx), VLAN (10.4.0.xxx), VLAN (10.2.0.xxx); WANem Network Emulator (rtt=90msec), 172.16.230.11; hosts client1nas, archivenas, solgzip, solrods, solfast, client1nccs, client2nccs, archivenccs, icatnccs; resources client1nasResc, archivenasHSResc, client1nccsResc, archivenccsResc, archivenccsHSResc, icatnccsResc; addresses 172.16.230.1 to .5 and .12, 10.2.0.1 to .5, 10.4.0.1 and .2; Simulated NAS /tms2; Simulated GSFC /tms1; nccsZone]

iput
[Diagram labels: Client, icat (Metadata), resource (/<filesystem>, Data); arrows: iput, data, metadata]
iput -R <resource> </path/filename>

iput With Replicate
[Diagram labels: Client, icat (Metadata), Resource 1 (/<filesystem>, Data), Resource 2 (Data); arrows: iput, data, metadata]
Rule added to core.irb

ils Showing Multiple Copies
kirk@client1nccs:~> ils -L
/archivenccsZone/home/kirk:
  kirk 0 client1nccsResc 0 2009-02-27.13:11 & file_1
    /tms/home/kirk/file_1
  kirk 1 archivenccsResc 0 2009-02-27.13:12 & file_1
    /home/archivenccs/iRODS/Vault/home/kirk/file_1
  kirk 0 client1nccsResc 0 2009-02-27.13:11 & file_2
    /tms/home/kirk/file_2
  kirk 1 archivenccsResc 0 2009-02-27.13:13 & file_2
    /home/archivenccs/iRODS/Vault/home/kirk/file_2
  kirk 0 archivenccsResc 0 2009-02-27.13:11 & file_3
    /home/archivenccs/iRODS/Vault/home/kirk/file_3
  kirk 1 client1nccsResc 0 2009-02-27.13:13 & file_3
    /tms/home/kirk/file_3
  kirk 0 archivenccsResc 0 2009-02-27.13:11 & file_4
    /home/archivenccs/iRODS/Vault/home/kirk/file_4
  kirk 1 client1nccsResc 0 2009-02-27.13:13 & file_4
    /tms/home/kirk/file_4
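To read the `ils -L` listing at a glance, the replica rows can be grouped by data object. The following is a minimal Python sketch under stated assumptions: it takes the column order from the slide (owner, replica number, resource, size, timestamp, `&`, name), keys on the `&` in the sixth column because every row on the slide has one, and would not cope with names containing spaces. `replicas_by_object` is a hypothetical helper, and `SAMPLE` is an excerpt of the slide's output.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Excerpt of the listing on the slide (two of the four files).
SAMPLE = """\
/archivenccsZone/home/kirk:
  kirk 0 client1nccsResc 0 2009-02-27.13:11 & file_1
    /tms/home/kirk/file_1
  kirk 1 archivenccsResc 0 2009-02-27.13:12 & file_1
    /home/archivenccs/iRODS/Vault/home/kirk/file_1
  kirk 0 archivenccsResc 0 2009-02-27.13:11 & file_3
    /home/archivenccs/iRODS/Vault/home/kirk/file_3
  kirk 1 client1nccsResc 0 2009-02-27.13:13 & file_3
    /tms/home/kirk/file_3
"""


def replicas_by_object(listing: str) -> Dict[str, List[Tuple[int, str]]]:
    """Map each data object name to its (replica number, resource) pairs."""
    result: Dict[str, List[Tuple[int, str]]] = defaultdict(list)
    for line in listing.splitlines():
        parts = line.split()
        # A replica row has seven fields with '&' in the sixth position;
        # the collection header and the physical-path lines do not.
        if len(parts) == 7 and parts[5] == "&":
            _owner, repl_num, resource, _size, _stamp, _flag, name = parts
            result[name].append((int(repl_num), resource))
    return dict(result)


if __name__ == "__main__":
    for name, replicas in replicas_by_object(SAMPLE).items():
        print(name, replicas)
```

Replica 0 is on the resource the file was first stored to, so file_1 starts on client1nccsResc while file_3 starts on archivenccsResc; replica 1 is the copy on the other resource.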
ireg
[Diagram labels: client, icat (Metadata), resource (/<filesystem>, Data); arrow: metadata]
ireg -R <resource> </path/filename> </irods/full/path>

ireg With Replicate
[Diagram labels: client, icat (Metadata), Resource 1 (/<filesystem>, Data), Resource 2 (Data); arrows: metadata, data]
Rule added to core.irb

ireg With Replicate – Shared File System
[Diagram labels: Client, icat (Metadata), Client N/Resource 1 (/<filesystem>, Data), Resource 2 (Data); arrows: metadata, data]

iget
[Diagram labels: client, icat (Metadata), resource (/<filesystem>, Data); arrows: iget, data, metadata]
iget -R <resource> </path/filename>

iget Replication Number
[Diagram labels: client, icat (Metadata), Resource 1 (/<filesystem>, Data), Resource 2 (Data); arrow: metadata]

isysmeta
[hoot@leftknee src]$ isysmeta -l ls hdf5_test.h5
doing ls of /leftkneeZone/home/leftknee/hdf5_test.h5
data_name: hdf5_test.h5
data_id: 10012
coll_id: 10008
data_repl_num: 0
data_version:
data_type_name: generic
data_size: 1782027
resc_group_name:
resc_name: leftkneeResc
data_path : /home/hoot/irods/iRODS/Vault/home/leftknee/hdf5_test.h5
data_owner_name: leftknee
data_owner_zone: leftkneeZone
data_repl_status: 1
data_status:
data_checksum :
data_expiry_ts (expire time): : None
data_map_id: 0
r_comment:
create_ts: 01235592554: 2009-02-25.15:09:14
modify_ts: 01235592554: 2009-02-25.15:09:14
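The create_ts and modify_ts lines pair a raw integer with a formatted time. The deck does not say how the integer is encoded; the sketch below assumes it is seconds since the Unix epoch, which is consistent with the arithmetic. The function name is hypothetical, and the UTC-5 offset is an assumption (US Eastern Standard Time in February), not something stated on the slide.

```python
from datetime import datetime, timedelta, timezone


def irods_time_to_datetime(raw: str, utc_offset_hours: int = 0) -> datetime:
    """Interpret a zero-padded time string as seconds since the Unix epoch.

    The epoch interpretation and the caller-supplied UTC offset are
    assumptions made for this sketch.
    """
    tz = timezone(timedelta(hours=utc_offset_hours))
    return datetime.fromtimestamp(int(raw), tz=tz)


if __name__ == "__main__":
    stamp = irods_time_to_datetime("01235592554", utc_offset_hours=-5)
    # Same layout as the isysmeta output.
    print(stamp.strftime("%Y-%m-%d.%H:%M:%S"))  # 2009-02-25.15:09:14
```

With those assumptions the integer 01235592554 reproduces the 2009-02-25.15:09:14 shown next to it. The iadmin `ctime` subcommand listed in the backup slides performs this kind of conversion on the iRODS side.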
imeta – Attribute Value Units
[hoot@leftknee src]$ imeta ls -d hdf5_test.h5
AVUs defined for dataObj hdf5_test.h5:
None
[hoot@leftknee src]$ imeta add -d hdf5_test.h5 length 10 meters
[hoot@leftknee src]$ imeta ls -d hdf5_test.h5
AVUs defined for dataObj hdf5_test.h5:
attribute: length
value: 10
units: meters
[hoot@leftknee src]$ imeta add -d hdf5_test.h5 weight 213 kilograms
[hoot@leftknee src]$ imeta ls -d hdf5_test.h5
AVUs defined for dataObj hdf5_test.h5:
attribute: length
value: 10
units: meters
----
attribute: weight
value: 213
units: kilograms

iRODS Web Browser
[Screenshot]

HDFview iRODS
[Screenshot]

iRODS Explorer For Windows
[Screenshot]

Other iRODS Access Methods
• FUSE
  – File system like interface
  – Tested – caching and performance concerns
• PRODS
  – PHP client API
  – Does not depend on any external library
  – Talks to iRODS server directly via sockets with native iRODS XML protocol
• Jargon
  – Pure Java API for developing programs with a data grid interface
  – Currently handles file I/O for local and SRB/iRODS file systems, as well as querying and modifying SRB/iRODS metadata
  – Easily extensible to other file systems.
• WebDAV
  – Access from an iPhone

Security
• Default is single authentication – user/password
• Grid Security Infrastructure (GSI) option
  – Globus a prerequisite
  – Based on public key cryptography

Passwords
• Challenge/response protocol using an MD5 hash confirms user has the correct password
  – Routines are derived from the RSA Data Security, Inc.
    MD5 Message-Digest Algorithm
  – Password not sent on the network
  – iRODS user passwords stored in the iCAT database in a scrambled form
• iinit stores the password on disk in a scrambled form
  – Avoids storing plain-text passwords in files
  – Warning: with the source code, the passwords can be descrambled
  – Scrambling algorithm is iRODS-specific and is not high-grade encryption
• Database system (PostgreSQL) passwords used to control access to the iCAT database
  – Stored in a server configuration file (by the install script), also in a scrambled form

Access Permissions - ichmod
• Default – file owner has full control (read, write or delete)
• As owner, give access to other users or groups: either just read access, or read and write, or full ownership
  – If 'own' is given to someone else, they can also give (and remove) access to others.
• Remove access by changing the access to 'null'.
• Multiple paths can be entered on the command line.
• If the entered path is a collection, then the access permissions to that collection will be modified
  – Give write access to a user or group so they can store files into one of your collections. Access permissions on collections are not currently displayed via ils
  – As normally configured, all users can read all collections
  – The inherit/noinherit form sets or clears the inheritance attribute of one or more collections. When collections have this attribute set, new dataObjects and collections added to the collection inherit the access permissions (ACLs) of the collection. 'ils -A' displays ACLs and the inheritance status.
Group ichmod Example
archivenccs@archivenccs:~/test> ils -A
/archivenccsZone/home/hoot:
        ACL - hoot#archivenccsZone:own
        Inheritance - Disabled
  file1
        ACL - hoot#archivenccsZone:own
  file2
        ACL - hoot#archivenccsZone:own
  file3
        ACL - hoot#archivenccsZone:own

ichmod read blue file1
ichmod write red file2
ichmod own rodsadmin file3

archivenccs@archivenccs:~/test> ils -A
/archivenccsZone/home/hoot:
        ACL - hoot#archivenccsZone:own
        Inheritance - Disabled
  file1
        ACL - blue#archivenccsZone:read object hoot#archivenccsZone:own
  file2
        ACL - hoot#archivenccsZone:own red#archivenccsZone:modify object
  file3
        ACL - hoot#archivenccsZone:own rodsadmin#archivenccsZone:own

Collection ichmod Example
ichmod own rodsadmin /archivenccsZone/home/hoot

archivenccs@archivenccs:~/test> ils -A
/archivenccsZone/home/hoot:
        ACL - george#archivenccsZone:own hoot#archivenccsZone:own rodsBoot#archivenccsZone:own
        Inheritance - Disabled
  file1
        ACL - blue#archivenccsZone:read object hoot#archivenccsZone:own
  file2
        ACL - hoot#archivenccsZone:own red#archivenccsZone:modify object
  file3
        ACL - hoot#archivenccsZone:own rodsadmin#archivenccsZone:own

Inheritance ichmod Example
ichmod inherit /archivenccsZone/home/hoot

archivenccs@archivenccs:~/test> ils -A
/archivenccsZone/home/hoot:
        ACL - george#archivenccsZone:own hoot#archivenccsZone:own rodsBoot#archivenccsZone:own
        Inheritance - Enabled
  file1
        ACL - blue#archivenccsZone:read object hoot#archivenccsZone:own
  file2
        ACL - hoot#archivenccsZone:own red#archivenccsZone:modify object
  file3
        ACL - hoot#archivenccsZone:own rodsadmin#archivenccsZone:own
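In the inheritance example, enabling inheritance leaves the ACLs of the existing files unchanged; per the ichmod description, only objects added afterwards pick up the collection's ACLs. A toy Python model of that behavior follows. It is an illustration only, not iRODS code: the `Collection` class, its fields, and the file name `file4` are hypothetical, and giving the creator 'own' follows the "file owner has full control" default stated on the ichmod slide.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Collection:
    """Hypothetical in-memory stand-in for an iRODS collection."""
    acl: Dict[str, str]                      # user or group -> access level
    inherit: bool = False                    # the inheritance attribute
    objects: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def add_object(self, name: str, owner: str) -> None:
        """Create a data object; copy the collection ACL only if inheriting."""
        acl = dict(self.acl) if self.inherit else {}
        acl[owner] = "own"                   # the owner has full control by default
        self.objects[name] = acl


if __name__ == "__main__":
    home = Collection(acl={"hoot": "own", "rodsadmin": "own"})
    home.add_object("file1", owner="hoot")   # created before inheritance is enabled
    home.inherit = True                      # analogous to 'ichmod inherit <collection>'
    home.add_object("file4", owner="hoot")   # created afterwards
    print(home.objects["file1"])             # {'hoot': 'own'}
    print(home.objects["file4"])             # {'hoot': 'own', 'rodsadmin': 'own'}
```

The model copies the ACL at creation time, which matches the slide's wording that new objects "inherit" the collection's permissions; whether later changes to the collection ACL propagate is not covered by the deck and is not modeled here.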
NCCS Representative Architecture
[Architecture diagram labels: legend (Existing, Planned for FY09, Future Plans); NCCS LAN (1 GbE and 10 GbE); Data Portal; Data Management; Login; Data Gateways; Analysis; Viz; Discover (Existing 65 TF, FY09 Upgrade ~40 TF, Future Upgrades TBD); ARCHIVE (Disk ~300 TB, Tape ~8 PB); Direct Connect GPFS Nodes; GPFS I/O Nodes; GPFS Disk Subsystems ~1.3 PB; Internal Services (Management Servers, License Servers, GPFS Management, PBS Servers, Other Services)]

Representative Architecture
[Diagram labels: Analysis Service, Compute Service, Data Portal, ARCHIVE, GPFS Storage Cluster; links marked FAST or SLOW]
• The analysis users also require very fast read access to this data from the NCCS analysis platform.
• The modelers require very fast I/O when generating data on the NCCS computational systems.
• The generators of the data also want an easy method for sharing data.
• The generators of the data also want to store the files into the archive for long-term stewardship and retrieval (if necessary).

Competing Requirements
• Capacity and Throughput
  – IPCC, as an example, requires a large amount of data to be kept on disk.
  – The modelers generating the data also need a fast file system to write and subsequently read that data.
  – The analysis users need a fast file system from which to access the large amount of data.
  – All of this lends itself nicely to a global parallel file system (GPFS).
• How do we include data management in this model?

Data Management Concept of Operations
Archive Access
[Diagram labels: iRODS iCAT, iRODS Resource, iRODS Clients, ARCHIVE, DISCOVER; transfer paths: iput, iget; NFS, cp, scp; Bbftp; speed annotations: A BIT FASTER, SLOW (~10 MB/sec), NOT AS SLOW]
• Cons
  – No file system level interface (Is this a con?)
  – Cannot open a file from the archive (Again, con?)
[Diagram labels, continued: FAST, GPFS Storage Cluster]
• Pros
  – Simple, parallel transfers
  – High throughput for large files (~100 MB/sec)
  – Metadata captured

Data Management Concept of Operations
Data Security and Access
• Assume we have a well-defined set of data security and access levels (examples for pedagogical purposes only)
  – Level 0: User only
  – Level 1: User and Project
  – Level 2: User, Project, and Service
  – Level 3: Publicly Accessible
• Users define their data security and access levels using the appropriate process
• When data is put into iRODS by the user under a specific project, it is labeled with the appropriate access level
• All NCCS iRODS-enabled services must then check the access level to see if the service can access the data
• In addition, the user must grant access to the data to the service

Data Management Concept of Operations for IPCC Data
[Diagram labels: Analysis Service, Compute Service, iRODS iCAT, Data Portal, ARCHIVE, GPFS Storage Cluster; links marked FAST or SLOW]
• Analysis users still have very fast (GPFS) file system access to the data.
• Step 1: Modelers generate large amounts of data and store into GPFS (very fast).
• Step 2: Modelers register the data sets into iRODS.
• Step 3: Automatic rules kick in to do the following:
  – A: Automatically extract and publish metadata into a database.
  – B: Make a copy of the file into the NCCS archive.
• IPCC data is presented to the data portal either by NFS or iRODS interface.

Data Management Concept of Operations
More Implementation Details
[Diagram labels: iRODS iCAT, iRODS Resource, ARCHIVE, iRODS Clients]
• Services on the data portal would have interfaces into iRODS. Could even have a local iRODS resource for caching data.
• Archive accessible via iRODS; still use DMF.
[Diagram labels, continued: iRODS Resource Nodes, Data Portal, DISCOVER, FAST, GPFS Clients, GPFS Storage Cluster]
• Dedicated nodes would be a combination of GPFS clients and iRODS resources.

Pros and Cons
• Pros
  – Very easy for users; they can register whatever they want.
  – NCCS-specific micro-services can be set up to automatically copy files to the archive
  – Maintains the fast access to the data for both modelers and analysis users
  – Multi-stream throughput seems to work very well.
• Cons
  – No file system level access to iRODS (could be a pro)
  – No link between data in GPFS and iRODS
    • Data changed with iRODS or GPFS will not be reflected in the other
    • Required to resynchronize the data every so often
    • Data within iRODS not accessible via a file system interface.

Data Portal Services & Architecture
[Diagram labels: NASA, Other, ESG, PCMDI, Data Portal, Local Disk, NFS, GPFS, iRODS, MC]
• Connectivity to the Goddard DISC and DISC SW.
• Interfaces to ESG and PCMDI for model data (IPCC AR5).
• Local disk will allow for a relatively small amount of data to be cached in the portal.
• Sufficient compute capability for some amount of analysis.
• Reach-back capability into the much larger disk environment within the NCCS GPFS and Archive. Users will not have to move or copy data in order to make it available to the portal services.
Concerns
• Integration with ESG
• Database design, implementation and number
• iRODS security model versus NASA/NCCS policies
  – Simple single authentication
  – GSI – Grid Security Infrastructure
• Difficulty of developing module/micro-service
  – Try "get best copy" as an example
• iput and iget bandwidth discrepancy with delay injected remains unresolved
  – Continuing to explore this in the prototype
• Little to no services built on top of metadata
  – Expansive, detailed metadata will have to be scripted

Back-up Slides

Installation
• Automated install script
• Set of preinstall queries
• Downloads and installs all components
  – postgres
    • Can use Oracle, etc.
  – unixodbc

icommands - Administration
iadmin: Administration commands: add/remove/modify users, resources, etc.
Commands are:
  – lu [name[#Zone]] (list user info; details if name entered)
  – lt [name] [subname] (list token info)
  – lr [name] (list resource info)
  – ls [name] (list directory: subdirs and files)
  – lz [name] (list zone info)
  – lg [name] (list group info (user member list))
  – lgd name (list group details)
  – lrg [name] (list resource group info)
  – lf DataId (list file details; DataId is the number (from ls))
  – mkuser Name[#Zone] Type [DN] (make user)
  – moduser Name[#Zone] [ type | zone | DN | comment | info | password ] newValue
  – rmuser Name[#Zone] (remove user, where userName: name[@department][#zone])
  – mkdir Name [username] (make directory(collection))
  – rmdir Name (remove directory)
  – mkresc Name Type Class Host Path (make Resource)
  – modresc Name [type, class, host, path, comment, info, freespace] Value (mod Resc)
  – rmresc Name (remove resource)
  – mkzone Name Type(remote) [Connection-info] [Comment] (make zone)
  – modzone Name [ name | conn | comment ] newValue (modify zone)
  – rmzone Name (remove zone)
  – mkgroup Name (make group)
  – rmgroup Name (remove group)
  – atg groupName userName[#Zone] (add to group - add a user to a group)
  – rfg groupName userName[#Zone] (remove from group - remove a user from a group)
  – atrg resourceGroupName resourceName (add (resource) to resource group)
  – rfrg resourceGroupName resourceName (remove (resource) from resource group)
  – at tokenNamespace Name [Value1] [Value2] [Value3] (add token)
  – rt tokenNamespace Name [Value1] (remove token)
  – spass Password Key (print a scrambled form of a password for DB)
  – dspass Password Key (descramble a password and print it)
  – pv [date-time] [repeat-time(minutes)] (initiate a periodic rule to vacuum the DB)
  – ctime Time (convert an iRODS time (integer) to local time; & other forms)
  – help (or h) [command] (this help, or more details on a command)
  – Also see 'irmtrash -M -u user' for the admin mode of removing trash.
Example icommands
kirk@client1nccs:~> ienv
NOTICE: Release Version = rods2.0.1, API Version = d
NOTICE: irodsHost=archivenccs
NOTICE: irodsPort=1247
NOTICE: irodsDefResource=archivenccsResc
NOTICE: irodsHome=/archivenccsZone/home/kirk
NOTICE: irodsCwd=/archivenccsZone/home/kirk
NOTICE: irodsUserName=kirk
NOTICE: irodsZone=archivenccsZone

kirk@client1nccs:~> ils
/archivenccsZone/home/kirk:
  blah
  foo

kirk@client1nccs:~> ilsresc
archivenccsResc
client1nccsResc

Performance Assessment Summary
• Local testing over 1 Gigabit showed wire speeds for iputs and igets
• Artificial distance testing over 1 Gigabit (with two different delay simulators) yielded wire speed on iputs but significantly less on igets (~10% of iputs)
  – Repeated dialogue with iRODS personnel, but the discrepancy remains unresolved
• Actual distance testing with ARSC showed acceptable results given a 110 msec rtt and an OC-3 pipe

Example Rule – core.irb
# 6) acPostProcForFilePathReg - Rule for post processing the registration
# of a physical file path (e.g. - ireg command).
#
# Currently, three post processing functions can be used individually or
# in sequence by these rules.
# msiExtractNaraMetadata - extract and register metadata from the just
# upload NARA files.
# msiSysReplDataObj(replResc, allFlag) - can be used to replicate a copy of
# the file just uploaded or copied data object to the specified replResc
# The allFlag is only meaningful if the replResc is a resource group. In
# this case, setting allFlag to "all" means a copy will be made in all
# the resources in the resource group. A "null" input means a single
# will be made in one of the resource in the resource group
#
# msiSysChksumDataObj - checksum the just uploaded or copied data object.
# acPostProcForPut||msiSysChksumDataObj##msiSysReplDataObj(demoResc8,all)|nop##nop
# acPostProcForPut||msiSysReplDataObj(demoResc8,all)|nop
# acPostProcForPut||msiSysChksumDataObj|nop
# acPostProcForPut||delayExec(<A></A>,msiSysReplDataObj(demoResc8,all),nop)|nop
# acPostProcForPut||msiSysReplDataObj(demoResc8,all)|nop
#acPostProcForPut||msiSetDataTypeFromExt|nop
acPostProcForPut||nop|nop
acPostProcForCopy||nop|nop
acPostProcForFilePathReg||nop|nop

rulegen is a parser that translates rules written in a more readable language into the cryptic form needed by irule and core.irb. Input files for rulegen should use the .r extension (*.r), and the output it creates uses the .ir extension (*.ir). The grammar for the language of the input files is given at the end of this note.

Local 1 Gigabit – iputs
[Chart: throughput in MB/sec (axis 50 to 120) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB, 8192MB]

Local 1 Gigabit – igets
[Chart: throughput in MB/sec (axis 50 to 120) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB, 8192MB]

Local 10 Gigabit – iputs
[Chart: throughput in MB/sec (axis 175 to 315) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB, 8192MB]

Local 10 Gigabit – igets
[Chart: throughput in MB/sec (axis 150 to 310) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB, 8192MB]

GSFC/to/ARSC – iputs
[Chart: throughput in MB/sec (axis 0 to 12) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB]
ARSC/to/GSFC – igets
[Chart: throughput in MB/sec (axis 0 to 12) versus Threads (1, 2, 4, 6, 8, 10, 12, 14, 16); one series per file size: 64MB, 128MB, 256MB, 512MB, 1024MB, 2048MB, 4096MB]
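As background for the distance tests, a single TCP stream cannot exceed its window size divided by the round-trip time, which is one general reason multi-threaded transfers help on long paths. The Python sketch below works that arithmetic for the two round-trip times in this deck (90 msec emulated, 110 msec in the distance test). The 64 KiB window is an illustrative assumption, not a setting measured on the prototype, and the deck does not attribute the iput/iget discrepancy to this effect. The OC-3 line rate of 155.52 Mbit/sec is the standard figure; function names are hypothetical.

```python
def window_limited_mb_per_sec(window_bytes: int, rtt_sec: float, streams: int = 1) -> float:
    """Upper bound in MB/sec (1 MB = 10**6 bytes) for window-limited TCP streams."""
    return streams * window_bytes / rtt_sec / 1e6


def bandwidth_delay_product_bytes(link_bits_per_sec: float, rtt_sec: float) -> float:
    """Bytes that must be in flight to keep a link of the given rate full."""
    return link_bits_per_sec * rtt_sec / 8


if __name__ == "__main__":
    window = 64 * 1024            # assumed per-stream window, for illustration only
    for rtt in (0.090, 0.110):    # round-trip times quoted in the deck
        for streams in (1, 4, 16):
            bound = window_limited_mb_per_sec(window, rtt, streams)
            print(f"rtt={rtt * 1000:.0f} msec, streams={streams:2d}: <= {bound:.2f} MB/sec")

    oc3_bits_per_sec = 155.52e6   # standard OC-3 line rate
    bdp = bandwidth_delay_product_bytes(oc3_bits_per_sec, 0.110)
    print(f"OC-3 bandwidth-delay product at 110 msec: {bdp / 1e6:.2f} MB")
```

Under the assumed window, one stream at 90 msec is bounded below 1 MB/sec, and the bound grows linearly with the number of streams until the link itself becomes the limit.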