Redbooks Paper Alex Osuna Infrastructure Simplification with IBM System Storage N Series Introduction This IBM Redpaper discusses how the IBM® System Storage™ N series storage system simplifies your storage infrastructure. The signs of complexity are everywhere in the modern data center (see Figure 1 on page 2). Firewalls, routers, servers, and multiple operating systems are just the beginning. Server environments, networks, and storage are typically silos of resources that are dedicated per application. They grow with only a loose connection to one another and usually with no regard for overall system complexity. Such environments can be difficult to understand, manage, secure, update, and operate. © Copyright IBM Corp. 2006. All rights reserved. ibm.com/redbooks 1 Figure 1 Storage complexity Infrastructure simplification is a method of looking at an entire environment from end to end and finding ways to eliminate the complexity that can inhibit the free flow of information that is necessary in the fast-paced business world. The storage environment offers numerous opportunities for simplification that can help unlock the value of information to your business. The ability to share and manage information simply and easily is a key requirement for integrating business processes and for On Demand Business. In this paper, we focus on the storage portion of the IT environment. The IBM N series storage system can help simplify a storage infrastructure and the management of that infrastructure. Using the broad features of the IBM N series storage system, a company can holistically address simplification issues, data, and storage and create a more cost-effective, flexible, and resilient environment. In many cases, storage products come from different vendors and represent different generations of technology. Often, this is a natural result of short-term needs outweighing long-term planning or of unplanned growth such as mergers 2 Infrastructure Simplification with IBM System Storage N Series and acquisitions. Network Appliance Snapshots, recovery and backup processes, volume and information management, and other management tasks often incorporate different software from different vendors. An organization that is hampered by such inconsistencies cannot expect to deploy resources where they are most needed or respond to customer needs in a timely fashion. What can an enterprise do? It can make the move towards infrastructure simplification. Infrastructure simplification helps enable business growth, helps reduce operational risks, and helps reduce costs by optimizing IT resources and improving productivity. The free flow of information throughout an organization is the main currency for business innovation and creativity. Throughout a company, the IT infrastructure is where all of this information flows. When the IT infrastructure grows without regard to easing information flow, complexity can choke that free flow just as arterial plaque chokes blood flowing to the heart. How can infrastructure simplification help a business achieve its goals? Simplification strips away inhibitors to business acceleration for faster product development and communications with customers and partners. Simplification also allows shared data to foster innovation. Most importantly, by providing a highly efficient, highly automated environment, infrastructure simplification can help reduce the total cost of ownership (TCO) for an enterprise. When people have instant access to the information that they need in the form that they need, they can provide better service to customers and find better ways to bring value to the marketplace. Three proven methods Storage today has evolved with a shift towards networked storage (see Figure 2 on page 4). Infrastructure Simplification with IBM System Storage N Series 3 Technology shifting towards the network Storage virtualization Virtualization Storage networking Networking Consolidation Heterogeneous Heterogeneous storage Storage Direct attach storage Shared storage Schedule-based Schedule based automation Automation Networked storage Figure 2 Storage evolution Using three proven methods, you can reduce the complexity of a networked storage environment. These methods can help organizations achieve better application availability, improved productivity, and better system optimization, including utilization. The methods are: Consolidation Virtualization Automated management Consolidation Consolidating the storage environment is the logical first step in any effort to simplify infrastructure. There are generally four approaches to consolidation: Centralization, or reducing the number of locations where data is stored to only those that are really needed Physical consolidation to fewer, larger-capacity devices (“scale up”) Data integration through such means as modular storage that easily “scales out” by adding more of the same units Application integration, or moving applications from clusters or multiple linked servers to a smaller number of storage devices 4 Infrastructure Simplification with IBM System Storage N Series The general goal of consolidation is to reduce the number of points of management and the number of physical devices. Consolidated storage environments have fewer elements, which can help reduce the chance of error or failure, avoid the creation of islands of storage, and take better advantage of economies of scale for owning software, disks, and tape. Consolidated environments can be more resilient and more cost effective to manage, reducing planned downtime and TCO. In consolidated environments, servers and storage can scale up or out, depending on the needs of an organization. You can scale up with large capacity tape and disk systems while also scaling out with modular storage products. In addition to scaling storage technology, scale-up server technology can assist your business with the handling of surges in application usage and transaction rates. Scale-out server technology can help you handle hundreds, or even thousands, of application instances. Virtualization Effective virtualization is an essential part of a simplified storage environment. It involves a shift in thinking from physical to logical, treating IT resources as a single, logical pool rather than as separate physical entities. Sieb As business processes change, they can easily be linked to the proper storage for the business need. FCP, iSCSI, NFS,CIFS, HTTP N series SATA Fibre Array Figure 3 Storage virtualization Virtualization is a natural part of the progression of IT systems. Other networks, such as e-mail and the Internet, have long since progressed past the need to keep track of physical resources in order to perform. The same shift is now necessary for storage networks. Just as Web site addresses mask numeric IP Infrastructure Simplification with IBM System Storage N Series 5 addresses and e-mail addresses mask a sequence of router hops leading to a final e-mail server, virtualization masks complexity. Virtualization can help an enterprise: Increase resource utilization by combining the storage capacity of multiple disk arrays into a single reservoir of storage. Improve personnel productivity by enabling administrators to manage their reservoir of storage from a single user interface at a central point. Enable a tiered storage environment in which the cost of the storage can be matched to the value of the data. Aid with the consolidation of resources and simplification of management to help reduce cost and complexity, allowing a business to focus on innovation and growth. Use grid computing to help broaden the access to resources such as data, applications, and processing capacity. Offer the freedom to use heterogeneous hardware providers with common server management, storage management, and copy services software. Virtualization in storage is a key component of creating a multi-tier storage environment that is managed separately from the application and server layers. By insulating the file, volume layers, or both from applications, virtualization helps make it easier to enact data life cycle management, the enabling infrastructure for information life cycle management. Automated management The third proven simplification method is automated management. Automated management can help you understand, manage, and make necessary changes to an IT environment when you have been previously authorized to do so by an IT administrator. This can help produce increased reliability and productivity for a business. Automation tools can aid with backups. For example, you can use IBM Tivoli® Storage Manager to schedule and manage backups. Automated management can also automate tasks such as Snapshots for applications (see Figure 4 on page 7). 6 Infrastructure Simplification with IBM System Storage N Series Figure 4 Automating Snapshots The important applications of automated management are: Driving resource provisioning and workload management with policies that are based on business goals. This helps you attain better alignment of your IT infrastructure with your business. Scheduled free space provisioning when someone in your enterprise asks for additional storage space. You can automate each step in the process. Tasks that previously took as long as 10 days might now take less time. Enterprise-wide reporting and monitoring through the automated tracking of capacity, utilization, performance, file and device health, and integration with systems management software. Emergency management when your device runs out of space or an application suddenly fails in the middle of the night. With automated management and virtualized storage pools, instead of trying to access the storage from different areas and connecting it, you can remotely draw from one big virtual pool of free space. With integrated management products, you can automate complex tasks so that you have the flexibility to design new tasks that take advantage of automation. Infrastructure Simplification with IBM System Storage N Series 7 How IBM N series can help an enterprise It is critically important not to isolate storage from a server and network environment or treat it as a collection of separate, unrelated machines. Instead, simplifying your storage infrastructure can help create a more cost-effective, flexible, and resilient environment. In this environment, resources and information flow freely to where they are most needed, and they can be retained or removed based on business priorities. The IBM N series has an impressive portfolio of features that includes: Support of multiple operating systems Multiple protocols Different storage options for different performance characteristics Virtualization Automated management software Attachment to heterogeneous storage The IBM N series storage system can holistically address your infrastructure simplification issues. Consolidation with IBM N series The IBM N series storage system can help consolidate storage with virtual filers, FlexVol, and multi protocol support. It can also help you share and utilize data better and can dramatically reduce the number of management points for storage and servers in a scale-out strategy. With the multiprotocol support of an IBM N series storage system, there is no need to build storage silos that are based on access method or protocol. IBM N series can serve both storage area network (SAN) and local area network (LAN) protocols simultaneously (see Figure 5 on page 9). 8 Infrastructure Simplification with IBM System Storage N Series SAN Enterprise LAN Departmental Enterprise Departmental iSCSI Dedicated Ethernet Fibre Channel SAN (Block) Corporate LAN Nettwork Attached Storage (File) IBM N series 18 © 2005 IBM Corporation Figure 5 Multi protocol access Also, businesses tend to have direct attached storage (Figure 6). Siebel Figure 6 Direct attached storage Infrastructure Simplification with IBM System Storage N Series 9 This is especially true for small to medium-sized businesses. With IBM N series, you can consolidate these islands of storage, increase idolization, and decrease management (Figure 7). Implementations of IBM N series do not require installing a SAN. Instead, you can utilize your existing IP network. Siebel FCP, iSCSI, NFS,CIFS, HTTP N series 64 IBM TotalStorage® | The power to break through © 2004 IBM Corporation Figure 7 Consolidating storage In addition to being able to consolidate storage to one IBM N series storage system, you can consolidate multivendor storage with the IBM System Storage N series Gateway (see Figure 8 on page 11). 10 Infrastructure Simplification with IBM System Storage N Series g Siebel FCP, iSCSI, NFS,CIFS, HTTP N Series IBM HP Hitachi Figure 8 Multivendor storage Network Appliance MultiStore technology provides an efficient architecture for consolidating multiple physical storage systems into a smaller number of systems. From the user’s perspective, each virtual storage controller appears as a separate physical storage system with a unique IP address. For example, storage for multiple business units can be consolidated into a smaller number of systems, increasing the return on investment and also easing management challenges. Application service providers can also consolidate the storage needs of their customers. The IBM N series hardware is made up of CPUs, network cards, Fibre Channel controllers, power supplies, disk drives, and more. With MultiStore, IBM N series storage and networking resources can be effectively partitioned and dynamically assigned to virtual IBM N series storage systems. This virtualizes the physical resources and moves beyond the logical architectural limitations that are inherent in a single physical IBM N series. As many as 32 virtual IBM N series storage systems (in addition to the hosting IBM N series) can be created and hosted in an IBM N series storage system, each serving data as though it were a physical IBM N series storage system. Other network computers communicate with virtual IBM N series storage systems as they would with any other file server (see Figure 9 on page 12). Infrastructure Simplification with IBM System Storage N Series 11 Figure 9 Vfiler Network Appliance SnapVault uses a client/server model to provide multiple storage appliances with online backup and restore services. A centralized SnapVault server (see Figure 10 on page 13) receives backup data from multiple SnapVault clients. SnapVault controls the entire online backup process. Snapshots are created at designated times on SnapVault clients, transferred to the SnapVault server, and merged into the destination file system. 12 Infrastructure Simplification with IBM System Storage N Series N series Storage N series Storage Secondary N series Figure 10 Backup consolidation with SnapVault With Tivoli Storage Manager, you can also consolidate backups of multiple IBM N series filers to a single Tivoli Storage Manager server and tape library (see Figure 11 on page 14). Infrastructure Simplification with IBM System Storage N Series 13 Figure 11 IBM N series backup Consolidation Virtualization with IBM N series IBM N series storage systems can assist you with the transition from physical file and disk management to a more flexible virtualized environment. There are multiple levels of virtualization layers in the IBM N series (see Figure 12 on page 15). In this section, we discuss several of them. 14 Infrastructure Simplification with IBM System Storage N Series Other Storage HP Storage 19 © 2005 IBM Corporation Figure 12 IBM N series virtualization layers One of the first levels of virtualization with the IBM N series is at the volume level. Flexible volumes (Figure 13) are a ground breaking technology. Flexible Volumes Disks Disks Disks Pooled physical storage (aggregates) Figure 13 Flexible volumes Infrastructure Simplification with IBM System Storage N Series 15 These volumes are logical data containers that are managed separately from their underlying physical storage. They can be sized, resized, managed, and moved independently and non-disruptively. Flexible volumes are file systems that hold user data that is accessible by one or more of the access protocols supported by Network Appliance Data ONTAP, including Network File System (NFS), Common Internet File System (CIFS), HTTP, FTP, Fibre Channel Protocol (FCP), and Internet Small Computer System Interface (iSCSI). Because each flexible volume is a separate file system, you can create one or more Snapshots of the data in a volume so that multiple, space-efficient, point-in-time images of the data can be maintained for purposes such as backup and error recovery. The next level of virtualization is at the aggregate level. As shown in Figure 14, an aggregate is defined as a pool of many disks, from which space is allocated to volumes (shown as IBM FlexVol and FlexClone entities in Figure 14). From the administrator’s point of view, volumes remain the primary unit of data management. But what is not apparent to the administrator is the fact that flexible volumes refer to logical entities, not (directly) to physical storage. FlexVol dynamic virtualization: Designed to offer dramatic improvement in storage management Manage data, not disks Applications: Tied to flexible volumes Designed to provide: Flexible Volumes: Not tied to physical storage Improved asset utilization Better productivity Increased performance Disks Disks Disks Pooled physical storage (aggregates) Figure 14 An aggregate consists of a pool of many disks from which space is allocated to volumes Even data duplication can be virtualized with FlexClone. A FlexClone volume is a writable point-in-time image of a FlexVol volume or another FlexClone volume. 16 Infrastructure Simplification with IBM System Storage N Series FlexClone volumes add a new level of agility and efficiency to storage operations. They take only a few seconds to create and are created without interrupting access to the parent FlexVol volume (Figure 15). FlexClone volumes use space very efficiently, using the Data ONTAP architecture to store only data that changes between the parent and clone. This is a huge potential saving in dollars, space, and energy. In addition to these benefits, clone volumes have the same high performance as other kinds of volumes. Figure 15 FlexClone IBM N series also excels in logical unit number (LUN) virtualization with Unified Multiprotocol Data Access. IBM N series virtualization technology unifies many storage access modes in a single, easy-to-manage appliance. For example: A single IBM N Series storage system can virtualize SAN LUNs into CIFS, NFS, iSCSI LUNs, and FCP LUNs. This unification provides unprecedented economies and flexibility, opening the door to simplified storage administration and greater returns on technology investment. IBM N series systems virtualize target LUNs so that the appliance can present iSCSI and FCP LUNs to hosts. The same LUN can be presented over iSCSI or FCP, providing simplified upgrade paths as the need for higher Infrastructure Simplification with IBM System Storage N Series 17 performance and resiliency grows. You can resize LUNs in seconds, simplifying allocation decisions and reducing management costs. With Snapshots, you can also virtualize data duplication. Snapshots (Figure 16) use a copy-on-write technique to avoid duplicating disk blocks that are the same for a Snapshot and the active file system. Only when blocks in the active file system are modified or removed do Snapshots that contain those blocks begin to consume disk space. This creates a virtualized full copy of the source data. Block 1 Snapshot •A Snapshot is a read-only, “freeze framed” version of a filer file system, frozen at some past point in time. Block 2 •A volume can maintain up to 255 Snapshots concurrently. •Snapshots are readily accessible via “special” subdirectories that appear in the current or active file system. Block 3 •Snapshots consume space when the file system changes. •Snapshots use no additional disk space when first taken. •Snapshots can be taken manually or automatically on a schedule. Figure 16 Snapshot Users can access Snapshots through NFS to recover files that they have accidentally changed or removed, and system administrators can use Snapshots to create backups safely from a running system. In addition, Write Anywhere File Layout (WAFL) uses Snapshots internally so that it can restart quickly even after an unclean system shutdown. At the highest level of virtualization with the IBM N series, there is the Vfiler. A virtual storage controller is a lightweight instance of a multiprotocol server. The physical resources of the storage system, such as system memory and CPU, are shared between virtual storage controllers. A virtual storage controller consists of data stored in a volume or a qtree, the IP address(es) necessary to reach the virtual storage controller, and the security 18 Infrastructure Simplification with IBM System Storage N Series and other attributes associated with the data. From the client systems and management software perspective, the data is completely secure and isolated from all other virtual storage controllers. The benefits of Vfilers are: Vfilers appear in the network as discrete multiprotocol file servers and are accessed in the same way as other Windows® or UNIX® file servers. Companies can deploy a Vfiler into full production in a matter of minutes with only a few software commands. Each Vfiler independently controls access to its storage resources according to its security settings and the permissions that are assigned to groups and users by Windows, UNIX system administrators, or both. You can explicitly assign network settings, such as IP addresses, IP space associations, DNS information, Windows domain, and other settings, to each Vfiler (Figure 17). Figure 17 Managing Vfilers Infrastructure Simplification with IBM System Storage N Series 19 You can replicate or move virtual storage systems and their resources to other storage systems, dramatically simplifying data consolidation, migration, and disaster recovery. You can add, remove, or move resources between virtual storage systems at any time. Automation with IBM N series IBM N series helps simplify storage management by automating data duplication with scheduled Snapshots and backups, using either SnapVault or Tivoli Storage Manager. Snapshot automation Snapshots can be automated to occur weekly, daily, or hourly, depending on your business needs (Figure 18). Figure 18 Snapshot scheduling 20 Infrastructure Simplification with IBM System Storage N Series SnapVault scheduling and retention The SnapVault schedule details the frequency, the number of copies to retain, the date, and the time for performing incremental backups for a specific SnapVault relationship. The SnapVault secondary system creates and maintains copies, based on the specified schedule, for each primary data set that it is responsible for protecting. Incremental backups can be scheduled every hour, week, or month, depending on the needs of the environment. Backup and storage administrators have considerable flexibility when they are defining policies for data protection. In normal operation, updates and Snapshot creation proceed automatically, according to the Snapshot schedule. SnapVault also supports manual operation through basic command line and management interface operations. This allows on demand, application-level integration for specific applications or servers that require an application or event-driven backup capability. All of these scheduling options can increase the frequency of backups without an increased requirement for baseline transfers and media cost. In most cases, you can reduce and even eliminate traditional backup windows after performing the the initial full backup. Tivoli Storage Manager automation You can schedule the backup or restore of images that Network Data Management Protocol (NDMP) operations produce with administrative schedules that process the BACKUP NODE or RESTORE NODE administrative commands. The BACKUP NODE and RESTORE NODE commands can be used only for nodes of TYPE=NAS. In Example 1, we scheduled a full backup, including a table of contents. Example 1 Define scheduled backup define schedule NAS001 t=a cmd='backup node N37002 /vsimple2 mode=full toc=yes' active=yes peru=o starttime=15:30 Autosupport With the Autosupport feature, you can configure an IBM N series storage system so that it initiates automatic e-mail notifications to IBM support and to other designated addresses when specific events occur with the IBM N series storage systems. The e-mail notification contains useful information to help technical support teams identify and solve problems quickly and proactively. Infrastructure Simplification with IBM System Storage N Series 21 You can also configure the IBM N series storage systems to initiate short alert notifications of urgent events that contain only the reason for the alert to a separate list of recipients. This feature is useful for system administrators who read e-mail messages on alphanumeric pagers. Each message generated by Autosupport contains the following types of information: Date and time stamp of the message Data ONTAP software version System ID of the IBM N series storage system Host name of the IBM N series storage system Software licenses that are enabled for the IBM N series storage system Product serial number Simple Network Management Protocol (SNMP) contact name and location (if specified in /etc/rc) Contents of the messages log Output of the following console commands (some are applicable only to the licensed protocols): – – – – – – – – – sysconfig - p options ifconfig -a \ nfsstat -c cifs stat (included if CIFS is licensed) cifs sessions (included if CIFS is licensed) cifs shares (included if CIFS is licensed) httpstat df Conclusion The signs of complexity are everywhere in modern data centers and modern storage environments. If your enterprise can make the move toward infrastructure simplification, it can contain expenses, enable business growth, and reduce operational risk by optimizing IT resources. As part of the IBM strategy for helping your company enable on demand information, the IBM N series storage system can help you simplify your infrastructure and its management. 22 Infrastructure Simplification with IBM System Storage N Series The team that wrote this Redpaper This Redpaper was produced by a specialist from the International Technical Support Organization, Tucson. Alex Osuna is a Project Leader at the International Technical Support Organization (ITSO), Tucson center. He writes extensively and has taught IBM classes worldwide on all areas of storage. Before joining the ITSO, Alex held positions in systems engineering, performance, services development, product planning, storage development, storage support, field engineering, and IT support for the Military IBM Air Defense Computers. He holds over 10 certifications with IBM, Microsoft®, and Redhat. Thanks to the following people for their contributions to this project: Miroslav Klivansky and Edward Reidenbach Network Appliance Infrastructure Simplification with IBM System Storage N Series 23 24 Infrastructure Simplification with IBM System Storage N Series Notices This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing, IBM Corporation, North Castle Drive, Armonk, NY 10504-1785 U.S.A. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. COPYRIGHT LICENSE: This information contains sample application programs in source language, which illustrate programming techniques on various operating platforms. You may copy, modify, and distribute these sample programs in any form without payment to IBM, for the purposes of developing, using, marketing or distributing application programs conforming to the application programming interface for the operating platform for which the sample programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. © Copyright International Business Machines Corporation 2006. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. 25 Send us your comments in one of the following ways: Use the online Contact us review redbook form found at: ibm.com/redbooks Send your comments in an email to: redbook@us.ibm.com Mail your comments to: IBM Corporation, International Technical Support Organization Dept. HYTD Mail Station P099, 2455 South Road Poughkeepsie, NY 12601-5400 U.S.A. ® Trademarks The following terms are trademarks of the International Business Machines Corporation in the United States, other countries, or both: Eserver® Eserver® Redbooks (logo) ™ IBM® Redbooks™ System Storage™ Tivoli® The following terms are trademarks of other companies: Vfiler, Snapshot, Data ONTAP, Network Appliance, WAFL, SnapVault, MultiStore, and NetApp logo are trademarks or registered trademarks of NetApp Corporation or its subsidiaries in the United States, other countries, or both. Microsoft, Windows, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. UNIX is a registered trademark of The Open Group in the United States and other countries. Other company, product, or service names may be trademarks or service marks of others. 26 Infrastructure Simplification with IBM System Storage N Series