IBM Information Infrastructure Addressing Information Retention IBM Information Infrastructure Archiving Leadership Name: Tony Pearson Date: February 26, 2009 © 2009 IBM Corporation
Agenda
§ Information Infrastructure Challenges
§ IBM’s approach
§ Information Retention
§ Why IBM
Information Growth and Drivers
Structured data (database data): growing at 32%
Unstructured data, also called ‘content’ (files, medical images, docs, web content, rich media files, etc.): growing at 64%
§ Application-driven growth
  – Email
  – E-commerce
  – ERP/CRM
§ New applications
  – Web 2.0
  – Digital video, voice, audio
§ Data multiplier effect
  – Backup, Disaster Recovery
  – Test, Development
§ Mergers and acquisitions
Source: IDC, "Storage Infrastructure: Innovations for the Future Datacenter," Doc # DR2008_1RV, March 2008
Complete, Integrated, Available Today
Manage information more effectively and mitigate information risks … with a dynamic infrastructure … that efficiently and securely stores, protects, and provides optimized access to information.
What are your risks?
§ Have you evaluated the cost of storing and managing all this information?
§ Is your infrastructure optimized based on your retention?
§ Can you back up and recover your servers fast and reliably enough?
§ How are you addressing these requirements today?
IBM Information Infrastructure for efficient Information Retention
Diagram labels: Support daily activities; Business Value; Future business development; Preservation; Infrastructure Concerns; Cost and Performance; Cultural or Heritage Needs
In a marriage of Big Red and Big Blue, the extensive marketing records of the Coca-Cola Company -- from calendars created by Norman Rockwell to corporate memoranda to the famous commercial in which a chorus proclaims its desire to ''teach the world to sing'' -- have been digitized in an online archive with the help of I.B.M.
"All these wonderful books are only of use if they're read," said the Rev. Leonard Boyle, prefect of the Vatican Library. He said the I.B.M. project would put the library's manuscripts and texts in digital form as a way of broadening the library's reach.
Information Retention Areas of focus
§ Storage infrastructure optimization
  – A comprehensive line of tiered storage, across disk and tape technologies.
  – Policy-based automation software for space management and data retention to move data to the appropriate tier of storage.
  – Data deduplication and compression technologies for storage capacity gains.
§ Content Collection and Archiving
  – Intelligently managing inactive or infrequently accessed data that still has value, while providing the ability to search and retrieve the information during a specified retention period.
  – Integrate with information management solutions from IBM and Technology Partners to handle file systems, databases, email, SharePoint and content management.
§ Long term retention
  – Future-proofing information with data migration capabilities and forward compatibility.
Storage Infrastructure Optimization
Storage infrastructure optimization
Benefit from a blended solution
Cut TCO 50% with Blended Tape and Disk*
Chart: 10-year TCO example, in millions of dollars (axis $0 to $7). Assumes 250 TB storage, 25% growth/yr. Cost components: Floor space; Power & Cooling; Maintenance; Prod + DR Carts; Hardware. Configurations compared: SATA Disk; Tape; Blended Disk and Tape. Values shown: $6,365,950; $2,255,346; $946,405.
Consider the long-term costs of ownership
§ SATA disk offers lower-cost access to online data than FC disk
§ Tape costs less than disk and consumes less energy, but is often not ideal for online access
§ Blended solution:
  – Online access to most recent content
  – Lower cost and energy efficiency for the long term
* TCO estimates based on IBM internal studies.
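The chart's stated assumptions (250 TB of storage, 25% growth per year, a 10-year horizon) imply compound growth in capacity. The following is a minimal Python sketch of that arithmetic only. The function name `projected_capacity_tb` is hypothetical, annual compounding is an assumption, and the sketch does not reproduce the cost model behind the TCO figures, which the slide attributes to IBM internal studies.

```python
def projected_capacity_tb(initial_tb: float, annual_growth: float, years: int) -> list[float]:
    """Return capacity in TB for year 0 through `years`, assuming annual compounding."""
    return [initial_tb * (1 + annual_growth) ** year for year in range(years + 1)]


# Slide assumptions: 250 TB to start, 25% growth per year, 10 years.
capacities = projected_capacity_tb(250, 0.25, 10)

for year, tb in enumerate(capacities):
    print(f"Year {year:2d}: {tb:8.1f} TB")
```

Under these assumptions, the starting 250 TB grows to roughly 2,300 TB by year 10, which is why the cost of the storage tier matters more each year.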
Storage infrastructure optimization
Reduce Cost with an Automated Migration Process
Align the business value of information with the most cost effective Information Infrastructure.
Chart: Data Access Needs Throughout Lifecycle. Access need ranges from Fast/High to Slow/Low; data age runs through 1 Hour, 1 Day, 2 Months, 3 Years, 5 Years, 20 Years, 50 Years, 100+ Years. Label: Policy-driven, Automated Data Movement.
Chart: Fully Managed Costs: Storage Options. Relative Cost / GB / Year (scale 0 to 70) for High Perf Disk, Hi Capacity Disk, Online Archive, Online Tape, Offline Tape.
Today: Move data to the most cost effective storage for its current use based on policies
Future: Move data to the most power efficient storage dynamically to satisfy usage requirements
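Policy-driven data movement of the kind described on this slide can be illustrated with a small Python sketch. The tier names are taken from the slide; the age thresholds, the function names `select_tier` and `plan_moves`, and the sample inventory are all hypothetical, chosen only to show the idea of mapping data age to a storage tier. This is not how any particular IBM product implements its policies.

```python
from datetime import timedelta

# Hypothetical policy: (maximum age since last access, tier).
# Tier names come from the slide; the thresholds are illustrative assumptions.
TIER_POLICY = [
    (timedelta(days=60), "High Perf Disk"),
    (timedelta(days=3 * 365), "Hi Capacity Disk"),
    (timedelta(days=5 * 365), "Online Archive"),
    (timedelta(days=20 * 365), "Online Tape"),
]
DEFAULT_TIER = "Offline Tape"


def select_tier(age_since_last_access: timedelta) -> str:
    """Return the first tier whose age threshold covers the given age."""
    for max_age, tier in TIER_POLICY:
        if age_since_last_access <= max_age:
            return tier
    return DEFAULT_TIER


def plan_moves(inventory: dict) -> list:
    """Return (name, current tier, target tier) for items the policy would move."""
    moves = []
    for name, (current_tier, age) in inventory.items():
        target = select_tier(age)
        if target != current_tier:
            moves.append((name, current_tier, target))
    return moves


# Hypothetical inventory: name -> (current tier, age since last access).
inventory = {
    "q1-report.pdf": ("High Perf Disk", timedelta(days=1)),
    "2006-invoices.db": ("High Perf Disk", timedelta(days=4 * 365)),
    "1985-ledger.tif": ("Online Tape", timedelta(days=30 * 365)),
}
moves = plan_moves(inventory)
for name, source, target in moves:
    print(f"{name}: {source} -> {target}")
```

Keeping the policy as a data table rather than as nested conditionals means the thresholds can change without touching the movement logic.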
Storage infrastructure optimization
Drive exceptional efficiency with data deduplication and compression
§ Revolutionary in-line data deduplication
  – Dramatically reduced disk storage requirements: up to 25x reduction in disk space
    • Store up to 25 TB of backups onto 1 TB of disk, in 8 hours
    • Up to 9x faster than competitors
  – Reduced power, space and cooling requirements
  – Improved management: simplifies and speeds information protection
§ Two-to-one, three-to-one tape compression
  – Increase tape capacity utilization
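The slide does not describe how the deduplication works, so the following Python sketch shows only the general idea, using fixed-size chunks identified by SHA-256 digests. It is an illustrative assumption, not the algorithm used by any IBM product; the function names and the synthetic backup data are hypothetical, and the 25:1 ratio it prints is a property of that synthetic data, not a measurement.

```python
import hashlib


def deduplicate(data: bytes, chunk_size: int = 4096):
    """Split data into fixed-size chunks and store each unique chunk once."""
    store = {}   # digest -> chunk bytes (unique chunks only)
    recipe = []  # ordered digests needed to rebuild the original data
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)
        recipe.append(digest)
    return store, recipe


def restore(store: dict, recipe: list) -> bytes:
    """Rebuild the original data from the chunk store and the recipe."""
    return b"".join(store[digest] for digest in recipe)


def reduction_ratio(data: bytes, store: dict) -> float:
    """Return original size divided by the size of the unique chunks."""
    stored = sum(len(chunk) for chunk in store.values())
    return len(data) / stored if stored else 0.0


# Synthetic example: the same two 4 KiB chunks repeated 25 times,
# loosely mimicking repeated full backups of unchanged data.
backup = (b"A" * 4096 + b"B" * 4096) * 25
store, recipe = deduplicate(backup)
ratio = reduction_ratio(backup, store)
print(f"unique chunks: {len(store)}, reduction ratio: {ratio:.0f}:1")
```

Real backup streams rarely repeat this cleanly, so achievable ratios depend heavily on how much data changes between backups.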
Content Collection and Archiving
Archiving
The foundation for Information Retention
Archiving is an intelligent process for managing inactive or infrequently accessed data that still has value, while providing the ability to preserve, search and retrieve the data during a specified retention period.
Not the same as keeping backups forever
Long term in nature
Selected, preserved, disposed according to policy
For retrieval, not recovery
Adds operational efficiencies
Archiving Benefits
§ Improve Productivity
  – Improved Application Availability and Performance
  – Faster Backup and Recovery
§ Manage Risks
  – Adhere to internal corporate policies
  – Reduce manual operations
§ Drive Operational Efficiency
  – Policy-based automation
  – Inactive or infrequently accessed data retained in lower cost storage (disk and tape) for efficiency
  – Data migrated between tiers automatically
§ Reduce Cost
  – Reduce infrastructure costs
    • Power and Cooling requirements
    • Hardware and Software
Mitigate business risk, reduce costs, improve competitiveness
IBM Enterprise Content Management (ECM)
Key building block for Archive and Retention
Diagram: IBM ECM with four numbered components (1 to 4): Records Management; Content Collection; Advanced Classification; Electronic Discovery
IBM Strategic Archiving & Retention Offerings
Data content layer
§ IBM CommonStore, IBM Content Manager, IBM Content Collector
§ IBM FileNet P8 Content Manager, Image Services, SAP Connector
§ IBM Records Crawler
§ IBM Optim
§ Grid Medical Archive Solution (GMAS)
Policy Management layer
§ Tivoli Storage Manager
§ System Storage Archive Manager
§ Data Facility Storage Management Subsystem
Storage Media layer
§ IBM DR550 (includes SSAM)
§ IBM Tape Systems
§ IBM Disk Systems including N series with SnapLock™ feature
Services layer
§ IBM Enterprise Archive Services
Diagram labels: Applications; Files; E-mail; Records; Images; SAP; DBMS; PACS; Siebel; Peoplesoft; Archiving Application; TSM / SSAM Client; Content Collector; CommonStore; FileNet; Optim; GMAS; Archive / HSM Infrastructure; TSM; SSAM; DFSMS; Information Retention Systems; Disk; DR550; N series w/SnapLock; Tape; Enterprise Archive Services
IBM Enterprise Archive Services
Engagement levels: Tell me what to do; Help me to do it; Do it for me
Archiving Assessment
§ Assess business lines' information retention requirements and IT abilities to support them
§ Identify gaps
§ Provide recommendations
§ Cost Benefit analysis
Develop Archiving Strategy
§ Create archiving strategy
§ Define transition planning
§ Design solution approach
§ Cost Benefit analysis
Archiving Solution Architecture
§ Create/update enterprise archive architecture based on reference architecture
§ Define the classes of service
§ Organizational impact / impact on existing architecture
§ High level Solution design
§ Cost Benefit Analysis
Solution Integration
§ Identify all areas in scope
§ Logical and physical design of pilots within reference architecture
§ Define certification criteria
§ Complete solution architecture
Manage and Run
§ Enterprise deployment of services
§ Enterprise deployment of organization
§ Enterprise deployment of technology
Long Term Retention
The “Digital Dark Ages”
Yedioth Ahronoth, Sunday June 1, 2008
Logical Preservation is not a 100 Year Problem
Microsoft Office 2003 SP3 Blocks older File Formats
“The move was done for security, says Microsoft, but still bewilders users”
§ “Office users will no longer be able to open files in 24 older file formats from Lotus, Corel, and most versions of MS Office products before 2000”
§ "The decision to block the formats is strictly to protect your machine from being compromised," according to Microsoft
§ To restore file formats from the dead, users must edit the Windows registry, which Microsoft warns against.
Source: Computerworld.com article by Gregg Keizer, Computerworld, January 2008, http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9055138
Approaches to Long Term Digital Preservation
§ What are companies doing today?
  – Keeping old servers, storage and applications (Museum approach)
  – Emulating old systems with new systems (Emulation approach)
  – Migrating data to new technology (Migration approach)
§ Areas where IBM is focusing its research
  – Methods for descriptive metadata to enable future rendering (Descriptive approach)
  – Encapsulation of data, metadata and related application logic for future processing (Encapsulation approach)
Long-term retention
Preserve information
The media and content are still accessible: Mayan Glyph, Palenque ~630 AD. Coronation of King Pacal on 26 March, 603
This information was created a few years ago
Will you be able to access your data in 20 years? 50 years?
You must future proof your information
Bit Preservation: How do you retrieve a bit perfect copy of digital data after years or decades?
Logical Preservation: Once you’ve retrieved the bit perfect copy, how do you productively use the data?
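One common way to check bit preservation is a fixity check: record a cryptographic digest when the data is archived and recompute it on retrieval. The Python sketch below illustrates that idea under stated assumptions; the function names `sha256_of` and `verify_fixity` and the sample file are hypothetical, and the slide does not say which mechanism IBM's retention systems use. A fixity check addresses bit preservation only; it says nothing about whether the format can still be interpreted, which is the logical preservation problem.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path, block_size: int = 1 << 20) -> str:
    """Compute the SHA-256 digest of a file, reading it in blocks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for block in iter(lambda: handle.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def verify_fixity(path: Path, recorded_digest: str) -> bool:
    """Return True if the file still matches the digest recorded at archive time."""
    return sha256_of(path) == recorded_digest


# Demonstration with a temporary file standing in for an archived object.
with tempfile.TemporaryDirectory() as workdir:
    archived = Path(workdir) / "record.bin"
    archived.write_bytes(b"archived content")
    recorded = sha256_of(archived)            # stored alongside the archive

    intact = verify_fixity(archived, recorded)

    archived.write_bytes(b"archived c0ntent")  # simulate silent corruption
    corruption_detected = not verify_fixity(archived, recorded)

print(f"intact before corruption: {intact}")
print(f"corruption detected: {corruption_detected}")
```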
IBM Provides the Complete Information Infrastructure
Information Retention
Global Business Services
§ Line of business oriented consulting and services around archive and information retention for the business executive
Global Technology Services
§ Planning, design and implementation services specifically for security, compliance, archive & retention of information for the IT staff
Software
§ Integrated middleware and software for managing a wide variety of content. Content collection, records management, classification and ediscovery/search
Storage & Servers
§ Storage systems providing multiple tiers of storage, performance, scale, and cost
§ Storage software running on highly reliable and scalable servers.
Integrated Solutions
§ Soft bundles for customers needing maximum flexibility with ease of ordering and deployment.
§ Appliances for customers desiring turnkey solutions and simplicity
Global Archive Solutions Center
§ Current Topics include
  – IBM Archive Solutions Overview
  – Email Archive Solutions Overview with solution demo
  – Database Archive Solutions Overview with solution demo
  – Space Management for File Systems with Demo
  – File System Archive with DR550 and N series with SnapLock demos
§ SMEs developing additional integrated archive solutions such as:
  – GMAS, SAP Archiving, FileNet P8 and IBM Content Manager
Contact Us
§ To book a briefing, contact your IBM Representative, IBM Business Partner, or Briefing Center Coordinator, Lee Olguin at lolguin@us.ibm.com.
§ For more information, visit: http://www-03.ibm.com/systems/services/briefingcenter/tbc/
Quick assessment
Storage Infrastructure Optimization
  – What is driving your information growth: databases, email, or other files?
  – Are you able to control your IT budget to handle current and anticipated information growth?
  – Are you aware of inactive, orphaned or stale data on your most expensive storage systems?
  – Do you manually migrate data to less expensive storage to help reduce costs?
Content Collection and Archiving
  – Has information growth impacted the performance of your key applications?
  – Are you aware of recent changes in legislation that require information to be retained longer?
  – Would you pass an audit if a regulator asked to see how electronic data is stored, discovered, retrieved and deleted?
Long-term Retention
  – Under what circumstances does your company request information to be kept for 7 years or longer?
Getting Started
§ Visit ibm.com/information_infrastructure for more information
§ Download whitepaper “Addressing Archiving and Retention Challenges”
§ Listen to podcast “Addressing Archiving and Retention Challenges”
§ Sign up for an in-depth briefing at one of our Briefing Centers worldwide
§ Visit http://www-935.ibm.com/services/us/index.wss/offerfamily/gts/a1027722 to schedule an Archive Workshop with GTS
Additional information
§ Retention Systems
§ IBM Solutions portfolio
§ Customer References
IBM Information Retention Systems
Diagram labels: Tivoli Storage Manager; System Storage Archive Manager; Grid Access Manager; Non-erasable, Non-rewriteable; Read/Write; Online archive; DR550; N series SnapLock; Removable; SATA disk; WORM; Nearline and Offline archive; Tape Systems
IBM System Storage DS family
Diagram labels: Entry Level; Midrange; Disk Family; Enterprise Disk (z & open); High-end Open Systems Disk; Modular
Products: DS3000; DS4000; DS5000; XIV; DS6000; DS8000 Turbo
Enterprise Storage Continuum
50 Years of Disk
IBM System Storage™ N series Hardware Portfolio
Filers
§ Entry Level N3000: N3300, N3600
§ Midrange N5000/N6000: N6040, N6060, N6070
§ Enterprise N7000: N7700, N7900
Expansion drawers
§ EXN4000 FC Storage Expansion Unit (4 Gb)
§ EXN1000 SATA Storage Expansion Unit
Gateways (gateways use external storage)
§ N6040, N6060, N6070
IBM System Storage TS family
Classes: Enterprise Class; Midrange; Entry Level
Tape virtualization
§ TS7530 (distributed)
§ TS7650G (distributed)
§ TS7740 (mainframe)
Tape automation
§ TS3500, 3494, TS3400, TS3310, TS3200, TS3100, TS2900
Tape drives
§ TS2230, TS1030 (LTO 3), TS1040 (LTO 4), TS1130 (3592)
IBM Retention Solutions enabled with Leading ISVs & PACS vendors
§ ISVs (ECM and Archiving)
  – Autonomy (ex Zantaz)
  – AXS-One Inc.
  – BrainTribe (formerly Comprendium)
  – Bycast Inc.
  – IBM CommonStore
  – IBM Content Manager
  – IBM FileNet
  – IBM Optim (Princeton Softech)
  – CaminoSoft
  – Ceyoniq
  – d.velop AG
  – Front Porch Digital
  – Easy Software
  – Enigma Data Systems
  – Gamma Systems
  – HABEL GmbH & Co. KG
  – Heilig & Schubert (H&S)
  – Hummingbird
  – Hyland Software (OnBase)
  – Hyperwave
  – Interwoven
  – Lighthouse Global Technologies
  – Mimosa
  – MBS Systems Technologies
  – OpenText (formerly IXOS)
  – PBS Software GmbH
  – RJS Software Systems
  – Saperion
  – SER Solutions
  – Solix Technologies
  – Stellar Technologies
  – Symantec / Veritas / KVS EV
  – Windream
§ PACS (Medical Imaging) vendors
  Siemens, AGFA, GE Healthcare, Philips, McKesson, Cerner, Fujifilm, eRadPACS, ACUO Technologies, DeJarnette
Iowa Health Improves Productivity
Need high capacity, scalable and automated archival solution for PACS digital images
Reduce archive management TCO dramatically
Solution to span across multiple centers
Near continuous access to imaging files with security
Solution
§ IBM Grid Medical Archive Solution storing 30 TB of PACS images and clinic files annually
§ Non-disruption of production environment
§ Total Solution Components:
  – IBM System Storage DS4000 disk systems, Enterprise Storage Server 800
  – Plans to implement tape tier in future for less frequently accessed data.
  – IBM GTS Implementation Services
Result
§ Eliminated traditional back-up and restoration window
§ Improved performance: reduced time to image access by 70%
§ All hospitals within network can share information
§ Reduced data management costs by 90%
Support for two datacenters, 200+ TBs of patient images
Multiple points of failure placing data at risk
Explosive growth of clinical patient data, requiring continuous availability to physicians
Desire for multi-vendor strategy demanding open, industry standard interfaces
Flexibility to absorb storage media growth & manage data migration without down-time
FTEs overly engaged on current HW problem determination/issues vs proactive activities
Solution
§ IBM GMAS spinning disk solution using IBM and Grid Access Mgr SW
§ Grid spans across two data centers
§ Utilizes repurposed legacy storage
§ Supports AGFA enterprise application
Result
§ Storage cost optimization and TCO reduction for long-term data archive
§ Virtualization across existing storage
§ Digital signatures for no single point of failure data integrity and "corruption correction"
§ Auto-replication and encryption of all patient images
§ Reduction in application down-time regardless of growth & migration needs
is 4 Optimizes storage
Retain copies of critical SAP data for extended periods of time to comply with government regulations
Reduce the cost of managing and storing data
Solution
§ Includes IBM Information Infrastructure for SAP data retention and compliance
§ IBM CommonStore for SAP
§ IBM System Storage DR550
§ IBM Tivoli Storage Manager
Result
§ Optimized storage infrastructure for the entire information lifecycle, from creation to disposal, leveraging disk and tape storage
§ Data is protected against deletion or modification
“With IBM storage solutions, we’ve been able to cost-effectively manage and retain critical SAP data and leverage its business value throughout its lifecycle.”
Koninklijke Bibliotheek (National Library of the Netherlands) Preserves information
Collect, maintain and preserve an archive of all publications (books, papers, periodicals, scientific publications) issued in The Netherlands
Store & retrieve electronic publications on a large scale
Required Long Term Preservation
Solution
§ IBM Content Manager, Tivoli Storage Manager, WebSphere Application Server, DB2, AIX
§ IBM p570 system, IBM DS6800 storage system, IBM 358494 Tape Libraries, Plasmon G638 Optical Library
§ IBM Global Services and Almaden Research Laboratory
Result
§ Estimated savings of $5m (USD) per year by avoiding manual cataloging and storing
§ Preservation of and improved access to the national cultural heritage of The Netherlands
§ Better, easier and faster access to information
Disclaimer
§ IBM's customer is responsible for ensuring its own compliance with legal requirements. It is the customer's sole responsibility to obtain advice of competent legal counsel as to the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer's business and any actions the customer may need to take to comply with such laws.
§ IBM does not provide legal advice or represent or warrant that its services or products will ensure that the customer is in compliance with any law.
§ The information contained in this documentation is provided for informational purposes only. While efforts were made to verify the completeness and accuracy of the information provided, it is provided “as is” without warranty of any kind, express or implied. IBM shall not be responsible for any damages arising out of the use of, or otherwise related to, this documentation or any other documentation. Nothing contained in this documentation is intended to, nor shall have the effect of, creating any warranties or representations from IBM (or its suppliers or licensors), or altering the terms and conditions of the applicable license agreement governing the use of IBM software.
Trademarks and Disclaimers
© IBM Corporation 1994-2008. All rights reserved.
References in this document to IBM products or services do not imply that IBM intends to make them available in every country.
Trademarks of International Business Machines Corporation in the United States, other countries, or both can be found on the World Wide Web at http://www.ibm.com/legal/copytrade.shtml.
Intel, Intel logo, Intel Inside logo, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Microsoft, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. UNIX is a registered trademark of The Open Group in the United States and other countries. Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others.
Information is provided "AS IS" without warranty of any kind.
The customer examples described are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics may vary by customer.
Information concerning non-IBM products was obtained from a supplier of these products, published announcement material, or other publicly available sources and does not constitute an endorsement of such products by IBM. Sources for non-IBM list prices and performance numbers are taken from publicly available information, including vendor announcements and vendor worldwide homepages. IBM has not tested these products and cannot confirm the accuracy of performance, capability, or any other claims related to non-IBM products. Questions on the capability of non-IBM products should be addressed to the supplier of those products.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Some information addresses anticipated future capabilities. Such information is not intended as a definitive statement of a commitment to specific levels of performance, function or delivery schedules with respect to any future products. Such commitments are only made in IBM product announcements. The information is presented here to communicate IBM's current investment and development activities as a good faith effort to help with our customers' future planning.
Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput or performance improvements equivalent to the ratios stated here.
Photographs shown may be engineering prototypes. Changes may be incorporated in production models.
- Slides: 39