IBM Spectrum Scale formerly GPFS A cluster file
IBM Spectrum Scale (formerly GPFS) A cluster file system with high-performance, high availability and parallel file access
Clustered and Distributed File Systems Clustered file systems Distributed file systems File system shared by being simultaneously mounted on multiple servers accessing the same storage File system is accessed through a network protocol and do not share block level access to the same storage Examples: IBM Spectrum Scale, Oracle Cluster File System (OCFS 2), Global File System (GFS 2), Lustre Examples: NFS, Open. AFS, CIFS 2 IBM Spectrum Scale - General overview © 2017 IBM Corporation
IBM Spectrum Scale IBM’s shared disk, parallel cluster file system Shared disk: all data and metadata on storage devices accessible from any node through block I/O interface (“disk”: any kind of block storage device) Linux Cluster: Fast reliable communication, common admin domain Virtualization Management Hardware resources Parallel: data and metadata flow from all of the nodes to all of the disks in parallel. Features: Native encryption, compression, native protocols, cloud tiering, disaster recovery, native raid 3 © 2017 IBM Corporation
Spectrum Scale Enables An Extremely Flexible Architecture Tokyo NY 100 = TB Case 1 2 servers with Spectrum Scale Software Fronting 100 TB storage 20 = PB Case 2: Add Performance Case 3: Increase Capacity Add compute nodes Add any storage Faster Network Virtually unlimited scaling 50 = PB Case 4: Global Share Use Active File Management to expand your global namespace IBM and Business Partner Use Only 6 © 2017 IBM Corporation
Spectrum Scale Features & Applications Linux Standard file system interface with POSIX semantics – Metadata on shared storage – Distributed locking for read/write semantics Highly scalable High capacity (up to 299 bytes file system size, up to 263 files per file system) High throughput (TB/s) Wide striping Large block size (up to 16 MB) Multiple nodes write in parallel Advanced data management Snapshots, storage pools, ILM (filesets, policy) Backup HSM (DMAPI) Remote replication, WAN caching High availability Fault tolerance (node, disk failures) On-line system management (add/remove nodes, disks, . . . ) Virtualization Management Hardware resources 7 © 2017 IBM Corporation
Flexible Topologies for GPFS Cluster Configuration Storage Area Network (SAN) Model Network Shared Disk (NSD) Server Model Application Nodes TCP/IP Network Application Nodes Storage Network TCP/IP or Infinband Network I/O Servers Storage Shared Nothing Cluster Model TCP/IP or Infinband RDMA Network 8 © 2017 IBM Corporation
What Spectrum Scale is NOT Not a client-server file system like NFS, CIFS or AFS Client Nodes TCP/IP Network No single-server performance and bottleneck scaling limits File Server Storage data Network Metadata Server metadata Metadata Data No centralized metadata server 9 © 2017 IBM Corporation
The Logical File System View IBM Spectrum Scale Simple. Powerful. Economical. Maximum file system size of one million yottabytes FS 1 FS 2 . . . • One big file system or you can divide into as many as 256 smaller file systems FS 256 • Each file system can be further divided into fileset containers (tree branches) Spectrum Scale policy based data migration File 1 File 2 File 3 • A rule can apply to any file being created or only to files being created within a specific fileset or group of filesets. • Define soft or hard quota by user, group or fileset File 4 IBM and Business Partner Use Only 10 © 2017 IBM Corporation
Enhanced protocol support The challenge: How can I share my storage infrastructure across all of my legacy and new generation applications? The Solution The new IBM Spectrum Scale Protocol Node allows access to data stored in a Spectrum Scale filesystem, using additional access methods and protocols. The Protocol Node functions are clustered and can support transparent failover for NFS and SWIFT protocols as well as SMB protocols. Multiprotocol data access from other systems using the following protocols NFS v 3 and v 4 SMB 2 and SMB 3. 0 mandatory features / CIFS for Windows support. Open. Stack Swift and S 3 API support for object storage. 11 11 © 2017 IBM Corporation
The Solution: IBM Spectrum Scale™ brings it all together IBM Spectrum Scale™ replaces HDFS and NAS file storage IBM Spectrum Scale™ replaces SAN-based file systems Replaces NTFS, EXT 4, JFS 2 and other POSIX file systems Used by over 200 of the top 500 supercomputers No file transfers required between different OS Can be used with everything from databases to video streaming For x 86, POWER and z System servers Secure with Data-at-rest encryption Full Hadoop interfaces for Map/Reduce analytics processing No transfer or ingest required as the data is already there Fully protected with Backup Software File-level access support for NFS, CIFS, FTP, SCP and HTTPS Supports Enterprise File Sync-and-Share via Own. Cloud or Funambol IBM Spectrum Scale™ offers Object access Object-level access based on Open. Stack Swift driver and Amazon S 3 APIs Global Name Space IBM Spectrum Scale™ supports all media Spans flash, disk and tape media 12 12 © 2017 IBM Corporation
Unleash New Storage Economics on a Global Scale Client workstations Users and applications Compute Farm Single name space SMB/CIFS POSIX NFS Map Reduce Connector Open. Stack Cinder Manila Swift Glance IBM Spectrum Scale Site A Site B Automated data placement and data migration Off Premise Site C Tape Flash Disk Storage Rich Servers Multi-cloud Storage Toolkit 1 313 © 2017 IBM Corporation
Why is Spectrum Scale of interest to customers? What problems does Spectrum Scale solve? Insufficient Capacity and Performance Spectrum Scale-based file servers can scale to enormous performance and capacity, avoiding storage islands and staying easy to manage. Unreliable Storage Spectrum Scale based file systems can survive failures of many components without incurring data loss and while remaining available. It has techniques for monitoring components and recovering from failures extremely quickly. Cost Escalation By avoiding storage islands much cost is saved. Management cost, cost of application downtime, cost of over/under provisioning storage, all can be reduced or eliminated. Additional cost can be saved by having Spectrum Scale automatically move files to cheaper disks or even to tape. Geographically distributed data sharing Spectrum Scale lets organizations securely share data across different branches and locations, providing remote access and automated distribution getting maximum value from data organization-wide. 15 IBM Spectrum Scale - General overview © 2017 IBM Corporation
16 © 2017 IBM Corporation
17 © 2017 IBM Corporation
18 © 2017 IBM Corporation
Supports a Wide Range of Hardware and Software Operating systems Hardware platforms Linux® Red Hat Enterprise Linux v 6 / 7 SUSE Linux Enterprise Server v 11/12 Debian v 6 / 7 Ubuntu 14. 04/16. 04 IBM Power Big Endian IBM Power Little Endian x 86_64 IBM AIX® v 6. 1 / 7. 1 Windows® Server 2008 x 64 (SP 2); Windows® Server 2008 R 2; Windows 7 x 64 SP 1; Windows Server 2012 R 2; Windows 8. 1 Storage z Systems IBM Storage and storage hardware from all vendors such as: EMC, Hitachi, Hewlett Packard, DDN 19 IBM Spectrum Scale - General overview © 2017 IBM Corporation
IBM Spectrum Scale benefits over NAS Better performance Simplified Management Lower Cost Eliminate hotspots with massively parallel access to files Sequential I/O with ES greater than 400 GB/s Throughput advantage for parallel streaming workloads, e. g. Tech Computing and Analytics More Storage. More Files. Hyper Scale. Easier management with one global namespace instead of managing islands of NAS arrays, e. g. no need to copy data between compute clusters Integrated policy driven automation Fewer storage administrators required Optimizes storage tiers including flash, disk and tape Increased efficiency and more efficient provisioning due to parallelization and striping technology Remove duplicate copies of data, e. g. run analytics on one copy of data without having to set up a separate silo 20 20 © 2017 IBM Corporation
Object Store vs. File System Object Store Interface File System Interface Web-based: GET/PUT/DELETE POSIX: Open/Seek/Read/Write/Close RESTful: Stateless Stateful Metadata Synchronization Eventual Consistency No distributed locking Software Defined Storage Synchronization Strict Consistency Uses distributed locking Hardware and Software Best of breed hardware Commodity hardware Designed not to fault Designed to fault but never fail Admin controlled recovery Built to auto-recover by design Features Basic services that scale (KISS) Features Abundant enterprise features built into the products SW extendible weba Web interfaces REST: Representational State with Transfer, Service API 21 21 © 2017 IBM Corporation
The History of Spectrum Scale This infographic is the genealogy of IBM Spectrum Scale, from it’s birth as a digital media server and HPC research project to it’s place as a foundational element in the IBM Spectrum Storage family. It highlights key milestones in the product history, usage, and industry to convey that Spectrum Scale may have started as GPFS, but it is so much more now. IBM has invested in the enterprise features that make it easy to use, reliable and suitable for mission critical storage of all types. 2 2 22 © 2017 IBM Corporation
Resources ibm. com: ibm. com/systems/platformcomputing/products/gpfs/ Public Wiki: ibm. com/developerworks/community/wikis/home? lang=en#!/wiki/General Parallel File System (GPFS) IBM Knowledge Center: ibm. com/support/knowledgecenter/SSFKCN/gpfs_welcome. html? lang=en Data sheet: IBM General Parallel File System (GPFS) Version 4. 1 ibm. com/common/ssi/cgibin/ssialias? subtype=SP&infotype=PM&appname=STGE_DC_ZQ_USEN&htmlfid=DCD 12374 USEN&attachment=DCD 12374 USEN. PDF Presentation: will be published soon Spectrum Scale Quick Install for Linux on IBM System z IBM Internal: Spectrum Scale Sales Wiki: https: //w 3 -connections. ibm. com/wikis/home? lang=en#!/wiki/GPFS Sales Wiki/page/Elastic Storage Sales Wiki GPFS Quick Reference Guide: http: //w 3 -01. ibm. com/sales/ssi/cgibin/ssialias? subtype=ST&infotype=SA&appname=STGI_DC_ZQ_USEN&htmlfid=DCY 12364 USEN&attachment=DCY 12364 USEN. PDF GPFS Conversation Starter: http: //w 3 -01. ibm. com/sales/ssi/cgibin/ssialias? subtype=RG&infotype=PM&appname=STGI_DC_ZQ_USEN&htmlfid=DCO 01637 USEN&attachment=DCO 01637 USEN. PPT Presentation: Selling General Parallel File System (GPFS™): http: //w 3. ibm. com/sales/ssi/cgibin/ssialias? subtype=PS&infotype=SA&appname=STGI_DC_ZQ_USEN&htmlfid=DCP 03210 USEN&attachment=DCP 03210 USEN. PPT https: //w 3 -01. sso. ibm. com/learning/registry/assets/Learning. Technologies/ltu 43563? source. Url=http%3 A%2 F%2 Flt. be. ibm. com%2 Fstg%2 Fltu 43563 Technical overview of GPFS - GPFS Tutorial 2013 (presentations from Frank Schmuck): https: //gpfs. almaden. ibm. com/wiki/GPFS_Presentations#Frank_Schmuck 23 IBM Internal Only © 2017 IBM Corporation
Easier to Try Spectrum Scale virtual machine Turn-key Spectrum Scale VM available for download Try the latest Spectrum Scale enhancements Full functionality on laptop, desktop or server Incorporate external storage Use for live demonstrations, proof of concepts, education, validate application interoperability Scripted demonstrations External download link http: //www-03. ibm. com/systems/storage/spectrum/scale/trial. html Download VM image, quick start guide, explorer guide, advanced user guide Limitations Virtual. Box hypervisor only Type-2 Hypervisor limits performance Not supported for production workloads Can not be migrated to bare metal 24 © 2017 IBM Corporation
- Slides: 21