Science DMZ Christopher Paolini Computational Science Research Center

  • Slides: 12
Download presentation
Science DMZ Christopher Paolini Computational Science Research Center San Diego State University 100 G

Science DMZ Christopher Paolini Computational Science Research Center San Diego State University 100 G and Beyond Workshop: Ultra High Performance Networking in California Calit 2 Auditorium • First floor, Atkinson Hall • UC San Diego • La Jolla, CA Tuesday, February 26, 2013 · Campus and Lab Strategies Panel · 11: 00 AM – 12: 00 PM

The Problem: Security vs. Performance � � � � University network operations centers support

The Problem: Security vs. Performance � � � � University network operations centers support multiple, conflicting missions. Network Security or Network Performance: which is more important? NOCs typically accountable to university business divisions and contend with legal and public relations pressures → security wins always. NOCs not usually accountable to research groups (often never communicate with faculty). University enterprise (e. g. general purpose/financial/personal) computing: security > performance Computational and “Big Data” research: performance > security What can we do to ensure efficient scientific data transfer between universities and national labs? vs.

The Solution: Science DMZ � � A network optimized for business is not designed

The Solution: Science DMZ � � A network optimized for business is not designed or capable of supporting data intensive science. Universities will always need to support security features that protect organizational financial and personnel data. Solution: create separate data intensive science network, external to university enterprise network Design formalized by ESnet, based on traditional network DMZ paradigm

Basic Science DMZ � Science DMZ: (1) dedicated access to high-performance WAN, (2) high-performance

Basic Science DMZ � Science DMZ: (1) dedicated access to high-performance WAN, (2) high-performance switching infrastructure (large buffer memory), (3) dedicated data transfer nodes

Science DMZ through CENIC Cal. REN � Science DMZ using CENIC California Research and

Science DMZ through CENIC Cal. REN � Science DMZ using CENIC California Research and Education Network resources

SDSU Science DMZ Implementation NSF Office of Cyber. Infrastructure CC-NIE Grant 1245312 • Alcatel-Lucent

SDSU Science DMZ Implementation NSF Office of Cyber. Infrastructure CC-NIE Grant 1245312 • Alcatel-Lucent 10 and 40 Gbps switching devices, per CSU policy • DMZ spans four campus buildings: Administration, Life Sciences (CSRC Data Center), Education & Business Administration (UCO Data Center), and Chemical Sciences (Viz. Center) • Primary users: CSRC affiliated faculty and students • AL Omni. Vista 2500 for network management

Computational Science Network (CSRCnet) • Computational science network connects to the DMZ • Funded

Computational Science Network (CSRCnet) • Computational science network connects to the DMZ • Funded in 2009 through NSF MRI award 0922702 • 8 Cisco 10 Gbps Catalyst 4900 M switching devices • CSRCnet spans five campus buildings: Administration, Life Sciences (CSRC Data Center), Education & Business Administration (UCO Data Center), Physics, and Engineering • Sole users: CSRC affiliated faculty and students • 10 G access to SDSC

SDSU Science DMZ Features and Goals � � � � � Facilitate high-performance data

SDSU Science DMZ Features and Goals � � � � � Facilitate high-performance data transfer for scientific applications using Globus Online Grid. FTP Alcatel-Lucent Omni. Switch 10 K (core device) Two Alcatel-Lucent Omni. Switch 6900 s (satellite devices) Dedicated and independent 10 GE (maybe 40 GE) uplink to Internet 2 and ESnet via CENIC Optimized network for high-volume bulk transfer of scientific datasets Unencumbered, high-speed access to online scientific applications and data generated at SDSU External access to science resources not impacted by regular “enterprise” or business class Internet traffic Focus on “Big. Data” Intensive Science: earthquake rupture and wave propagation, parallel 3 D unified curvilinear coastal ocean modeling, geologic sequestration simulation of supercritical CO 2, large-scale proteomic data, bioinformatics of gene promoter analysis, microbial metagenomics, and high -order PSIC methods for simulation of pulse detonation engines Network performance measurement based on the Perf. SONAR framework In. Common Federation global federated system for identity management and authentication to DMZ connected hosts and services

Globus Online Grid. FTP � � Extension of the standard, two channel FTP protocol

Globus Online Grid. FTP � � Extension of the standard, two channel FTP protocol Control Channel ◦ Command/Response ◦ Used to establish data channels ◦ Basic file system operations (e. g. mkdir, delete, etc. ) Data channel: Pathway over which file is transferred Scheduled transfers using command line interface: $ scp xsede#lonestar 4: ~/GO/bigdatafile xsede#trestles: ~/GO/bigdatafile $ scp xsede#trestles: ~/GO/bigdatafile paolini#sdsu: ~/GO/bigdatafile

� � � Science DMZ performance monitoring accomplished using perf. SONAR tool suite Server

� � � Science DMZ performance monitoring accomplished using perf. SONAR tool suite Server side tools run on designated hosts attached to key switches End-to-end testing with collaborating perf. SONAR sites Determine one way latencies and packet loss between hosts using One-Way Active Measurement Protocol (OWAMP) owping -c 10000 -i. 01 remotedmz Periodic throughput tests to remote Science DMZs using Bandwidth Test Controller (BWCTL) Resource allocation and scheduling daemon for regularlyscheduled Iperf tests bwctl -s remotedmz -P 4 -t 30 -f M -w 4 M -S 32

� � U. S. education and research identity federation service Provides common framework for

� � U. S. education and research identity federation service Provides common framework for trusted shared management of access to on-line resources Provide users single sign-on convenience and privacy protection – Shibboleth Service Provider Federating software Site admins can delegate responsibility for administering service provider (SP) metadata to another admin

SDDU Science DMZ Planning and Integration � Primary SDSU faculty/staff for Science DMZ implementation:

SDDU Science DMZ Planning and Integration � Primary SDSU faculty/staff for Science DMZ implementation: Name Role E-Mail Phone Christopher Paolini CSRC Affiliated Faculty, Network Engineering and Research paolini@engineering. sdsu. edu (619) 594 -7159 Jose Castillo Director of Computational Science Research Center jcastillo@mail. sdsu. edu (619) 594 -3430 Rich Pickett Campus CIO rich. pickett@sdsu. edu (619) 594 -8370 Kent Mc. Kelvey Director of Network Services kent@sdsu. edu (619) 594 -3245 Skip Austin Network Planning and Design austin@mail. sdsu. edu (619) 594 -4211 Gene Le. Duc Technology Security Officer (TSO) gleduc@mail. sdsu. edu (619) 594 -0838 Robert Osborn Infrastructure Installation, Configuration, and Support osborn@mail. sdsu. edu (619) 594 -6004 � Current and planned DMZ related research: Development of new transport layer protocols that use compressed sensing techniques to perform sparse sampling on streaming petabyte sized datasets originating from remote CO 2 sequestration, curvilinear coastal ocean modeling, and earthquake rupture and wave propagation simulations Development of a new Alcatel-Lucent SDN/Application Fluent Network based protocol for the OS 10 K that bridges Lustre RDMA traffic between 40 GE and FDR Infini. Band