Stephen Dart La RDS Service Manager Monash eResearch

  • Slides: 24
Download presentation
Stephen Dart La. RDS Service Manager Monash e-Research Centre La. RDS Staging Post Enhancing

Stephen Dart La. RDS Service Manager Monash e-Research Centre La. RDS Staging Post Enhancing Workgroup Productivity

Managing User Expectation

Managing User Expectation

In a perfect world • • • Dedicated wire 1 Gb/s 125 MB per

In a perfect world • • • Dedicated wire 1 Gb/s 125 MB per second 7. 5 GB per minute 450 GB per hour 10 TB per day In reality, inconsistency • Slow Speed • 3~30 MB per second • Workstation, Server or La. RDS? • Share Hangs or Disconnects • Please Explain!

Network at the Edge

Network at the Edge

Complications at the Core

Complications at the Core

Current La. RDS Samba service - La. RDS Samba service for workgroup sharing files

Current La. RDS Samba service - La. RDS Samba service for workgroup sharing files End user experience is speed limitations Not suited for workstation backup Not suited for bulk upload Oversubscribed disk is pushed to tape Something faster please

Many factors to make things work slow • Current situation – La. RDS Samba

Many factors to make things work slow • Current situation – La. RDS Samba based on virtual server – Workstations at the edge of the network – Network bandwidth contention getting to La. RDS

Current ARMI workstation service - Single Network Port per workstation - 1 Gb/s bit

Current ARMI workstation service - Single Network Port per workstation - 1 Gb/s bit rate on port - Effective throughput peak below 10% - Common network switch for whole floor - Can handle many point to point within floor - Must share floor bandwidth to building switch - Common network switch for building - Must share building bandwidth to precinct switch

What can be done now - Provide a local data service for workstations -

What can be done now - Provide a local data service for workstations - Install Staging Post on same switch as users - bypass Ve. RA for uploads and backup - Increase bandwidth between floor switch and the precinct router - Extra floor and building uplinks - Faster links between switches

What can be done now - Offload the big data as quickly as possible

What can be done now - Offload the big data as quickly as possible - To a local cache that can be used as a working share - Sync the data on a daily basis with La. RDS

Something still not right • NAS on same switch and subnet as workstation •

Something still not right • NAS on same switch and subnet as workstation • One session ok, but second session kills first! • Network engineers insist NAS too slow and dropping packets • Serious detective work starts

Network Engineers in Denial • Network bandwidth to NDT server – http: //ndt. its.

Network Engineers in Denial • Network bandwidth to NDT server – http: //ndt. its. monash. edu. au/toolkit/ • Network bandwidth to Speedtest. net – http: //www. speedtest. net/ • Network Weather Map all clear – http: //cacti. its. monash. edu. au/cacti/weather map/weathermap. html – Low utilization and no errors

Qo. S Policy set at default for VOIP

Qo. S Policy set at default for VOIP

Research networks generate data at the edge for upload to the core Traditional Corporate

Research networks generate data at the edge for upload to the core Traditional Corporate Intranet Research and Instrumentation Intranet

Tackle System Integration • Rethink Qo. S – Trial with Qo. S off (unmanaged)

Tackle System Integration • Rethink Qo. S – Trial with Qo. S off (unmanaged) – Open call with CISCO – TCP/IP behaviour – Get Network Engineers trained in Qo. S • Make sure NAS connected to AD – Ve. RA Samba was not AD connected

What can be done now - Offload the big data as quickly as possible

What can be done now - Offload the big data as quickly as possible - To a local cache that can be used as a working share - Sync the data on a daily basis with La. RDS

Updated Qo. S rolled out to all switches

Updated Qo. S rolled out to all switches

Five Size Options for Staging Post Capacity User load, NIC Speed Cost QNAP-509 Pro

Five Size Options for Staging Post Capacity User load, NIC Speed Cost QNAP-509 Pro 5 x 1. 5 TB (6 TB RAID 5) ~10 users, $2, 500 QNAP-809 Pro 8 x 2 TB ~20 users, 1 Gb/s (12 TB RAID 5) $4, 300 QNAP-859 URP (rackmounted) 8 x 3 TB ~30 users, 1 Gb/s (18 TB RAID 6) $4, 750 QNAP-1279 U-RP 12 x 3 TB ~50 users, 10 Gb/s (30 TB RAID 6) $8, 000 SGI ISS-3500 24 x 2 TB ~100 users, 10 Gb/s (40 TB RAID 6) $25, 000

Re arrange existing disk usage • Provide two file systems match usage • Working

Re arrange existing disk usage • Provide two file systems match usage • Working data sets (fast, local disk) – Online now, used often, interim results • Archive data sets (deep, NFS to DMF) – Step or phase completion – Reference for future work – Storage object as a group of files – Publication and citation

Integrate with Grid Access • Grid Users using DMF for home folders – Grid

Integrate with Grid Access • Grid Users using DMF for home folders – Grid processes flooding DMF shares – Many small files gone by the time they hit the front of the migration queue – DMF recalls stall Grid jobs • Provide non-DMF Grid Scratch – Don’t back it up

Outstanding Issues • Speeding up other VMs without hardware scale out • Presenting Samba

Outstanding Issues • Speeding up other VMs without hardware scale out • Presenting Samba users with indication of Offline status • User Indoctrination

Questions

Questions