Grid Network Performance Monitoring for eScience Mark Leese

Grid Network Performance Monitoring for e-Science
Mark Leese - Daresbury Laboratory
Thursday 15/5/2003
m.j.leese@dl.ac.uk
http://gridmon.dl.ac.uk/~mjl

Contents
• Purpose of the work
• GridMon: what it does & how it does it
• Progress & examples
• The future
• Conclusion
• Questions?

Purpose of the work
“…design and deploy an infrastructure for network performance monitoring within the UK e-Science community.”
• Fault finding
• Performance prediction
Key aspects:
• Publish results
  – (adaptive) Grid middleware and Grid apps
  – Visualisation for humans
• End-to-end ability of TCP wrt high b/w networks

Monitoring: What?

Monitoring: How(1)?
[Diagram: monitoring architecture (mesh). Labels: monitor node; IperfER, PingER, UDPmon, MiperfER, bbcp/ftp; 30 mins; publication service; www visualisation; Grid middleware]
Tools installed on dedicated & similar node at each centre
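A mesh layout means the number of tests grows much faster than the number of sites. The sketch below is only an illustration: it assumes a full mesh in which every node tests towards every other node once per cycle, which the slide does not spell out. The function name and the site counts are hypothetical.

```python
def mesh_test_pairs(sites: int) -> int:
    """Directed (source, destination) pairs in a full mesh of `sites` nodes.

    Assumption: every node tests towards every other node once per cycle.
    """
    if sites < 0:
        raise ValueError("sites must be non-negative")
    return sites * (sites - 1)


# Illustrative site counts only; doubling the sites roughly quadruples the tests.
for n in (12, 24, 48):
    print(f"{n} sites -> {mesh_test_pairs(n)} tests per cycle")
```

Under this assumption the per-cycle test count is quadratic in the number of sites.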

Monitoring: How(2)?

Progress
• Initial toolkit (IperfER, PingER, UDPmon) at 11 of 12 e-Science centres. Other useful UK sites being added.
• MiperfER on beta trial at Cambridge, Cardiff, MCC and Newcastle.
• Active map also on trial.
• Click for AMAZING live demo

Examples: PingER

Examples: IperfER(1)

Examples: IperfER(1)

Examples: IperfER(2)

Examples: MiperfER

Examples: bbcp & bbftp

The Future
[Diagram: (Tasker’s) Trident. Labels: Tools; bbcp/ftp; GridFTP; Web service; www i/f]
Longer term:
• More sites…but mesh doesn’t scale!
• Wishlist features (but perhaps not all 3,000)
• Investigate other issues: window sizes, QoS…
And…

TCP Best Practice
• Demo TCP best practice to users
  – “this is what you can achieve, if you ‘tune’ your machine like this...”
• Possible “variables”:
  – Kernel versions and patches
  – MTU
  – Interrupt handling
• However:
  – “...performance from copper-based GigE cards are intimately connected with judicious use and understanding of the corresponding driver(s)...” Gray and Betz
  – Bus speed, bus width…
No new fangled gizmos – using what we already have!
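One tuning effect that is easy to estimate is the TCP window size relative to the bandwidth-delay product (BDP) of the path. The following is a minimal sketch using the standard relations (BDP = bandwidth × RTT; a single flow cannot exceed window / RTT). The function names and the example link speed, RTT and window are hypothetical values chosen for illustration, not measurements from GridMon.

```python
def bdp_bytes(bandwidth_bps: float, rtt_s: float) -> float:
    """Bandwidth-delay product in bytes: the data 'in flight' needed to fill the path."""
    return bandwidth_bps * rtt_s / 8


def window_limited_throughput_bps(window_bytes: float, rtt_s: float) -> float:
    """Upper bound on single-flow TCP throughput imposed by the window size."""
    return window_bytes * 8 / rtt_s


# Hypothetical path: 1 Gbit/s link with a 20 ms round-trip time.
link_bps = 1e9
rtt = 0.020

needed = bdp_bytes(link_bps, rtt)
capped = window_limited_throughput_bps(64 * 1024, rtt)

print(f"Window needed to fill the path: {needed / 1e6:.1f} MB")
print(f"Throughput ceiling with a 64 KiB window: {capped / 1e6:.1f} Mbit/s")
```

On these assumed figures a 64 KiB window caps a single flow far below the link rate, which is the kind of gap a tuning demonstration would aim to show.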

Conclusion
• Near national infrastructure
• A little basic, but improving
• Poised for web i/f into historic data
  – …then web i/f tests on demand
What do YOU do next?...
  – http://gridmon.dl.ac.uk/

Network Monitoring for e-Science
Questions???
m.j.leese@dl.ac.uk
http://gridmon.dl.ac.uk/~mjl