PCSF A PC based simulation facility running Windows
PCSF - A PC based simulation facility running Windows NT Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 1
Overview n n n Configuration & pictures Applications Data access Specific work & solutions Key issues Conclusions Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 2
People involved n n A. Baran, J. P. Baud, C. Boissat, N. Buncic, J. Bunn, C. Charbonnier, F. Collin, V. Dore, V. Faine, S. Jarp, I. Mc. Laren, S. O’Neale, A. Pfeiffer, H. Tang, A. Simmins, C. Von Praun, J. Wessels, R. Yaari and all those that I forgot to mention. . . Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 3
Goals n n Make PC+NT a standard option for Physics Data Processing, starting with simulation Establish a minimum management model for NT farm management Address scalability issues Gain Windows NT experience Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 4
Milestones n n n Joined RD 47 in Autumn 96 Price inquiry issued in 12/96 Hardware delivered 4/97 Ready to use 6/97 RD 47 report 10/97 Expansion 5/98 Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 5
Configuration (1) n Server running NT 4. 0 Server SP 3 • 1 dual capable Ppro @ 200 MHz, 96 MB, with 9 GB data disk (with mirroring). LSF central queues. n Server running NT Terminal Server Beta 2 • 1 dual Ppro @ 200 MHz, 128 MB, with 4 GB data disk. Runs IIS 3. 0 and is accessible from outside CERN. It also host the asp’s for Web access n Servers running NT 4. 0 Workstation SP 3 • 9 dual Ppro’s @ 200 MHz, 64 MB, 2*4 GB • 25 dual PII’s @ 300 MHz, 128 MB, 2*4 GB All equipped with boot proms Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 6
Configuration (2) n n n Machines interconnected with 4 3 com 3000 100 Base. T switch Display/Keyboard/Mouse connected to a Raritan multiplexor PC Duo for remote admin access There were problems with other products n All running LSF 3. 0. LSF 3. 2 does not work, support weak n Completely integrated with NICE Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 7
Racking evolution 1998 1997 Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 8
Applications n n n ATLAS Dice simulation NA 45 1996 reconstruction CMS reconstruction with Objectivity being tested LHCB simulation code ready ATLAS reconstruction being ported ATLAS/Marseille event filter prototype scalability tests (see poster) 10/21/2021 12
Data access Unix RFIO Server IO F R NT PC NT NTPC PC N et w or k Unix Tape Server stagexxx commands Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 13
PCSF Usage Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 14
Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 15
Specific work so far n n n Installation (Remote Boot, Winstall, NICE replica’s, Install Server) User codes, CERNLIB, SHIFT Job Starter PC MGR WNTS Web Interface Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 16
Installation Disk cloning + change SID Fastest method, but not very automated n Remote boot n • Remote boot install procedures with virtual disk • Use unattended setup, installs Winstall and other things • Third party packages installed through Winstall boot prom support on some hardware Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 17
Porting Usually porting code from Unix to NT is easy (NA 45 code ported in 1 week) n Usually porting production environment from Unix to NT is difficult (shell scripts) n Porting build environment is difficult, better to use native tools (Dev Studio) Mixing Unix and NT build environment, revision control, etc. n Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 18
Jobstarter Initially inherited from Unix LSF CERN Job. Starter n Rewritten in C++, using Pc. Mgr. Svc for drive mapping n Check execution preconditions n Clean up normal and abnormal job end n Kill popup dialog windows Excel & Winzip in batch n Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 19
Pc. Mgr. Svc/Ctl n Checks • Status of monitored processes/services • Amount of scratch space • Drive mapping(s) n n Map/Unmap drives Sync. with time servers Generate alarms on request Gets all parameters from registry Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 20
Web Interface n As a solution to • Remote access from outside CERN • Access from non NT hosts n n Implemented as ASP’s with VB Requires IIS on the server Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 21
Web Interface - Overview Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 23
Windows NT Terminal Server Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 26
Key Issues n n n AFS access LSF support Boot proms, equipment interoperability CODE reintegration (Physics & CERNLIB) Think Windows Scalability & Management (home grown solution vs. commercial apps. ) Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 27
Next Steps n n n n Finish and understand remote boot issues Complete remote boot - remote install AFS Integration Build up resilience Investigate how to use the new Wf. M, DMI, PXE, ACPI, etc. initiatives Investigate whether WSH is an alternative Investigate NT’s I/O capabilities Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 28
Conclusions n n PC+NT has proven to work in batch environment, and is now an option for Physics Data Processing Farm management is less of a concern after have built a few tools (alternatives would be to use SMS or TNG), but some work is still needed Scalability has started to be addressed, but the relatively small number of nodes does not help here Considerable NT experience has been gained Frédéric Hemmer CERN-IT/PDP CHEP 98 - 3 September 98 29
- Slides: 23