PDSF Computing model Thomas Davis ASGNERSC LBNL LCCWS
PDSF Computing model Thomas Davis ASG/NERSC, LBNL LCCWS
Contents " Why PDSF at a supercomputing center? " Short history of PDSF " What does PDSF do? " How do I get access to PDSF? " Hardware " Software
Why PDSF at a Supercomputing Center? " Sharing of resources � 24 x 7 x 365 � Can glass house. leverage some vendors expertise. � Access to expertise in data management, networking, and other important services. � Large cluster could be the next HPC system; so PDSF provides production experience with clusters.
Short history of PDSF " Came from SSC, crated and moved to Livermore. " No new hardware, software for close to 8 years. " " Life support applied when NERSC moved to Berkeley from Livermore in 1997 Has picked up experimental support ever since, with what appears to be a 1000% growth rate.
How do I get PDSF access? " Buy in service model. Clients pay for a fixed slices of CPU time/disk space. � Over time, as cluster grows, the amount of disk space and cpu time drops. Moore's law is in effect. " Client has no direct ownership of cluster. PDSF admins have the right to move resources around when needed. " Client can get more CPU power than what they buy � if no one else is using CPU's in cluster, then resources are available to other users. � LSF Fairshare is used to arbitrate between clients. " Equipment is retired based on Moore's law, and warranties.
PDSF Hardware model " CPU power is always moving. � We expect to retire any Intel based system after 3 years. " " Disk size is climbing even faster; new drives replace old disks, increasing volume size and performance. Memory constraints can also force retirement of systems.
Hardware, Continued. " " " New hardware is always bought as late as possible. CPU speeds are based on the knee of the curve; 2 x 650 mhz machines are better than 1 x 1 Ghz machine. Memory is also bought based on size; always buy largest dimm's when possible.
Software model. " Platform LSF is used for batch queuing. � Fought for better pricing. . but Platform still wants lots of money. " Redhat 6. 1 is software base. � Only updated when needed; many experiments can't change. " " Control of software is vigorous; because PDSF has many experiments, any software changes are controlled. Opensource is preferred, but properiety software is acceptable.
- Slides: 8