Proposal for an IS schema, Massimo Sgaravatto, INFN
- Slides: 9
Proposal for an IS schema
Massimo Sgaravatto
INFN Padova
Introduction
- We have to converge on a set of information needed by our scheduler (for the M9 release)
- We have to agree with the other WPs on a “common” schema
- We have to negotiate with the other WPs (WP4, …) who is in charge of implementing the required information providers
- As the main customer of the monitoring and information service, we should make a proposal …
Proposal
- Draft proposal for a “new” schema, prepared in the context of WP1 INFNGRID (Evaluation of the Globus toolkit)
  - http://www.infn.it/globus/Docs/gisfarm.pdf
- Useful for our needs?
Assumptions (for M9)
- Globus-based prototype
- A computing element is represented by a set of local queues of an underlying resource management system
- The Grid Scheduler submits jobs to queues
- A queue represents a set of “homogeneous” resources
  - When a job is submitted to a specific queue, it doesn’t matter to which node of this queue the job is dispatched
Proposed schema
- Service (JobManager)
- QueueEntry (Job)
Jobmanager service (front-end to the farm)
- Host name of the “front-end” machine
- Name of job manager (jobmanager-lsf, jobmanager-pbs, …)
- Contact string
- Resource management (LSF/PBS/…)
  - Which local resource management systems do we have to consider at M9?
- Resource management version
- GRAM version
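As a rough illustration, the job manager attributes listed above could be held in a simple record. This is only a sketch: the field names, the example host, and the shape of the contact string are assumptions made for illustration and are not part of the proposal.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobManagerService:
    """Front-end to a farm, as seen by the Grid scheduler (hypothetical field names)."""
    hostname: str       # host name of the "front-end" machine
    jobmanager: str     # e.g. "jobmanager-lsf", "jobmanager-pbs"
    contact: str        # contact string used to reach the service
    lrms_type: str      # local resource management system: "LSF", "PBS", ...
    lrms_version: str   # version of the resource management system
    gram_version: str   # GRAM version


# All values below are invented placeholders, not real endpoints or versions.
service = JobManagerService(
    hostname="frontend.example.org",
    jobmanager="jobmanager-lsf",
    contact="frontend.example.org/jobmanager-lsf",
    lrms_type="LSF",
    lrms_version="unknown",
    gram_version="unknown",
)
```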
Queue
- Name of the queue
- Architecture
- Operating System
- Total CPUs
- Free CPUs
- Total number of jobs (running and pending) in the queue
- Number of running jobs
- Number of idle jobs
- Maximum number of running jobs allowed for this queue
- Maximum number of jobs (running and idle) allowed for this queue
- Status of the queue (ready or not ready to dispatch jobs to the executing machine)
- Run windows (when the queue is active)
- Priority
- Maximum CPU time for a job submitted to this queue
- Maximum wall clock time for a job submitted to this queue
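To show how a scheduler might consume these queue attributes, here is a minimal sketch that filters queues able to accept a job and ranks them by free CPUs. The field names, the use of minutes for time limits, and the ranking rule are assumptions for illustration only; run windows and priority are left out for brevity.

```python
from dataclasses import dataclass


@dataclass
class QueueInfo:
    """Subset of the queue attributes listed above (hypothetical field names)."""
    name: str
    architecture: str
    operating_system: str
    total_cpus: int
    free_cpus: int
    running_jobs: int
    idle_jobs: int
    max_running_jobs: int
    max_total_jobs: int      # running and idle
    ready: bool              # queue is ready to dispatch jobs
    max_cpu_time_min: int    # assumed unit: minutes
    max_wall_time_min: int   # assumed unit: minutes

    @property
    def total_jobs(self) -> int:
        return self.running_jobs + self.idle_jobs


def can_accept(queue: QueueInfo, cpu_time_min: int, wall_time_min: int) -> bool:
    """True if the queue is ready, not full, and its limits cover the job."""
    return (
        queue.ready
        and queue.total_jobs < queue.max_total_jobs
        and cpu_time_min <= queue.max_cpu_time_min
        and wall_time_min <= queue.max_wall_time_min
    )


def rank_queues(queues, cpu_time_min, wall_time_min):
    """Eligible queues, most free CPUs first (one possible ranking, not the only one)."""
    eligible = [q for q in queues if can_accept(q, cpu_time_min, wall_time_min)]
    return sorted(eligible, key=lambda q: q.free_cpus, reverse=True)


# Invented example data.
short_q = QueueInfo("short", "i686", "Linux", 16, 4, 12, 3, 16, 50, True, 60, 120)
long_q = QueueInfo("long", "i686", "Linux", 32, 10, 22, 0, 32, 100, True, 1440, 2880)
closed_q = QueueInfo("closed", "i686", "Linux", 8, 8, 0, 0, 8, 20, False, 1440, 2880)

ranked = rank_queues([short_q, long_q, closed_q], cpu_time_min=30, wall_time_min=45)
```

Because a queue is assumed to group “homogeneous” resources, the sketch only needs queue-level attributes and never looks at individual nodes.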
Job
- Grid job ID
- Local job ID (LSF job id, Condor job id…)
- Grid user id (subject of his/her certificate)
- Local user id
- Pending reason
- JDL expression
- Submit time
- Start time
- Wall clock time
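The job attributes above could be sketched as a record like the following. The field names and example values are hypothetical. Wall clock time is derived here from the start time, whereas the proposal lists it as a published attribute; that derivation is an assumption of this sketch.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class JobEntry:
    """One job in a queue (hypothetical field names)."""
    grid_job_id: str
    local_job_id: str                 # e.g. the LSF or Condor job id
    grid_user_id: str                 # subject of the user's certificate
    local_user_id: str
    jdl: str                          # JDL expression describing the job
    submit_time: datetime
    start_time: Optional[datetime] = None
    pending_reason: Optional[str] = None

    def is_pending(self) -> bool:
        """A job without a start time has not been dispatched yet."""
        return self.start_time is None

    def wait_time(self, now: datetime) -> timedelta:
        """Time between submission and start (or `now` if still pending)."""
        end = self.start_time if self.start_time is not None else now
        return end - self.submit_time

    def wall_clock_time(self, now: datetime) -> timedelta:
        """Elapsed time since the job started; zero while pending."""
        if self.start_time is None:
            return timedelta(0)
        return now - self.start_time


# Invented example values.
job = JobEntry(
    grid_job_id="grid-0001",
    local_job_id="1234",
    grid_user_id="/C=IT/O=Example/CN=Example User",
    local_user_id="user01",
    jdl='Executable = "/bin/hostname";',
    submit_time=datetime(2001, 1, 1, 12, 0, 0),
    pending_reason="no free CPUs",
)
```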
Missing info
- What about the authorization policies?
  - Grid-mapfiles?
- What about the scratch area?
  - Available disk space …?
- What about the storage elements?
- What about the application environments?
- …???