Proposal for a IS schema Massimo Sgaravatto INFN

  • Slides: 9
Download presentation
Proposal for a IS schema Massimo Sgaravatto INFN Padova

Proposal for a IS schema Massimo Sgaravatto INFN Padova

Introduction n n We have to converge on a set of information needed by

Introduction n n We have to converge on a set of information needed by our scheduler (for M 9 release) We have to agree with the other WPs on a “common” schema We have to negotiate with other WPs (WP 4, …) who is in charge to implement the required information providers As main customer of the monitoring and information service we should make a proposal …

Proposal n Draft proposal for a “new” schema proposed in the context of WP

Proposal n Draft proposal for a “new” schema proposed in the context of WP 1 INFNGRID (Evaluation of the Globus toolkit) n n http: //www. infn. it/globus/Docs/gisfarm. pdf Useful for our needs ? ? ?

Assumptions (for M 9) n n Globus based prototype A computing element is represented

Assumptions (for M 9) n n Globus based prototype A computing element is represented by a set of local queues of an underlying resource management system The Grid Scheduler submit jobs to queues A queue represents a set of “homogeneous” resources n When a job is submitted to a specific queue, it doesn’t matter in which node of this queue the job is dispatched

Proposed schema Service (Job. Manager) Queue. Entry (Job)

Proposed schema Service (Job. Manager) Queue. Entry (Job)

Jobmanager service (front-end to the farm) n n Host name of the “front-end” machine

Jobmanager service (front-end to the farm) n n Host name of the “front-end” machine Name of job manager (jobmanager-lsf, jobmanager-pbs, …) Contact string Resource management (LSF/PBS/…) n n n Which local resource management systems do we have to consider at M 9 ? Resource management version GRAM Version

Queue n n n n Name of the queue Architecture Operating System Total CPUs

Queue n n n n Name of the queue Architecture Operating System Total CPUs Free CPUs Total number of jobs (running and pending) in the queue Number of running jobs Number of idle jobs Maximum number of running jobs allowed for this queue Maximum number of jobs (running and idle) allowed for this queue Status of the queue (ready or not ready to dispatch jobs to the executing machine) Run windows (when the queue is active) Priority Maximum CPU time for a job submitted to this queue Maximum wall clock time for a job submitted to this queue

Job n n n n n Grid job ID Local job ID (LSF job

Job n n n n n Grid job ID Local job ID (LSF job id, Condor job id…) Grid user id (subject of his/her certificate) Local user id Pending reason JDL expression Submit time Start time Wall clock time

Missing info n What about the authorization policies ? n n What about the

Missing info n What about the authorization policies ? n n What about the scratch area ? n n n Grid-mapfiles ? Available disk space …? What about the storage elements ? What about the application environments ? … ? ? ?