Introduction to Message Passing Interface (MPI), 9/4/2012


Introduction to Message Passing Interface (MPI)
9/4/2012, Parallel Programming, C. Ferner & B. Wilkinson, 2014

Shared-Memory Systems
[diagram: processors, each with a bus interface, connected over a processor/memory bus and memory controller to shared memory]
All processors can access all of the shared memory.

For processors that share the same memory, if one processor computes a value or values needed by other processors, they need to synchronize: the processors that need the data must wait for the processor that computes it. However, all processors can access the same memory, so nothing else is required.

Distributed-Memory Systems
[diagram: computers, each a processor with private local memory, exchanging messages over an interconnection network]
In clusters, each processor cannot access the memory of other processors. Memory is private.

If one processor computes a value or values needed by other processors, it has to transmit that data. This requires message passing. All data is private.

MPI (Message Passing Interface)
Widely adopted message-passing library standard. MPI-1 finalized in 1994, MPI-2 in 1996, MPI-3 in 2012. Process-based: processes communicate among themselves with messages, point-to-point and collectively. A specification, not an implementation; several free implementations exist (Open MPI, MPICH). Large number of routines (MPI-1: 128 routines, MPI-2: 287, MPI-3: 440+), but typically only a few are used. C and Fortran bindings (C++ was removed from MPI-3). Originally for distributed systems, but now used for all types: clusters, shared memory, hybrid.

Message-Passing Options
Sockets (very low level). Parallel Virtual Machine (PVM, created by Oak Ridge National Lab). MPI (Message Passing Interface), the standard, with implementations including LAM, MPICH, and Open MPI (we use this one). All of these are libraries that can be used from within a C program.

To begin
MPI_Init() initializes the processes and gets them ready to run. It should be the first executable statement in the program. MPI_Finalize() cleans up after the parallel program is done. It should be the last executable statement in the program. Although all of the processes exist before and after Init and Finalize, it is convenient to think of MPI_Init() as when all the processes are created and MPI_Finalize() as when they are destroyed.

To begin
[timeline: process 0 alone before MPI_Init(); processes 0 through 3 running between MPI_Init() and MPI_Finalize(); process 0 alone afterward]

To begin

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    <Code executed by all processes>
    MPI_Finalize();
}

To begin
MPI_Init takes two arguments, &argc and &argv, which are the arguments of main. MPI_Finalize takes no arguments. Each process is assigned a rank in the range 0 ≤ rank < NP, where NP is the number of processes being used.

Other Useful Functions
To determine how many processes there are: MPI_Comm_size(MPI_COMM_WORLD, &NP);
To determine the current process's rank among all the processes: MPI_Comm_rank(MPI_COMM_WORLD, &myrank);
Each process is given a unique rank in the range 0 ≤ rank < NP.

How is the number of processes determined?
When you run your MPI program, you can specify how many processes you want:
$ mpirun -np 8 <program>
The -np option tells mpirun to run your parallel program using the specified number of processes. OR:
$ mpiexec -n 8 <program>
The -n option tells mpiexec to run your parallel program using the specified number of processes.

How can these be used?

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    int NP, myrank;
    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &NP);
    MPI_Comm_rank(MPI_COMM_WORLD, &myrank);
    printf("Hello world from rank %d of %d.\n", myrank, NP);
    MPI_Finalize();
}

Compiling and Running

$ mpicc hello.c -o hello
$ mpirun -np 5 hello
Hello world from rank 0 of 5.
Hello world from rank 3 of 5.
Hello world from rank 4 of 5.
Hello world from rank 1 of 5.
Hello world from rank 2 of 5.
$

mpicc is essentially gcc, but it makes sure that the MPI libraries are included. Why are the statements not in order of rank?

All processes are executing the same code (although asynchronously). How can one have them execute separate code? Or how can one have a section of code executed by only one process?

Use their rank

if (myrank >= 0 && myrank < x) {
    ... // Code executed by a subset of processes
}

OR, a client/server model:

if (myrank == 0) {
    ... // Code executed by only one process
} else {
    ... // Code executed by all other processes
}

Sending messages
Messages can be sent between processes using the MPI_Send() and MPI_Recv() functions. These are one-to-one communications. MPI_Recv is blocking, meaning that execution will stop until the appropriate message is received. There are also non-blocking forms of communication, as well as one-to-many and many-to-many.

Message-passing concept using library routines. Note that each computer executes its own program.

Sending messages

int MPI_Send(void *buf, int count, MPI_Datatype datatype, int dest, int tag, MPI_Comm comm)

buf is the address of the data to send. count is the number of elements (1 if a scalar, N if an array, or strlen+1 if a string). datatype is the type of the elements. dest is the rank of the destination.

Sending messages

int MPI_Send(void *buf, int count, MPI_Datatype datatype, int dest, int tag, MPI_Comm comm)

tag is user-defined (it allows you to mark different messages with your own tag). This is useful when two processes are sending multiple messages between each other. comm is what is known as a communicator; basically, it is a subset of processes. MPI_COMM_WORLD is used for all processes.

Receiving messages

int MPI_Recv(void *buf, int count, MPI_Datatype datatype, int source, int tag, MPI_Comm comm, MPI_Status *status)

buf is the address in which to store the message. count is the size of buf; it can be bigger than the actual message. datatype is the type of the elements. source is the rank of the sender.

Receiving messages

int MPI_Recv(void *buf, int count, MPI_Datatype datatype, int source, int tag, MPI_Comm comm, MPI_Status *status)

tag is user-defined. comm is the communicator; MPI_COMM_WORLD is used for all processes. status is a structure that contains information about the transmission.

Parameters of blocking send
MPI_Send(buf, count, datatype, dest, tag, comm)
buf: address of send buffer (notice it is a pointer). count: number of items to send. datatype: datatype of each item. dest: rank of destination process. tag: message tag. comm: communicator.

Parameters of blocking receive
MPI_Recv(buf, count, datatype, src, tag, comm, status)
buf: address of receive buffer. count: maximum number of items to receive. datatype: datatype of each item. src: rank of source process. tag: message tag. comm: communicator. status: status after the operation.

Usually the send and recv counts are the same. In our code we do not check status, but it is good programming practice to do so.

MPI Datatypes (defined in mpi.h)
MPI_BYTE, MPI_PACKED, MPI_CHAR, MPI_SHORT, MPI_INT, MPI_LONG, MPI_FLOAT, MPI_DOUBLE, MPI_LONG_DOUBLE, MPI_UNSIGNED_CHAR

Example (Hello World 2)

#include <stdio.h>
#include <string.h>
#include <stddef.h>
#include <stdlib.h>
#include <unistd.h>   // for gethostname
#include <mpi.h>

int main(int argc, char **argv) {
    char message[256];
    int i, rank, NP, tag = 99;
    char machine_name[256];
    MPI_Status status;

Example (Hello World 2, continued)

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &NP);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    gethostname(machine_name, 255);

Example (Hello World 2, continued)

    if (rank == 0) {
        printf("Hello world from master process %d running on %s\n",
               rank, machine_name);
        for (i = 1; i < NP; i++) {
            MPI_Recv(message, 256, MPI_CHAR, i, tag, MPI_COMM_WORLD, &status);
            printf("Message from process = %d : %s\n", i, message);
        }
    }

Example (Hello World 2, continued)

    else {
        sprintf(message, "Hello world from process %d running on %s",
                rank, machine_name);
        // The destination is the master process (rank 0)
        MPI_Send(message, strlen(message) + 1, MPI_CHAR, 0, tag, MPI_COMM_WORLD);
    }
    MPI_Finalize();
}

Result

$ mpirun -np 9 ./hello
Hello world from master process 0 running on compute-0-0.local
Message from process = 1 : Hello world from process 1 running on compute-0-0.local
Message from process = 2 : Hello world from process 2 running on compute-0-0.local
Message from process = 3 : Hello world from process 3 running on compute-0-0.local
Message from process = 4 : Hello world from process 4 running on compute-0-0.local
Message from process = 5 : Hello world from process 5 running on compute-0-0.local
Message from process = 6 : Hello world from process 6 running on compute-0-0.local
Message from process = 7 : Hello world from process 7 running on compute-0-0.local
Message from process = 8 : Hello world from process 8 running on compute-0-1.local
$

Any source or tag
In MPI_Recv, the source can be MPI_ANY_SOURCE and the tag can be MPI_ANY_TAG. These cause the Recv to accept any message destined for the current process, regardless of the source and/or the tag. Ex.:

MPI_Recv(message, 256, MPI_CHAR, MPI_ANY_SOURCE, MPI_ANY_TAG, MPI_COMM_WORLD, &status);

Another Example (array)

int array[100];
... // rank 0 fills the array with data
if (rank == 0)
    MPI_Send(array, 100, MPI_INT, 1, 0, MPI_COMM_WORLD);           // 100 elements, destination 1, tag 0
else if (rank == 1)
    MPI_Recv(array, 100, MPI_INT, 0, 0, MPI_COMM_WORLD, &status);  // source 0, tag 0

Another Example (scalar)

The number of elements for a scalar is just 1:

int N;
... // rank 2 assigns a value to N, which is needed by rank 3
if (rank == 2)
    MPI_Send(&N, 1, MPI_INT, 3, 5, MPI_COMM_WORLD);           // dest 3, tag 5
else if (rank == 3)
    MPI_Recv(&N, 1, MPI_INT, 2, 5, MPI_COMM_WORLD, &status);  // source 2, tag 5

Another Example (Ring)
In the ring example, each process (except the master) receives a token from the process with rank one less than its own. Each process then increments the token and sends it to the next process (with rank one more than its own). The last process sends the token to the master.

Another Example (Ring)
Each process (except the master) receives a token from the process with rank one less than its own. Each process then increments the token by 2 and sends it to the next process (with rank one more than its own). The last process sends the token back to the master.
[diagram: ranks 0 through 7 arranged in a ring, 0 → 1 → 2 → ... → 7 → 0]
Question: Do we have a pattern for this?
Slide based upon slides from C. Ferner, UNC-W

Another Example (Ring)

#include <stdio.h>
#include <mpi.h>

int main(int argc, char *argv[]) {
    int token, NP, myrank;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &NP);
    MPI_Comm_rank(MPI_COMM_WORLD, &myrank);

Another Example (Ring, continued)

    if (myrank != 0) {
        // Everyone except the master receives from the process with
        // rank one less than its own.
        MPI_Recv(&token, 1, MPI_INT, myrank - 1, 0, MPI_COMM_WORLD, &status);
        printf("Process %d received token %d from process %d\n",
               myrank, token, myrank - 1);

Another Example (Ring, continued)

    } else {
        // The master sets the initial value before sending.
        token = -1;
    }
    token += 2;
    MPI_Send(&token, 1, MPI_INT, (myrank + 1) % NP, 0, MPI_COMM_WORLD);

Another Example (Ring, continued)

    // Now process 0 can receive from the last process.
    if (myrank == 0) {
        MPI_Recv(&token, 1, MPI_INT, NP - 1, 0, MPI_COMM_WORLD, &status);
        printf("Process %d received token %d from process %d\n",
               myrank, token, NP - 1);
    }
    MPI_Finalize();
}

Results (Ring)

Process 1 received token 1 from process 0
Process 2 received token 3 from process 1
Process 3 received token 5 from process 2
Process 4 received token 7 from process 3
Process 5 received token 9 from process 4
Process 6 received token 11 from process 5
Process 7 received token 13 from process 6
Process 0 received token 15 from process 7

Send and Receive Semantics
MPI_Send and MPI_Recv are locally blocking: they return after their local actions are complete. MPI_Send returns after the data has been copied into a buffer and the message is prepared and on its way. MPI_Recv returns after the data has been received and copied into the user's buffer (it blocks until the message arrives). There are other variations of send and receive (more on this later).

Matching up sends and recvs
Notice in the code how you have to be very careful matching up sends and recvs: every send must have a matching recv. Sends return after local actions complete, but a recv will wait for its message, so it is easy to get deadlock if the code is written wrong. Pre-implemented patterns are designed to avoid deadlock. We will look at deadlock again.

Measuring the Execution Time

double MPI_Wtime(void)

Returns a double that is the number of seconds since some time in the past (e.g., the epoch date, January 1, 1970).

double start_time, end_time, elapsed_time;
...
start_time = MPI_Wtime();
... // section to measure
end_time = MPI_Wtime();
elapsed_time = end_time - start_time;

Measuring the Execution Time
Alternatively, one can use gettimeofday (a system call to the operating system). This is useful if you are creating a sequential (non-MPI) version with which to compare.

#include <sys/time.h>

double elapsed_time;
struct timeval tv1, tv2;
gettimeofday(&tv1, NULL);
... // section to measure
gettimeofday(&tv2, NULL);
elapsed_time = (tv2.tv_sec - tv1.tv_sec) +
               ((tv2.tv_usec - tv1.tv_usec) / 1000000.0);

Executing program on multiple computers
Usually the computers are specified in a file containing the names of the computers and possibly the number of processes that should run on each. Then specify the file with the -machinefile option to mpiexec (or the -hostfile or -f options). An implementation-specific algorithm selects computers from the list to run the user processes; typically MPI cycles through the list in round-robin fashion. If a machines file is not specified, a default machines file is used, or the program may run only on a single computer.

Cluster at UNCW
[diagram: user computers connect to the submit host; the head node connects through a switch to the dedicated compute nodes]
Submit host: babbage. Head node: harpua. Compute nodes: compute-0-0, compute-0-1, compute-0-2, ...

Cluster at UNCW
We use the Sun Grid Engine (SGE) to schedule jobs on the cluster. This allows users to have exclusive use of the compute nodes, so that one user's application doesn't interfere with the performance of another's. The scheduler (SGE) is responsible for allocating compute nodes to jobs exclusively. Compile as normal:

$ mpicc hello.c -o hello

SGE
But running is done through a job submission file (or job description file). Some SGE commands:
qsub <job submission file>: submits a job to the scheduler to run
qstat: shows the status of submitted jobs (waiting, queued, running, terminated, etc.)
qdel <#>: deletes a job (by number) from the system
qhost: shows a list of hosts

SGE
Example job submission file (hello.sge):

#!/bin/sh
# Usage: qsub hello.sge
#$ -S /bin/sh
#$ -pe orte 16       # Specify how many processors we want
# -- our name --
#$ -N Hello          # Name for the job
#$ -l h_rt=00:01:00  # Request 1 minute to execute
#$ -cwd              # Make sure that the .e and .o files arrive in the working directory
#$ -j y              # Merge the standard out and standard error to one file
mpirun -np $NSLOTS ./hello

SGE
Example job submission file (hello.sge), line by line:

#!/bin/sh
# Usage: qsub hello.sge
#$ -S /bin/sh
#$ -pe orte 16       # Specify how many processors we want

SGE
Example job submission file (hello.sge), continued:

# -- our name --
#$ -N Hello          # Name for the job

This is the name of the job, and also the name of the output files: Hello.o### and Hello.po###.

#$ -l h_rt=00:01:00  # Request 1 minute to execute

Indicates that the job will need only a minute. This is important so that SGE will clean up if the program hangs or terminates incorrectly. You may need to increase the time for longer programs, or SGE will terminate the program before it has completed.

SGE
Example job submission file (hello.sge), continued:

#$ -cwd              # Make sure that the .e and .o files arrive in the working directory

Do the job in the current directory.

#$ -j y              # Merge the standard out and standard error to one file

SGE will create 3 files: Hello.o##, Hello.e##, and Hello.po##. The -j y command will merge the Hello.o and Hello.e files (standard out and standard error).

SGE
Example job submission file (hello.sge), continued:

mpirun -np $NSLOTS ./hello

And finally, the command to run the MPI program. $NSLOTS is the same number given on the '#$ -pe orte 16' line.

SGE Example

$ qstat
$ qsub hello.sge
Your job 106 ("Hello") has been submitted
$ qstat
job-ID  prior    name   user     state  submit/start at      queue  slots  ja-task-ID
-------------------------------------------------------------------------------------
106     0.00000  Hello  cferner  qw     09/04/2012 09:08:38         16
$

The state "qw" means queued and waiting.

SGE Example

$ qstat
job-ID  prior    name   user     state  submit/start at      queue                    slots  ja-task-ID
-------------------------------------------------------------------------------------------------------
106     0.55500  Hello  cferner  r      09/04/2012 09:11:43  all.q@compute-0-0.local  16
[cferner@babbage mpi_assign]$

The state "r" means running.

SGE Example

$ ls
hello.c  Hello.o106  Hello.po106  hello.sge  ring.c  ring.sge  test.c  test.sge
$ cat Hello.o106
Hello world from master process 0 running on compute-0-2.local
Message from process = 1 : Hello world from process 1 running on compute-0-2.local
Message from process = 2 : Hello world from process 2 running on compute-0-2.local
...

You will want to clean up the output files when you are done with them, or you will end up with a bunch of clutter.

Deleting a job

$ qstat
job-ID  prior    name   user     state  submit/start at      queue  slots  ja-task-ID
-------------------------------------------------------------------------------------
108     0.00000  Hello  cferner  qw     09/04/2012 09:18:20         16
$ qdel 108
cferner has registered the job 108 for deletion
$ qstat
$

Executing program on UNCC cluster
On the UNCC cci-gridgw.uncc.edu cluster, the mpiexec command is mpiexec.hydra. Internal compute nodes have names that are used only internally. For example, a machines file to use nodes 5, 7, and 8 plus the front node of the cci-grid0x cluster would be:

cci-grid05
cci-grid07
cci-grid08
cci-gridgw.uncc.edu

Then:

mpiexec.hydra -machinefile machines -n 4 ./prog

would run prog with four processes, one on cci-grid05, one on cci-grid07, one on cci-grid08, and one on cci-gridgw.uncc.edu.

Specifying number of processes to execute on each computer
The machines file can include how many processes to execute on each computer. For example:

# a comment
cci-grid05:2           # first 2 processes on 05
cci-grid07:3           # next 3 processes on 07
cci-grid08:4           # next 4 processes on 08
cci-gridgw.uncc.edu:1  # last process on gridgw (09)

This is 10 processes in total. Then:

mpiexec.hydra -machinefile machines -n 10 ./prog

If more processes were specified, they would be scheduled in round-robin fashion.

Eclipse IDE
PTP (Parallel Tools Platform) plug-in:
• Supports development of parallel programs (MPI, OpenMP).
• Possible to edit and execute an MPI program on the client or on a remote machine.
Eclipse-PTP is installed on the course virtual machine. We hope to explore Eclipse-PTP in assignments.
http://download.eclipse.org/tools/ptp/docs/ptp-sc11-slides-final.pdf

Visualization Tools
Programs can be watched as they are executed in a space-time diagram (or process-time diagram).
[diagram: processes 1 through 3 over time, showing computing, waiting, message-passing system routines, and messages between processes]
Visualization tools are available for MPI, e.g., Upshot.

Questions?