CS 566, SP06 | Parallel Processing |
Now with torque or any other bach system you wrap up steps 1 and 2 in a
script (or at least step 2).
You can call it script.sh or whatever and you submit that script via qsub
(or put that in the script as well)
Note: Select same nodes for torque as the ones that your mpd daemons will
be using to form the ring.
Ok now more on the 3 steps stated above & I think that will answer most of questions:
Before we start create a file in homedir called: .mpd.conf (-rw-------)
and whose content is one line:
secretword=some_secret_password
Steps:
-r rsh will let mpdboot know that it will use rsh to bring the
daemons up not ssh which is the default way that mpdboot tries to
communicate
-n 4 means it (mpdboot) 'll bring only four of them(mpd daemons) from the
list(mpd.hosts), say the mpd.hosts had all 64 nodes and you had -n 5 it
would select first five nodes from the list and start mpd daemons on them.
Anyhow best practice is to specify only the nodes you'll be using.
To double check that the ring is indeed up, do the following:
rsh argo1-1 "/usr/common/mpich2-1.0.1/bin/mpdtrace"
argo1-1
argo1-4
argo1-2
argo1-3
Basically this script looks something like the script x and x1 on the argo
homepage.
The most important line above was the qsub -l nodes=$nodes .......
---------------------------------------------------
#!/bin/csh
set nodes = `perl -e 'while
(
qsub -l nodes=$nodes mpiexec -n 4
/home/homes5X/path_to_your_binary/mpihello_pgcc_mpi2libs
---------------------------------------------------
This line will prepare and tell PBS(torque) what nodes to use.
Execute the script, ./script