jacobican be run without any change whatsoever, in parallel, on the SP.
/afs/$AFSCELL/@sys/usr/afsws/bin:/afs/$AFSCELL/@sys/usr/afsws/etc:\ /afs/$AFSCELL/@sys/gnu/bin:/afs/$AFSCELL/@sys/HDF/bin:\ /afs/$AFSCELL/@sys/teTeX/bin:\after
/usr/binin your command search
export AFSCELL=ovpit.indiana.eduAlso place
kloginto AFS before being able to do anything else with it, even to read AFS
/afs/ovpit.indiana.edu/common/www/htdocs/gustav/jacobi.f90to your working directory and rename it to
gustav@sp17:../jacobi 13:45:55 !219 $ xlhpf90 -o jacobi jacobi.f ** jacobi === End of Compilation 1 === 1501-510 Compilation successful for file jacobi.f. gustav@sp17:../jacobi 13:46:12 !220 $
/afs/ovpit.indiana.edu/common/www/htdocs/gustav/jacobi.llto your working directory. This is what this file should look like:
gustav@sp17:../jacobi 13:46:12 !220 $ cat jacobi.ll # @ job_type = parallel # @ environment = COPY_ALL; MP_EUILIB=us; MP_INFOLEVEL=6 # @ requirements = (Adapter == "hps_user") # @ min_processors = 4 # @ max_processors = 8 # @ output = jacobi.out # @ error = jacobi.err # @ executable = /usr/bin/poe # @ arguments = jacobi # @ notification = always # @ class = pa # @ queue gustav@sp17:../jacobi 13:46:55 !221 $
gustav@sp17:../jacobi 13:47:47 !223 $ llsubmit jacobi.ll submit: The job "sp17.434" has been submitted. gustav@sp17:../jacobi 13:48:22 !224 $
jacobi.errin your working directory.
jacobi.errcontains diagnostics that show the program running in parallel on between 4 to 8 nodes, depending on how many the system could have given you at the time:
gustav@sp17:../jacobi 13:51:51 !232 $ head -12 jacobi.err INFO: DEBUG_LEVEL changed from 0 to 4 D1<L4>: ./host.list file did not exist D1<L4>: mp_euilib = us D1<L4>: node allocation strategy = 1 D1<L4>: Using css0 as euidevice for User Space job ATTENTION: 0031-408 6 nodes allocated by LoadLeveler, continuing... INFO: 0031-119 Host sp20.ucs.indiana.edu allocated for task 0 INFO: 0031-119 Host sp41.ucs.indiana.edu allocated for task 1 INFO: 0031-119 Host sp43.ucs.indiana.edu allocated for task 2 INFO: 0031-119 Host sp23.ucs.indiana.edu allocated for task 3 INFO: 0031-119 Host sp42.ucs.indiana.edu allocated for task 4 INFO: 0031-119 Host sp17.ucs.indiana.edu allocated for task 5 gustav@sp17:../jacobi 13:52:01 !233 $
llqin order to view the LoadLeveler queue on the SP. It may happen that there are jobs running in the
paqueue that will not release it for a very long time.
.rhostsfile in your
$HOMEdirectory on the SP. The file should look as follows:
gustav@sp17:../SP 14:49:28 !412 $ cat .rhosts sp17 sp18 sp19 sp20 sp21 sp22 sp23 sp24 sp40 sp41 sp42 sp43 gustav@sp17:../SP 14:49:29 !413 $
hosts.listalso in your SP
rshworks for you on the SP. It should be
/usr/bin/rshto begin with. Try
rsh sp22 date, for example.
poe jacobi-raster -procs 8 -euilib ipand watch the action!