Getting Started using the PPPL cluster
To use the PPPL cluster, do the following:
- if you don't already have one, get an account by
contacting Lena Scimeca in the Information Technology Department
- log into portal.pppl.gov,
the "front end" of the cluster, using ssh (secure shell)
- copy or compile your program so it is in your home directory or
one of its subdirectories
- create a job script to run your
program
- submint the job script to the scheduler for execution in a queue
The job scheduler will check that the attributes you have
specified in your job are met, and that there are enough hosts
available. If all conditions are met, the scheduler starts the job.
If not, it queues the job for later execution.
Typically, a job contains programs that are parallelized, whose threads each
run on individual CPUs, and which run on more than one host
simultaneously.
Here's an example:
You have a multi-threaded, parallelized program, and have compiled this
program using the appropriate parallel compiler. You want to use a
facility called MPI to run
the program on multiple hosts. You would:
- create a job script to run your program (see sample job script)
- submit this job script to the job scheduler)
- collect your output
Interactive use
The PPPL cluster also has two types of interactive hosts
- Non-exclusive: A set of
hosts has been provided with the alias portal.pppl.gov. When you login to portal, logins are
allocated in a least loaded fashion to one of the hosts in this group.
Here, you can do compilations, do test runs, or submit jobs to the
batch queue. The portal hosts are shared by all users, so running very
high CPU or memory intensive jobs is discouraged. In that case, you
should reserve a system for your exclusive use as detailed in the next
section.
- Exclusive: You can
reserve one or more hosts for your exclusive use (i.e. no other users
will be able to use the hosts allocated to you) by ssh'ing to portal and typing the
command use. This command
will ask you a set of questions, like the number of hosts you need and
if you require a host or hosts with a large memory size. It will then
reserve a set of hosts on which you can run programs interactively. The
advantage of using this approach is that the job scheduler will reserve
your hosts for your use only and prevent other users from running on
the hosts you have been allocated.