Skip to content

Commit

Permalink
update job submission details
Browse files Browse the repository at this point in the history
  • Loading branch information
Jane Herriman committed Jun 4, 2024
1 parent d5bd8e9 commit 63da1bf
Show file tree
Hide file tree
Showing 6 changed files with 104 additions and 85 deletions.
14 changes: 7 additions & 7 deletions episodes/01-introduction.md
Original file line number Diff line number Diff line change
Expand Up @@ -229,15 +229,15 @@ Where would `Hello` be written for the following YAML?

```yml
description:
name: MyHello
description: Report a node's hostname.
name: MyHello
description: Report a node's hostname.
study:
- name: give-salutation
description: Write the login node's hostname to a file
run:
cmd: |
echo "hello" > greeting.txt
- name: give-salutation
description: Write the login node's hostname to a file
run:
cmd: |
echo "hello" > greeting.txt
```

1. `~/give-salutation_{date}_{time}/greeting/greeting.txt`
Expand Down
68 changes: 36 additions & 32 deletions episodes/02-maestro_on_the_cluster.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,10 +33,11 @@ should run.

```yml
batch:
type: slurm
host: quartz # enter the machine you'll run on
bank: guests # enter the bank to charge
queue: pdebug # enter the partition in which your job should run
type: slurm
host: ruby # enter the machine you'll run on
bank: guests # enter the bank to charge
queue: pbatch # partition in which your job should run
reservation: HPCC1B # reservation for this workshop
```
Second, we need to specify the number of nodes, number of processes,
Expand All @@ -47,11 +48,11 @@ a time limit of 30 seconds:
```yml
(...)
run:
cmd: |
hostname >> hostname.txt
nodes: 1
procs: 1
walltime: "00:00:30"
cmd: |
hostname >> hostname.txt
nodes: 1
procs: 1
walltime: "00:00:30"
```

Whereas `run` previously held only info about the command we wanted to
Expand All @@ -63,26 +64,27 @@ With these changes, our updated YAML file might look like

```yml
description:
name: Hostnames
description: Report a node's hostname.
name: Hostnames
description: Report a node's hostname.
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
type: slurm
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: hostname-login
description: Write the login node's hostname to a file
run:
cmd: |
hostname > hostname_login.txt
- name: hostname_batch
description: Write the node's hostname to a file
run:
cmd: |
hostname >> hostname.txt
- name: hostname-login
description: Write the login node's hostname to a file
run:
cmd: |
hostname > hostname_login.txt
- name: hostname_batch
description: Write the node's hostname to a file
run:
cmd: |
hostname >> hostname.txt
nodes: 1
procs: 1
walltime: "00:00:30"
Expand All @@ -99,8 +101,8 @@ slurm.
## Running on the cluster

Modify your YAML file, `hostname.yaml` to execute `hostname` on the
_cluster_. Run with 1 node and 1 process using the bank `guest` on
the partition `psummer` on `quartz`.
_cluster_. Run with 1 node and 1 process using the bank `guests` on
the partition `pbatch` on `ruby`.

If you run this multiple times, do you always run on the same node?
(Is the hostname printed always the same?)
Expand All @@ -116,9 +118,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guest # bank
queue: psummer # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: hostname-login
Expand Down Expand Up @@ -149,8 +152,9 @@ subdirectory `hostname_batch`, you'll find a file called
command ran. If you run the job multiple times, you will probably land
on different nodes; this means you'll see different node numbers in
different `hostname.txt` files. If you see the same number more than
once, don't worry! If you get any answer other than `pascal83`, you're
doing it correctly. :)
once, don't worry! (If you want to double check that the hostnames
printed are not for login nodes, you can run `nodeattr -c login` to
check the IDs of all login nodes on the system.)

::::::

Expand Down
51 changes: 29 additions & 22 deletions episodes/03-maestro-and-mpi.md
Original file line number Diff line number Diff line change
Expand Up @@ -143,9 +143,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand Down Expand Up @@ -184,9 +185,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand Down Expand Up @@ -224,7 +226,7 @@ parallel proportion of the workload.
Hello, World! I am process 0 of 1 on pascal17.
I will do all the serial 'work' for 5.324555 seconds.
Hello, World! I am process 0 of 1 on pascal17.
I will do parallel 'work' for 22.349517 seconds.
Expand Down Expand Up @@ -298,9 +300,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand Down Expand Up @@ -372,9 +375,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand All @@ -401,9 +405,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand Down Expand Up @@ -432,9 +437,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
study:
- name: amdahl
Expand Down Expand Up @@ -484,9 +490,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down
14 changes: 8 additions & 6 deletions episodes/04-placeholders.md
Original file line number Diff line number Diff line change
Expand Up @@ -190,9 +190,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down Expand Up @@ -274,9 +275,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down
28 changes: 16 additions & 12 deletions episodes/05-chaining_rules.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,9 +52,10 @@ description:

batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop

env:
variables:
Expand Down Expand Up @@ -216,9 +217,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down Expand Up @@ -341,9 +343,10 @@ description:
batch:
type: slurm
host: pascal # machine to run on
bank: lc #bank
queue: pvis # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down Expand Up @@ -568,9 +571,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down
14 changes: 8 additions & 6 deletions episodes/06-multiple-parameters.md
Original file line number Diff line number Diff line change
Expand Up @@ -93,9 +93,10 @@ description:
batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop
env:
variables:
Expand Down Expand Up @@ -249,9 +250,10 @@ description:

batch:
type: slurm
host: quartz # machine to run on
bank: guests # bank
queue: pdebug # partition
host: ruby # machine
bank: guests # bank
queue: pbatch # partition
reservation: HPCC1B # reservation for this workshop

env:
variables:
Expand Down

0 comments on commit 63da1bf

Please sign in to comment.