...
*** Make sure that Prof. Keller has added you to both the spin and spinquest groups in Rivanna. ***
Using
...
GPU's on Rivanna
#!/bin/bash
#SBATCH -A <allocation_name>
#SBATCH -p gpu
#SBATCH --gres=gpu:a100
#SBATCH --constraint=a100_80gb
#SBATCH -c 4
#SBATCH -t 1:00:00
#SBATCH -J test
#SBATCH -o test.out
#SBATCH -e test.err
#SBATCH --mem=256000
module purge
module load apptainer tensorflow/2.13.0
apptainer run --nv $CONTAINERDIR/tensorflow-2.13.0.sif <file_name>.py
Using Tensorflow with BKM2002-Formulation
1. Copy the sample files from the following Rivanna folder "/project/ptgroup/ANN_scripts/BKM-Formulation-Test/BKM-tf"
cd /project/ptgroup/ANN_scripts/BKM-Formulation-Test/BKM-tf
2. Run the following commands on your terminal
module load anaconda/2020.11-py3.8
module load singularity/3.7.1
module load tensorflow/2.1.0-py37
following step is needed to run only once (it will copy the relevant .sif file to your /home directory)
cp $$CONTAINERDIR/tensorflow-2.1.0-py37.sif /home/$USER
(make sure that you have the same module loads included in your grid.slurm file)
3. Run the following command to submit the job
./jobscript.sh <Name_of_Job> <Number_of_Replicas>
example:
./jobscript.sh CFF_BKM_tf_Test 10
Using PyTorch with BKM2002-Formulation
1. Copy the sample files from the following Rivanna folder "/project/ptgroup/ANN_scripts/BKM-Formulation-Test/BKM-PyTorch"
$ cd /project/ptgroup/ANN_scripts/BKM-Formulation-Test/BKM-PyTorch
2. Run the following commands on your terminal
$ module load anaconda/2020.11-py3.8
$ module load singularity/3.7.1
$ module load pytorch/1.8.1
$ cp $CONTAINERDIR following step is needed to run only once (it will copy the relevant .sif file to your /home directory)
cp $$CONTAINERDIR/pytorch-1.8.1.sif /home/$USER
...
example: $ ./jobscript.sh CFF_BKM_PyTorch_Test 10
Note:
If you download the code from GitHub to a Windows machine and then if you upload those files to Rivanna; then you will need to do the following steps
...