- Connect to the bastion using the SSH config emailed at provisioning.
- From the bastion, SSH into your assigned node (hostname format
gpu-N-IDCODE.apetops-us.com). - Run
nvidia-smito confirm all GPUs are visible and driver version matches. - Activate the pre-installed conda environment:
source /opt/apetops/activate.
Common first-run checks:
nvidia-smi topo -m— verify NVLink topology.ib_write_bw— quick InfiniBand smoke test (HPC nodes only).df -h /mnt/shared— confirm your shared parallel filesystem mount.
Troubleshooting tip: if GPUs are not visible, reload the NVIDIA modules with sudo systemctl restart nvidia-persistenced and try again.