Working with Jupyter Notebooks
Note
You should only be running notebooks for development, testing or simple analysis purposes. All computationally intensive work should be submitted as a batch job to the cluster and be run on the compute nodes in a script.
If you wish you to view and work with Python notebooks on OzSTAR, you simply need to launch a Jupyter Notebook server from one of the login nodes, and then connect to it. We'll show you how to connect via a web browser using port forwarding, or via VS Code.
Note that you'll need to connect to a specific login node e.g. user@tooarrana2.hpc.swin.edu.au
. You cannot rely on the generic ozstar
or nt
logins as a destination for remote persistent processes, since they are both round-robin addresses. You need to pick one of tooarrana1/2
or farnarkle1/2
and stick to it.
Launching a Jupyter Notebook Server
First, activate your Python environment that you wish to work in and ensure you have jupyter
installed in it.
Then, navigate to the directory where you want to serve notebooks from, and launch a Jupyter Notebook server
cd /path/to/your/notebooks
jupyter notebook --no-browser
Take note of which port the server is running on -- it will be displayed in the terminal output, along with a token/password.
You can also specify a port directly with the --port
flag e.g.
jupyter notebook --no-browser --port=8888
Running your notebook
Web Browser
To use Jupyter in a web browser for running your notebooks, you'll need to forward a port from your local machine to a port on the login node. You can do this with SSH tunnelling e.g.
ssh -L 8000:localhost:8888 user@tooarrana2.hpc.swin.edu.au
This command forwards port 8000 on your local machine to port 8888 on the login node.
This can be done either retroactively by opening another terminal and connecting to the same login node, or proactively in the same terminal by forwarding the port before you launch the Jupyter server.
Ensure that you launch the Jupyter server on the same port as you specified in the SSH command. e.g. port 8888 in this case.
Finally, open a web browser on your local machine and navigate to http://localhost:8000 to access the Jupyter server. You will be prompted for the token/password that was displayed in the terminal. Alternatively, you can append the token to the end of the url: http://localhost:8000/?token=YOUR_TOKEN, where you should replace YOUR_TOKEN
with your token.
By default, the Jupyter server will serve the default/classic notebook interface.
If you prefer the newer JupyterLab interface, you can append /lab
to the end of the url e.g. http://localhost:8000/lab, just make sure you have JupyterLab installed in your Python environment. Alternatively, you can launch the server with the command jupyter lab --no-browser
.
VS Code
To use your notebook in VS Code, you'll need connect to the login node using the Remote-SSH extension, and then connect your notebook's kernel to the Jupyter server running on the login node that you set up above.
Once again, ensure that you connect to the same login node as the one that you launched the Jupyter server on, and ensure that you have installed the Jupyter extension. You should not need to forward any ports.
Note
Some VS Code extensions, such as Jupyter, need to be installed in the remote environment; it is not enough to have them installed in your local VS Code.
Next, open your notebook and select the Kernel Picker button on the top right-hand side of the notebook (or run the Notebook: Select Notebook Kernel command from the Command Palette).
You will then need to connect to an 'Existing Jupyter Server' by entering the URL of the server (e.g. http://localhost:8888) and the token/password that was displayed in the terminal when you launched the Jupyter server. You can find more detailed steps in the VS Code documentation.
At this point you can run your notebook as you would normally, and the cells will be executed on the login node inside the Python environment that the Jupyter server was launched from.
Note
We have chosen to connect to an existing Jupyter server, rather than trying to launch the server directly from within VS Code. This is because it is difficult to get VS Code to correctly recognise your Python environment and modules. By following the steps above, you ensure that the server is running in the correct environment.
Other Languages
It is possible to work with languages other than Python in Jupyter notebooks, you just need the relevant kernels to be available to the Jupyter server. For example, to work with R, you can install/load IRkernel, or to work with Julia you'l need IJulia. We have modules for both on OzSTAR, which will set the right environment variables so that they can be found by Jupyter.
Follow the same procedure as above to launch a Jupyter server, and then connect to it using your preferred method.
If you go via a web browser, both the classic and JupyterLab interfaces will have a button to choose a kernel.
If you go via VS Code, you can select the kernel from the Kernel Picker button as described above, after you've entered the server URL.