# Hetzner GPU Server Setup

This repository hosts my cheatsheet for the Hetzner GPU server setup. Hopefully, it is also useful for others ;)

I opened the [discussion section](https://github.com/stefan-it/hetzner-gpu-server/discussions) so that other GEX44 users can discuss various things.

## GEX44

In March 2024, Hetzner thankfully released a new [generation of GPU servers](https://www.hetzner.com/press-release/new-gpu-server/), after their [first GPU server](https://web.archive.org/web/20210613172423/https://www.hetzner.com/news/blitzschnell-gestochen-scharf-neuer-dedicated-root-server-ex51-ssd-gpu/) back in March 2017!

The GEX44 comes with an Intel® Core™ i5-13500, 64GB of RAM and an NVIDIA RTX™ 4000 with 20GB of GPU RAM.
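
The advertised hardware can be verified directly from the shell. This is a small optional check, not part of the original notes, and the exact output depends on the machine:

```bash
$ lscpu | grep "Model name"   # should show the Intel Core i5-13500
$ free -h                     # should report roughly 64GB of RAM
$ lspci | grep -i nvidia      # should list the RTX 4000 SFF Ada GPU
```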

I will use the GPU mainly for fine-tuning NLP models with the awesome [Flair](https://github.com/flairNLP/flair) library. I am very excited about the new GEX44 server, as I've also used the old GPU server back in 2018 for releasing a lot of [Flair Embeddings](https://github.com/flairNLP/flair-lms)!

# Initial setup

After installing Ubuntu 22.04 (from the rescue console), the following steps are performed in the initial setup stage:

* Update and upgrade all Ubuntu 22.04 packages
* Upgrade from Ubuntu 22.04 to 24.04
* Install CUDA and the NVIDIA drivers

On the command line this looks like:

```bash
$ apt update && apt upgrade -y
$ reboot # To make sure ;)
$ do-release-upgrade -d # -d also offers the upgrade before it is enabled by default
```

Follow all upgrade prompts, and we end up with a working Ubuntu 24.04 installation.
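
To double-check that the release upgrade actually went through (a small optional check, not part of the original notes):

```bash
$ lsb_release -a   # "Description" should now read "Ubuntu 24.04 ..."
```
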
Next, all NVIDIA-related packages are installed:

```bash
$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
$ dpkg -i cuda-keyring_1.1-1_all.deb
$ apt-get update
$ apt-get -y install cuda-toolkit-12-5
```
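
Optionally, the freshly installed toolkit can be put on the `PATH` and checked. This is a minimal sketch assuming the default install location used by the NVIDIA packages (`/usr/local/cuda-12.5`):

```bash
$ echo 'export PATH=/usr/local/cuda-12.5/bin${PATH:+:${PATH}}' >> ~/.bashrc
$ source ~/.bashrc
$ nvcc --version   # should report "release 12.5"
```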

After that, we install the NVIDIA drivers and reboot our nice GEX44:

```bash
$ apt-get install -y cuda-drivers
$ reboot
```

Now let's check that everything is working:

```bash
$ nvidia-smi
Wed Jun 26 00:46:20 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 555.42.02              Driver Version: 555.42.02      CUDA Version: 12.5     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA RTX 4000 SFF Ada ...    Off |   00000000:01:00.0 Off |                  Off |
| 33%   58C    P8               7W /  70W |        2MiB / 20475MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                               |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                              |
+-----------------------------------------------------------------------------------------+
```

Amazing! The GEX44 is now ready to fine-tune nice models 🎉

# User account

We create a new (unprivileged) user and add it to the `sudo` group:

```bash
$ useradd -m -g users -s /bin/bash stefan
$ passwd stefan # Set a nice password here
$ usermod -aG sudo stefan
```

Then log in as the newly created user.
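
A minimal sketch of that login from a local machine; the server address and the key path are placeholders and not part of the original notes:

```bash
# On the local machine: copy an SSH key and log in
# (<server-ip> and the key path are placeholders)
$ ssh-copy-id -i ~/.ssh/id_ed25519.pub stefan@<server-ip>
$ ssh stefan@<server-ip>

# On the server: quick check that sudo works for the new user
$ sudo -v
```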

# PyTorch installation

PyTorch is installed in a fresh virtual environment (via `venv`), as this is the easiest way to get it running.
Possible alternatives are Anaconda or Docker with NVIDIA support, e.g. using an [NVIDIA PyTorch image](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/pytorch).
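
For the Docker route, a hedged sketch could look like this; it assumes Docker and the NVIDIA Container Toolkit are already installed, and the image tag is only an example:

```bash
# Pulls an NVIDIA PyTorch image (example tag) and checks GPU visibility inside the container
$ docker run --rm --gpus all nvcr.io/nvidia/pytorch:24.05-py3 \
    python -c "import torch; print(torch.cuda.is_available())"
```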

The following steps result in a working PyTorch installation with GPU support:

```bash
$ sudo apt install python3-venv
$ python3 -m venv venvs/dev
$ source venvs/dev/bin/activate
$ pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
```

Now let's test if CUDA is available via:

```bash
$ python3 -c "import torch; print(torch.cuda.is_available())"
True
```
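
As a small additional check (not part of the original notes), the detected GPU can also be printed by name:

```bash
$ python3 -c "import torch; print(torch.cuda.get_device_name(0))"
# should print something like "NVIDIA RTX 4000 SFF Ada Generation"
```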

Now we can start installing more libraries and fine-tuning our own models!
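
For example, Flair can be installed into the same virtual environment; this is just a sketch of the first step, the exact training setup depends on the task:

```bash
$ pip3 install flair
$ python3 -c "import flair, torch; print(flair.__version__, torch.cuda.is_available())"
```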