Skip to content

Latest commit

 

History

History
115 lines (82 loc) · 4.25 KB

File metadata and controls

115 lines (82 loc) · 4.25 KB

RPM Installation

Prerequisites

The Fleet Intelligence Agent RPM package has the following runtime dependencies:

  • datacenter-gpu-manager-4-proprietary (DCGM)
  • nvattest (NVIDIA Attestation SDK CLI, NVAT)
  • corelib (NVAT GPU evidence source dependency)

Before installing Fleet Intelligence Agent, ensure the following prerequisites are met:

  • NVIDIA package repository is configured (network or local CUDA repository) so datacenter-gpu-manager-4-proprietary, nvattest, and corelib can be installed
  • DCGM HostEngine 4.2.3 or newer
  • NVIDIA Datacenter Driver major version 510 or newer is installed
  • Install/upgrade commands are run as root or with sudo
  • dnf-plugins-core is installed (required for dnf config-manager)
  • Attestation for the fleetint use case only supports Blackwell and newer GPUs, and applies to non-CC mode systems
  • For NVSwitch systems (driver branch must match installed datacenter driver):
    • Hopper (pre-4th gen NVSwitch): install nvidia-driver:<driver-branch>/fm
    • Blackwell (4th gen NVSwitch): install nvidia-driver-<driver-branch>-open and nvlink-<driver-branch>

References:

Fleet Intelligence Agent package dependencies (datacenter-gpu-manager-4-proprietary, nvattest, and corelib) are available through NVIDIA's CUDA repository. Before installing Fleet Intelligence Agent, add the appropriate NVIDIA CUDA repository for your system.

Install dnf-plugins-core first if dnf config-manager is not available:

sudo dnf install -y dnf-plugins-core

RHEL/Rocky/AlmaLinux Systems

# RHEL/Rocky/AlmaLinux 8 (x86_64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
sudo dnf clean all

# RHEL/Rocky/AlmaLinux 9 (x86_64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo
sudo dnf clean all

# RHEL/Rocky/AlmaLinux 10 (x86_64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel10/x86_64/cuda-rhel10.repo
sudo dnf clean all

# RHEL/Rocky/AlmaLinux 8 (ARM64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/sbsa/cuda-rhel8.repo
sudo dnf clean all

# RHEL/Rocky/AlmaLinux 9 (ARM64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel9/sbsa/cuda-rhel9.repo
sudo dnf clean all

# RHEL/Rocky/AlmaLinux 10 (ARM64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel10/sbsa/cuda-rhel10.repo
sudo dnf clean all

Amazon Linux 2023

# Amazon Linux 2023 (x86_64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/amzn2023/x86_64/cuda-amzn2023.repo
sudo dnf clean all

# Amazon Linux 2023 (ARM64)
sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/amzn2023/sbsa/cuda-amzn2023.repo
sudo dnf clean all

After adding the CUDA repository, package dependencies (datacenter-gpu-manager-4-proprietary, nvattest, and corelib) are resolved automatically by dnf during Fleet Intelligence Agent installation.

Install package

Download the package from Latest stable release, then install:

# RHEL/Rocky/AlmaLinux/Amazon Linux (x86_64)
sudo dnf install ./fleetint-VERSION-1.x86_64.rpm

# RHEL/Rocky/AlmaLinux/Amazon Linux (ARM64)
sudo dnf install ./fleetint-VERSION-1.aarch64.rpm

Verify:

fleetint --version
systemctl status fleetintd

Update

Install the newer package version:

# RHEL/Rocky/AlmaLinux/Amazon Linux (x86_64)
sudo dnf install ./fleetint-VERSION-1.x86_64.rpm

# RHEL/Rocky/AlmaLinux/Amazon Linux (ARM64)
sudo dnf install ./fleetint-VERSION-1.aarch64.rpm

The service will automatically restart with the new version.

Uninstall

sudo dnf remove fleetint