Overview
The Hyper Computing Cluster PNV4h instance is equipped with A100 GPUs and supports NVLink and NVSwitch. To enable interconnection between GPUs, you must install the nvidia-fabricmanager service that matches the installed driver version. If you are using this instance type, follow this document to install the nvidia-fabricmanager service; otherwise, you may not be able to use the GPU instance properly.
Directions
This document uses driver version 470.103.01 as an example. Follow the steps below for installation, replacing the value of the version parameter with your driver version as needed.
Installing nvidia-fabricmanager Service
The installation varies by operating system. Run the commands for your operating system.
CentOS / RHEL:
version=470.103.01
yum -y install yum-utils
yum-config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel7/x86_64/cuda-rhel7.repo
yum install -y nvidia-fabric-manager-${version}-1
Ubuntu:
version=470.103.01
main_version=$(echo $version | awk -F '.' '{print $1}')
apt-get update
apt-get -y install nvidia-fabricmanager-${main_version}=${version}-*
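The variants above differ only in the package manager and package-name scheme: the yum package embeds the full driver version, while the apt package name uses only the major version. This mapping can be sketched in plain shell (a sketch; 470.103.01 is the example version used in this document, and the package-name patterns follow the commands above):

```shell
#!/bin/sh
# Build the fabricmanager package specs for both OS families
# from a single driver version string.
version=470.103.01
main_version=${version%%.*}   # text before the first '.' -> 470

# CentOS/RHEL (yum) package spec, as used above:
rpm_pkg="nvidia-fabric-manager-${version}-1"
# Ubuntu (apt) package spec, as used above:
deb_pkg="nvidia-fabricmanager-${main_version}=${version}-*"

echo "$rpm_pkg"
echo "$deb_pkg"
```

Parameter expansion (${version%%.*}) is an alternative to the awk pipeline used in the Ubuntu commands; both yield the same major version.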
Starting nvidia-fabricmanager Service
Run the following commands in sequence to enable the service at boot and start it immediately.
systemctl enable nvidia-fabricmanager
systemctl start nvidia-fabricmanager
Viewing nvidia-fabricmanager Service Status
Run the following command to view the service status.
systemctl status nvidia-fabricmanager
If the output contains "Active: active (running)", the service has been installed and started successfully.
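The check can also be scripted by grepping the status output for the "Active: active (running)" line that systemctl prints for a running unit. A minimal sketch (the status text here is illustrative sample output; on a real instance, set the variable with status=$(systemctl status nvidia-fabricmanager) instead):

```shell
#!/bin/sh
# Illustrative sample of systemctl status output for a running unit;
# the unit description line is an assumed example.
status="nvidia-fabricmanager.service - NVIDIA fabric manager service
   Loaded: loaded (/usr/lib/systemd/system/nvidia-fabricmanager.service; enabled)
   Active: active (running)"

# Report whether the service is running based on the Active line.
if echo "$status" | grep -q "Active: active (running)"; then
  echo "fabricmanager running"
else
  echo "fabricmanager NOT running"
fi
```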