NVIDIA : Install HPC SDK |
Install NVIDIA HPC SDK. | |
| [1] | |
| [2] | Install NVIDIA HPC SDK. |
[root@dlp ~]# dnf group -y install "Development tools" [root@dlp ~]# dnf config-manager --add-repo https://developer.download.nvidia.com/hpc-sdk/rhel/nvhpc.repo [root@dlp ~]# dnf -y install nvhpc-22.5 environment-modules [root@dlp ~]# vi /etc/environment-modules/modulespath # add PATH like follows # This file defines the initial setup for the modulefiles search path
# Each line containing one or multiple paths delimited by ':' will be
# added to the MODULEPATH environment variable.
/usr/share/Modules/modulefiles:/etc/modulefiles:/usr/share/modulefiles:/opt/nvidia/hpc_sdk/modulefiles
[root@dlp ~]# source /etc/profile.d/modules.sh [root@dlp ~]# module avail ------------------------ /usr/share/Modules/modulefiles ------------------------ dot module-git module-info modules null use.own ----------------------- /opt/nvidia/hpc_sdk/modulefiles ------------------------ nvhpc-byo-compiler/22.5 nvhpc-nompi/22.5 nvhpc/22.5 [root@dlp ~]# [root@dlp ~]# module load nvhpc/22.5 nvc --version nvc 22.5-0 64-bit target on x86-64 Linux -tp haswell NVIDIA Compilers and Tools Copyright (c) 2022, NVIDIA CORPORATION & AFFILIATES. All rights reserved.[root@dlp ~]# nvaccelinfo CUDA Driver Version: 11070 NVRM version: NVIDIA UNIX x86_64 Kernel Module 515.48.07 Fri May 27 03:26:43 UTC 2022 Device Number: 0 Device Name: NVIDIA GeForce GTX 1060 6GB Device Revision Number: 6.1 Global Memory Size: 6373441536 Number of Multiprocessors: 10 Concurrent Copy and Execution: Yes Total Constant Memory: 65536 Total Shared Memory per Block: 49152 Registers per Block: 65536 Warp Size: 32 Maximum Threads per Block: 1024 Maximum Block Dimensions: 1024, 1024, 64 Maximum Grid Dimensions: 2147483647 x 65535 x 65535 Maximum Memory Pitch: 2147483647B Texture Alignment: 512B Clock Rate: 1847 MHz Execution Timeout: No Integrated Device: No Can Map Host Memory: Yes Compute Mode: default Concurrent Kernels: Yes ECC Enabled: No Memory Clock Rate: 4004 MHz Memory Bus Width: 192 bits L2 Cache Size: 1572864 bytes Max Threads Per SMP: 2048 Async Engines: 2 Unified Addressing: Yes Managed Memory: Yes Concurrent Managed Memory: Yes Preemption Supported: Yes Cooperative Launch: Yes Multi-Device: Yes Default Target: cc61 |
| [3] | Verify installation with a common user to run a test program. |
[cent@dlp ~]$ module load nvhpc/22.5 # create new #include <stdio.h>
int main() {
printf("Hello World\n");
}
# compile [cent@dlp ~]$ nvc -o helloworld helloworld.c # run [cent@dlp ~]$ ./helloworld Hello World # create new #include <iostream>
int main() {
std::cout << "Hello World!\n";
}
# compile [cent@dlp ~]$ nvc++ -o helloworld2 helloworld.cpp # run [cent@dlp ~]$ ./helloworld2 Hello World! ### Fortran program [cent@dlp ~]$ vi helloworld.f90 # create new program helloworld print *, 'Hello World!' end program helloworld # compile [cent@dlp ~]$ nvfortran -o helloworld3 helloworld.f90 # run [cent@dlp ~]$ ./helloworld3 Hello World! |
No comments:
Post a Comment