Nvidia CUDA Compiler

{{Short description|Proprietary compiler by Nvidia}}

{{Infobox software

| name =

| title =

| logo =

| logo caption =

| logo size =

| logo alt =

| screenshot =

| caption =

| screenshot size =

| screenshot alt =

| author =

| developer = Nvidia

| released = {{Start date and age|2007|06}}

| latest release version = 12.6.0

| latest release date = {{Start date and age|2024|08}}

| latest preview version =

| latest preview date =

| programming language =

| operating system =

| platform =

| size =

| language = English

| language count =

| language footnote =

| genre = Compiler

| license = Proprietary

| website = {{URL|docs.nvidia.com/cuda/cuda-compiler-driver-nvcc}}

}}

Nvidia CUDA Compiler (NVCC) is a proprietary compiler by Nvidia intended for use with CUDA.

==Compiler==

CUDA code runs on both the central processing unit (CPU) and the graphics processing unit (GPU). NVCC separates these two parts: it sends the host code (the part that runs on the CPU) to a C/C++ compiler such as GNU Compiler Collection (GCC), Intel C++ Compiler (ICC), or Microsoft Visual C++ Compiler, and compiles the device code (the part that runs on the GPU) itself. NVCC is based on LLVM.{{cite web|title=CUDA LLVM Compiler |url=https://developer.nvidia.com/cuda-llvm-compiler|publisher=Nvidia Developer |access-date=Apr 6, 2016}} According to Nvidia's documentation, NVCC version 7.0 supports many language constructs defined by the C++11 standard and a few from C99. Version 9.0 added several more constructs from the C++14 standard.{{Cite web|url=https://docs.nvidia.com/cuda/cuda-c-programming-guide/|title=CUDA C++ Programming Guide |website=NVIDIA Documentation Hub |language=en-us|access-date=2019-06-28}}
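
The host/device split can be illustrated with a minimal sketch of a single .cu source file. The kernel name add_one and the helper add_one_on_gpu are hypothetical names chosen for illustration; the example assumes a CUDA-capable GPU and the CUDA runtime API.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Device code: NVCC compiles this kernel for the GPU.
__global__ void add_one(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i] += 1;
    }
}

// Host code: NVCC hands this part to the host compiler (for example g++ or cl).
// add_one_on_gpu is a hypothetical helper; it returns true on success.
bool add_one_on_gpu(int *host, int n) {
    int *dev = nullptr;
    size_t bytes = n * sizeof(int);
    if (cudaMalloc(&dev, bytes) != cudaSuccess) {
        return false;
    }
    bool ok = cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        int threads = 128;
        int blocks = (n + threads - 1) / threads;
        add_one<<<blocks, threads>>>(dev, n);  // launch the kernel on the GPU
        ok = cudaGetLastError() == cudaSuccess &&
             cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(dev);
    return ok;
}

int main() {
    int values[4] = {0, 1, 2, 3};
    if (!add_one_on_gpu(values, 4)) {
        std::fprintf(stderr, "CUDA call failed (is a GPU available?)\n");
        return 1;
    }
    // On success each element has been incremented once.
    std::printf("%d %d %d %d\n", values[0], values[1], values[2], values[3]);
    return 0;
}
```

The function marked __global__ is the device part; everything else in the file is ordinary host C++ that calls into the CUDA runtime.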

Any source file containing CUDA language extensions (.cu) must be compiled with NVCC. NVCC is a compiler driver: it works by invoking all the necessary tools and compilers, such as cudacc, g++, and cl. It can output C code (CPU code) that must then be compiled with the rest of the application using another tool, or it can output Parallel Thread Execution (PTX) or object code directly. An executable with CUDA code requires the CUDA core library (cuda) and the CUDA runtime library (cudart).
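
As a rough sketch of how these outputs might be requested, the leading comments below list a few NVCC invocations for a hypothetical file named version.cu. The file name and the helper runtime_version are assumptions made for illustration. The program itself only queries the CUDA runtime library (cudart) that the resulting executable depends on.

```cuda
// Hypothetical source file: version.cu
//
// Build an executable (NVCC links the CUDA runtime library by default):
//   nvcc -o version version.cu
// Emit PTX for the device code only:
//   nvcc -ptx version.cu
// Emit an object file to be linked with the rest of an application:
//   nvcc -c version.cu -o version.o
#include <cstdio>
#include <cuda_runtime.h>

// A trivial kernel so that the PTX output contains some device code.
__global__ void noop() {}

// Hypothetical helper: returns the CUDA runtime version, or -1 on error.
// The runtime encodes the version as 1000 * major + 10 * minor.
int runtime_version() {
    int version = 0;
    if (cudaRuntimeGetVersion(&version) != cudaSuccess) {
        return -1;
    }
    return version;
}

int main() {
    int version = runtime_version();
    if (version < 0) {
        std::fprintf(stderr, "could not query the CUDA runtime\n");
        return 1;
    }
    std::printf("CUDA runtime %d.%d\n", version / 1000, (version % 1000) / 10);

    int count = 0;
    if (cudaGetDeviceCount(&count) == cudaSuccess) {
        std::printf("visible CUDA devices: %d\n", count);
    }
    return 0;
}
```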

Other widely used libraries:

  • cuBLAS: Basic Linear Algebra Subprograms (BLAS) implementation
  • cuFFT: fast Fourier transform (FFT) implementation
  • CUDPP (Data Parallel Primitives): reduction, scan, sort
  • Thrust: reduction, scan, sort
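
The three Thrust primitives listed above can be sketched as follows. The helper names sum_on_device, prefix_sums_on_device, and sort_on_device are hypothetical; the example assumes the Thrust headers shipped with the CUDA Toolkit and a file compiled with NVCC.

```cuda
#include <cstdio>
#include <vector>
#include <thrust/copy.h>
#include <thrust/device_vector.h>
#include <thrust/reduce.h>
#include <thrust/scan.h>
#include <thrust/sort.h>

// Reduction: sum all elements on the GPU.
int sum_on_device(const std::vector<int> &v) {
    thrust::device_vector<int> d(v.begin(), v.end());
    return thrust::reduce(d.begin(), d.end(), 0);
}

// Scan: inclusive prefix sums, computed in place on the GPU.
std::vector<int> prefix_sums_on_device(const std::vector<int> &v) {
    thrust::device_vector<int> d(v.begin(), v.end());
    thrust::inclusive_scan(d.begin(), d.end(), d.begin());
    std::vector<int> out(v.size());
    thrust::copy(d.begin(), d.end(), out.begin());  // copy back to the host
    return out;
}

// Sort: ascending order, sorted on the GPU.
std::vector<int> sort_on_device(const std::vector<int> &v) {
    thrust::device_vector<int> d(v.begin(), v.end());
    thrust::sort(d.begin(), d.end());
    std::vector<int> out(v.size());
    thrust::copy(d.begin(), d.end(), out.begin());
    return out;
}

int main() {
    std::vector<int> input = {3, 1, 4, 1, 5};

    std::printf("sum: %d\n", sum_on_device(input));  // 14

    std::printf("prefix sums:");
    for (int x : prefix_sums_on_device(input)) std::printf(" %d", x);  // 3 4 8 9 14
    std::printf("\n");

    std::printf("sorted:");
    for (int x : sort_on_device(input)) std::printf(" %d", x);  // 1 1 3 4 5
    std::printf("\n");
    return 0;
}
```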

==References==

{{Reflist}}

===General===

  1. David B. Kirk and Wen-mei W. Hwu. Programming Massively Parallel Processors: A Hands-on Approach. Morgan Kaufmann, 2010.
  2. {{Cite web |title=Nvidia CUDA Compiler Driver NVCC |url=https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/ |url-status=live |archive-url=https://web.archive.org/web/20231013194328/https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/ |archive-date=Oct 13, 2023 |website=Nvidia Documentation Hub}}
  3. {{Cite web |title=CUDPP |url=http://gpgpu.org/developer/cudpp |url-status=dead |archive-url=https://web.archive.org/web/20181117222643/http://gpgpu.org/developer/cudpp |archive-date=Nov 17, 2018 |website=GPGPU}}