depot/third_party/nixpkgs/doc/languages-frameworks/cuda.section.md

# CUDA {#cuda}

CUDA-only packages are stored in the `cudaPackages` packages set. This set
includes the `cudatoolkit`, portions of the toolkit in separate derivations,
`cudnn`, `cutensor` and `nccl`.

A package set is available for each CUDA version, so for example
`cudaPackages_11_6`. Within each set is a matching version of the above listed
packages. Additionally, other versions of the packages that are packaged and
compatible are available as well. For example, there can be a
`cudaPackages.cudnn_8_3` package.

To use one or more CUDA packages in an expression, give the expression a `cudaPackages` parameter, and in case CUDA is optional
```nix
{ config
, cudaSupport ? config.cudaSupport
, cudaPackages ? { }
, ...
}:
```

When using `callPackage`, you can choose to pass in a different variant, e.g.
when a different version of the toolkit suffices
```nix
mypkg = callPackage { cudaPackages = cudaPackages_11_5; }
```

If another version of say `cudnn` or `cutensor` is needed, you can override the
package set to make it the default. This guarantees you get a consistent package
set.
```nix
mypkg = let
  cudaPackages = cudaPackages_11_5.overrideScope (final: prev: {
    cudnn = prev.cudnn_8_3;
  }});
in callPackage { inherit cudaPackages; };
```

The CUDA NVCC compiler requires flags to determine which hardware you
want to target for in terms of SASS (real hardware) or PTX (JIT kernels).

Nixpkgs tries to target support real architecture defaults based on the
CUDA toolkit version with PTX support for future hardware.  Experienced
users may optimize this configuration for a variety of reasons such as
reducing binary size and compile time, supporting legacy hardware, or
optimizing for specific hardware.

You may provide capabilities to add support or reduce binary size through
`config` using `cudaCapabilities = [ "6.0" "7.0" ];` and
`cudaForwardCompat = true;` if you want PTX support for future hardware.

Please consult [GPUs supported](https://en.wikipedia.org/wiki/CUDA#GPUs_supported)
for your specific card(s).

Library maintainers should consult [NVCC Docs](https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/)
and release notes for their software package.

## Adding a new CUDA release {#adding-a-new-cuda-release}

> **WARNING**
>
> This section of the docs is still very much in progress. Feedback is welcome in GitHub Issues tagging @NixOS/cuda-maintainers or on [Matrix](https://matrix.to/#/#cuda:nixos.org).

The CUDA Toolkit is a suite of CUDA libraries and software meant to provide a development environment for CUDA-accelerated applications. Until the release of CUDA 11.4, NVIDIA had only made the CUDA Toolkit available as a multi-gigabyte runfile installer, which we provide through the [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit) attribute. From CUDA 11.4 and onwards, NVIDIA has also provided CUDA redistributables (“CUDA-redist”): individually packaged CUDA Toolkit components meant to facilitate redistribution and inclusion in downstream projects. These packages are available in the [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) package set.

All new projects should use the CUDA redistributables available in [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) in place of [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit), as they are much easier to maintain and update.

### Updating CUDA redistributables {#updating-cuda-redistributables}

1. Go to NVIDIA's index of CUDA redistributables: <https://developer.download.nvidia.com/compute/cuda/redist/>
2. Copy the `redistrib_*.json` corresponding to the release to `pkgs/development/compilers/cudatoolkit/redist/manifests`.
3. Generate the `redistrib_features_*.json` file by running:

    ```bash
    nix run github:ConnorBaker/cuda-redist-find-features -- <path to manifest>
    ```

    That command will generate the `redistrib_features_*.json` file in the same directory as the manifest.

4. Include the path to the new manifest in `pkgs/development/compilers/cudatoolkit/redist/extension.nix`.

### Updating the CUDA Toolkit runfile installer {#updating-the-cuda-toolkit}

> **WARNING**
>
> While the CUDA Toolkit runfile installer is still available in Nixpkgs as the [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit) attribute, its use is not recommended and should it be considered deprecated. Please migrate to the CUDA redistributables provided by the [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) package set.
>
> To ensure packages relying on the CUDA Toolkit runfile installer continue to build, it will continue to be updated until a migration path is available.

1. Go to NVIDIA's CUDA Toolkit runfile installer download page: <https://developer.nvidia.com/cuda-downloads>
2. Select the appropriate OS, architecture, distribution, and version, and installer type.

   - For example: Linux, x86_64, Ubuntu, 22.04, runfile (local)
   - NOTE: Typically, we use the Ubuntu runfile. It is unclear if the runfile for other distributions will work.

3. Take the link provided by the installer instructions on the webpage after selecting the installer type and get its hash by running:

   ```bash
   nix store prefetch-file --hash-type sha256 <link>
   ```

4. Update `pkgs/development/compilers/cudatoolkit/versions.toml` to include the release.

### Updating the CUDA package set {#updating-the-cuda-package-set}

1. Include a new `cudaPackages_<major>_<minor>` package set in `pkgs/top-level/all-packages.nix`.

   - NOTE: Changing the default CUDA package set should occur in a separate PR, allowing time for additional testing.

2. Successfully build the closure of the new package set, updating `pkgs/development/compilers/cudatoolkit/redist/overrides.nix` as needed. Below are some common failures:

| Unable to ... | During ... | Reason | Solution | Note |
| --- | --- | --- | --- | --- |
| Find headers | `configurePhase` or `buildPhase` | Missing dependency on a `dev` output | Add the missing dependency | The `dev` output typically contain the headers |
| Find libraries | `configurePhase` | Missing dependency on a `dev` output | Add the missing dependency | The `dev` output typically contain CMake configuration files |
| Find libraries | `buildPhase` or `patchelf` | Missing dependency on a `lib` or `static` output | Add the missing dependency | The `lib` or `static` output typically contain the libraries |

In the scenario you are unable to run the resulting binary: this is arguably the most complicated as it could be any combination of the previous reasons. This type of failure typically occurs when a library attempts to load or open a library it depends on that it does not declare in its `DT_NEEDED` section. As a first step, ensure that dependencies are patched with [`cudaPackages.autoAddOpenGLRunpath`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.autoAddOpenGLRunpath). Failing that, try running the application with [`nixGL`](https://github.com/guibou/nixGL) or a similar wrapper tool. If that works, it likely means that the application is attempting to load a library that is not in the `RPATH` or `RUNPATH` of the binary.
Project import generated by Copybara. GitOrigin-RevId: 5181d5945eda382ff6a9ca3e072ed6ea9b547fee 2022-04-15 01:41:22 +00:00			`# CUDA {#cuda}`

			CUDA-only packages are stored in the `cudaPackages` packages set. This set
			includes the `cudatoolkit`, portions of the toolkit in separate derivations,
			`cudnn`, `cutensor` and `nccl`.

			`A package set is available for each CUDA version, so for example`
			`cudaPackages_11_6`. Within each set is a matching version of the above listed
			`packages. Additionally, other versions of the packages that are packaged and`
			`compatible are available as well. For example, there can be a`
Project import generated by Copybara. GitOrigin-RevId: 6cee3b5893090b0f5f0a06b4cf42ca4e60e5d222 2023-07-15 17:15:38 +00:00			`cudaPackages.cudnn_8_3` package.
Project import generated by Copybara. GitOrigin-RevId: 5181d5945eda382ff6a9ca3e072ed6ea9b547fee 2022-04-15 01:41:22 +00:00
			To use one or more CUDA packages in an expression, give the expression a `cudaPackages` parameter, and in case CUDA is optional
			```nix
Project import generated by Copybara. GitOrigin-RevId: 18036c0be90f4e308ae3ebcab0e14aae0336fe42 2023-08-04 22:07:22 +00:00			`{ config`
			`, cudaSupport ? config.cudaSupport`
			`, cudaPackages ? { }`
			`, ...`
			`}:`
Project import generated by Copybara. GitOrigin-RevId: 5181d5945eda382ff6a9ca3e072ed6ea9b547fee 2022-04-15 01:41:22 +00:00			```

			When using `callPackage`, you can choose to pass in a different variant, e.g.
			`when a different version of the toolkit suffices`
			```nix
			`mypkg = callPackage { cudaPackages = cudaPackages_11_5; }`
			```

			If another version of say `cudnn` or `cutensor` is needed, you can override the
			`package set to make it the default. This guarantees you get a consistent package`
			`set.`
			```nix
			`mypkg = let`
Project import generated by Copybara. GitOrigin-RevId: b85ed9dcbf187b909ef7964774f8847d554fab3b 2023-08-22 20:05:09 +00:00			`cudaPackages = cudaPackages_11_5.overrideScope (final: prev: {`
Project import generated by Copybara. GitOrigin-RevId: 6cee3b5893090b0f5f0a06b4cf42ca4e60e5d222 2023-07-15 17:15:38 +00:00			`cudnn = prev.cudnn_8_3;`
Project import generated by Copybara. GitOrigin-RevId: 5181d5945eda382ff6a9ca3e072ed6ea9b547fee 2022-04-15 01:41:22 +00:00			`}});`
			`in callPackage { inherit cudaPackages; };`
			```
Project import generated by Copybara. GitOrigin-RevId: 40f79f003b6377bd2f4ed4027dde1f8f922995dd 2022-12-17 10:02:37 +00:00
			`The CUDA NVCC compiler requires flags to determine which hardware you`
			`want to target for in terms of SASS (real hardware) or PTX (JIT kernels).`

			`Nixpkgs tries to target support real architecture defaults based on the`
			`CUDA toolkit version with PTX support for future hardware. Experienced`
Project import generated by Copybara. GitOrigin-RevId: e182da8622a354d44c39b3d7a542dc12cd7baa5f 2022-12-28 21:21:41 +00:00			`users may optimize this configuration for a variety of reasons such as`
Project import generated by Copybara. GitOrigin-RevId: 40f79f003b6377bd2f4ed4027dde1f8f922995dd 2022-12-17 10:02:37 +00:00			`reducing binary size and compile time, supporting legacy hardware, or`
			`optimizing for specific hardware.`

			`You may provide capabilities to add support or reduce binary size through`
			`config` using `cudaCapabilities = [ "6.0" "7.0" ];` and
			`cudaForwardCompat = true;` if you want PTX support for future hardware.

			`Please consult [GPUs supported](https://en.wikipedia.org/wiki/CUDA#GPUs_supported)`
			`for your specific card(s).`

			`Library maintainers should consult [NVCC Docs](https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/)`
			`and release notes for their software package.`
Project import generated by Copybara. GitOrigin-RevId: f99e5f03cc0aa231ab5950a15ed02afec45ed51a 2023-10-09 19:29:22 +00:00
			`## Adding a new CUDA release {#adding-a-new-cuda-release}`

			`> WARNING`
			`>`
			`> This section of the docs is still very much in progress. Feedback is welcome in GitHub Issues tagging @NixOS/cuda-maintainers or on [Matrix](https://matrix.to/#/#cuda:nixos.org).`

			The CUDA Toolkit is a suite of CUDA libraries and software meant to provide a development environment for CUDA-accelerated applications. Until the release of CUDA 11.4, NVIDIA had only made the CUDA Toolkit available as a multi-gigabyte runfile installer, which we provide through the [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit) attribute. From CUDA 11.4 and onwards, NVIDIA has also provided CUDA redistributables (“CUDA-redist”): individually packaged CUDA Toolkit components meant to facilitate redistribution and inclusion in downstream projects. These packages are available in the [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) package set.

			All new projects should use the CUDA redistributables available in [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) in place of [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit), as they are much easier to maintain and update.

			`### Updating CUDA redistributables {#updating-cuda-redistributables}`

			`1. Go to NVIDIA's index of CUDA redistributables: <https://developer.download.nvidia.com/compute/cuda/redist/>`
			2. Copy the `redistrib_*.json` corresponding to the release to `pkgs/development/compilers/cudatoolkit/redist/manifests`.
			3. Generate the `redistrib_features_*.json` file by running:

			```bash
			`nix run github:ConnorBaker/cuda-redist-find-features -- <path to manifest>`
			```

			That command will generate the `redistrib_features_*.json` file in the same directory as the manifest.

			4. Include the path to the new manifest in `pkgs/development/compilers/cudatoolkit/redist/extension.nix`.

			`### Updating the CUDA Toolkit runfile installer {#updating-the-cuda-toolkit}`

			`> WARNING`
			`>`
			> While the CUDA Toolkit runfile installer is still available in Nixpkgs as the [`cudaPackages.cudatoolkit`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.cudatoolkit) attribute, its use is not recommended and should it be considered deprecated. Please migrate to the CUDA redistributables provided by the [`cudaPackages`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages) package set.
			`>`
			`> To ensure packages relying on the CUDA Toolkit runfile installer continue to build, it will continue to be updated until a migration path is available.`

			`1. Go to NVIDIA's CUDA Toolkit runfile installer download page: <https://developer.nvidia.com/cuda-downloads>`
			`2. Select the appropriate OS, architecture, distribution, and version, and installer type.`

			`- For example: Linux, x86_64, Ubuntu, 22.04, runfile (local)`
			`- NOTE: Typically, we use the Ubuntu runfile. It is unclear if the runfile for other distributions will work.`

			`3. Take the link provided by the installer instructions on the webpage after selecting the installer type and get its hash by running:`

			```bash
			`nix store prefetch-file --hash-type sha256 <link>`
			```

			4. Update `pkgs/development/compilers/cudatoolkit/versions.toml` to include the release.

			`### Updating the CUDA package set {#updating-the-cuda-package-set}`

			1. Include a new `cudaPackages_<major>_<minor>` package set in `pkgs/top-level/all-packages.nix`.

			`- NOTE: Changing the default CUDA package set should occur in a separate PR, allowing time for additional testing.`

			2. Successfully build the closure of the new package set, updating `pkgs/development/compilers/cudatoolkit/redist/overrides.nix` as needed. Below are some common failures:

			`\| Unable to ... \| During ... \| Reason \| Solution \| Note \|`
			`\| --- \| --- \| --- \| --- \| --- \|`
			\| Find headers \| `configurePhase` or `buildPhase` \| Missing dependency on a `dev` output \| Add the missing dependency \| The `dev` output typically contain the headers \|
			\| Find libraries \| `configurePhase` \| Missing dependency on a `dev` output \| Add the missing dependency \| The `dev` output typically contain CMake configuration files \|
			\| Find libraries \| `buildPhase` or `patchelf` \| Missing dependency on a `lib` or `static` output \| Add the missing dependency \| The `lib` or `static` output typically contain the libraries \|

			In the scenario you are unable to run the resulting binary: this is arguably the most complicated as it could be any combination of the previous reasons. This type of failure typically occurs when a library attempts to load or open a library it depends on that it does not declare in its `DT_NEEDED` section. As a first step, ensure that dependencies are patched with [`cudaPackages.autoAddOpenGLRunpath`](https://search.nixos.org/packages?channel=unstable&type=packages&query=cudaPackages.autoAddOpenGLRunpath). Failing that, try running the application with [`nixGL`](https://github.com/guibou/nixGL) or a similar wrapper tool. If that works, it likely means that the application is attempting to load a library that is not in the `RPATH` or `RUNPATH` of the binary.