The CUDA C runtime is implemented in the cudart library. An application links against it either statically, through cudart.lib or libcudart.a, or dynamically, through cudart.dll or libcudart.so. Applications that link dynamically typically include the dynamic library (cudart.dll or libcudart.so) in their installation package.
All runtime functions are prefixed with cuda.
As mentioned in the Heterogeneous Programming section, the CUDA programming model assumes a system composed of a host and a device, each with its own separate memory. The Device Memory section gives an overview of the runtime functions used to manage device memory.
The Shared Memory section illustrates how to use shared memory, introduced in the Thread Hierarchy section, to maximize performance.
The Page-Locked Host Memory section introduces page-locked host memory, which is required to overlap kernel execution with data transfers between host and device memory.
The Asynchronous Concurrent Execution section describes the concepts and APIs used to enable asynchronous concurrent execution at various levels of the system.
The Multi-Device System section shows how the programming model extends to a system in which multiple devices are attached to the same host.
The Error Checking section describes how to properly check errors generated by the runtime.
The Call Stack section describes the runtime functions used to manage the CUDA C call stack.
The Texture and Surface Memory section presents the texture and surface memory spaces, which provide another way to access device memory and also expose a subset of the GPU texturing hardware.
The Graphics Interoperability section introduces the runtime functions that interoperate with the two main graphics APIs, OpenGL and Direct3D.
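To make the overview above more concrete, here is a minimal sketch of what typical runtime usage can look like: cuda-prefixed calls that allocate device memory, copy data between host and device, and check the returned error codes. The CHECK_CUDA macro, the scale kernel and the doubleOnDevice function are hypothetical names chosen for this illustration; they are not part of the runtime.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper macro (not part of the runtime):
// print the error string and abort if a runtime call fails.
#define CHECK_CUDA(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Illustrative kernel: doubles each element in place.
__global__ void scale(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Copies n floats to the device, doubles them there, and copies them back.
void doubleOnDevice(float *host, int n)
{
    float *dev = nullptr;
    size_t bytes = n * sizeof(float);

    CHECK_CUDA(cudaMalloc(&dev, bytes));                               // device memory
    CHECK_CUDA(cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice));  // host -> device

    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    scale<<<blocks, threads>>>(dev, n);
    CHECK_CUDA(cudaGetLastError());                                    // launch errors

    CHECK_CUDA(cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost));  // device -> host
    CHECK_CUDA(cudaFree(dev));
}

int main()
{
    const int n = 1024;
    float host[n];
    for (int i = 0; i < n; ++i) host[i] = (float)i;

    doubleOnDevice(host, n);
    std::printf("host[10] = %.1f\n", host[10]);
    return 0;
}
```

The sketch assumes a CUDA-capable device is present and that the file is compiled with nvcc. The host and device buffers are deliberately kept separate, mirroring the host/device memory split described above.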
3.2.1 Initialization
There is no explicit initialization function for the runtime. It initializes the first time a runtime function is called (more specifically, any function other than those in the device management and version management sections of the reference manual). Keep this in mind when timing runtime function calls and when interpreting the error code returned by the first call into the runtime.
During initialization, the runtime creates a CUDA context for each device in the system (the Context section describes CUDA contexts).
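The effect of implicit initialization on timing can be sketched as follows. The example times two consecutive calls to cudaFree(0), a commonly used idiom for triggering initialization, since freeing a null pointer performs no other work. The helper name timedFree is hypothetical, and the exact point at which initialization happens may differ between CUDA versions, so treat this as an illustration rather than a precise measurement method.

```cuda
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: times a single cudaFree(0) call in milliseconds
// and stores the error code that the call returned.
double timedFree(cudaError_t *status)
{
    auto start = std::chrono::steady_clock::now();
    *status = cudaFree(0);  // no-op free, but still a call into the runtime
    auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

int main()
{
    cudaError_t first, second;

    // The first call is expected to include the cost of runtime initialization.
    double firstMs = timedFree(&first);
    // A later call should reflect only the cost of the call itself.
    double secondMs = timedFree(&second);

    std::printf("first call:  %.3f ms (%s)\n", firstMs, cudaGetErrorString(first));
    std::printf("second call: %.3f ms (%s)\n", secondMs, cudaGetErrorString(second));
    return 0;
}
```

If initialization fails, for example because no usable device is available, the error is reported by the first call, which is why its error code deserves particular attention. When benchmarking, a warm-up call like this before the timed region keeps the initialization cost out of the measurement.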
To be continued...