Timeout Detection & RedR (TDR)

Source: Internet
Author: User
Timeout Detection & RedR (TDR)

NVIDIA®Nsight™Development Platform, Visual Studio Edition 2.2 User Guide
Send feedback

TDR stands for timeout detection and recovery. this is a feature of the Windows operating system which detects response problems from a graphics card, and recovers to a functional desktop by resetting the card. if the operating system does not receive a response from a graphics card within a certain amount of time (default is 2 seconds), the operating system resets the graphics card.

Before TDR existed, problems of this nature wocould have resulted in a system freeze and required a reboot of the operating system. if TDR is enabled and you see the TDR error message, "display driver stopped responding and has recovered," This means that the Windows operating system reset the display driver.

There are three different possible debugging deployments:

  • Local debugging with a single GPU,
  • Local debugging with multiple GPUs, or
  • Remote debugging.

Choose the one that most closely reflects your NVIDIA nsight setup:

Local debugging with a single GPU

Disabling TDR removes a valuable layer of protection, so it is generally recommended that you keep it enabled.

However, setting the TDR delay too low can cause the debugger to fail for one of two reasons:

  • Debugging on some GPUs will fails with a TDR delay of less than 10 seconds.
  • Debug builds of Cuda kernels run more slowly and may intrinsically require additional time to complete. With too low of a tdr delay, the kernels may not have enough time to complete.

Therefore, if you are using local debugging with a single GPU, it's recommended that you leave TDR enabled, and set the delay to 10 seconds.

To enable TdR and change the delay, do the following:

  1. Right-click the nsight monitor icon in the system tray.
  2. SelectOptions.



  3. In the options window onGeneralTab, SetWddm TDR EnabledToTrue.
    ChangeWddm TDR DelayFrom the default setting10.

 

Local debugging with multiple GPUs or remote debugging

When using either a local debugging configuration with multiple GPUs, or a remote debugging configuration, it's important to disable TDR. this is because with most Cuda applications, a TDR means that any debugging operation after the TDR will fail. you will not be able to step, set breakpoints, view variables, etc. the application will receive a grid launch failure, andCUcontextWill begin to report errors.

Having TDR enabled can interfere with GPU debugging because the graphics card is perceived by the operating system as unresponsive when the execution of a target application is paused or when the debugger is using Ming certain operations.

To disable TDR, do the following:

  1. Right-click the nsight monitor icon in the system tray.
  2. SelectOptions.



  3. In the options window onGeneralTab, SetWddm TDR EnabledToFalse.

 

For more information about TDR, see:

Http://www.microsoft.com/whdc/device/display/wddm_timeout.mspx

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.