Theano (Deep learning Tool) uses GPU for accelerated configuration and use

Source: Internet
Author: User
Tags theano

Recently used Theano wrote the MLP and CNN program, because the training sample large, CPU speed so slow, and then found a computer with Naivid graphics card configuration using the GPU, encountered a lot of problems, recorded as follows:


Platform Description:

System: WindowsXP

python:2.7, it is recommended to use Python (x, y) directly, including the Theano required NumPy library, save your own configuration

theano:0.6

cuda:3.0


1 Downloads

Download Install Theano:easy_install Theano

Download Cuda: Download drivers, toolkits, software development kits from http://www.nvidia.cn/object/cuda_get_cn_old.html, install sequentially

Note: Students are best to download the 3.0 version of the first, my earliest version of the 2.3, but found that Theano Cuda module on the 2.3 support is very poor, changed. \theano\sandbox\cuda\. CU inside a lot of code, such as Cudamemgetinfor (for the 3.0 version of the interface, 2.3 for the cumemgetinfor, see more http://stackoverflow.com/ questions/7386990/cuda-identifier-cudamemgetinfo-is-undefined), the final compilation is correct, but there is a difficult to deal with the link bug. When you finally switch to 3.0, only a small portion of the code is changed to run successfully (see instructions later).


2 configuration

Open the Theano configuration file (typically in the root directory of the user folder, such as C:\Documents and Settings\administrator\, named. theanorc.txt) and add the following configuration

[NVCC]

Flags=-lc:\python27\libs

Compiler_bindir=c:\program Files\Microsoft Visual Studio 9.0\vc\bin

Note: Version 2.3 only supports MSVC8.0 and MSVC9.0.

Then import Theano.sandbox.cuda to perform the initial test, the following configuration after successful


[Global]

device = GPU

Floatx = float32

The last run theano/misc/check_blas.py to test, or directly import Theano; Theano.test ()


3 problems encountered

1 class "Cudadeviceprop" has no conponent of named "Concurrentkernels"

Follow up on Cuda_ndarray.cu, near line 2796

#if cudart_version >= 3000
Put_in_dict (Dict, "Concurrentkernels", deviceprop.concurrentkernels);
#endif

Find Cudadeviceprop class under C:\CUDA, see there is no concurrentkernels, although vision is 3.0

Solution: Comment out the above section


2 Fatel Error C1083: Cannot open include file: Stdint.h:no such files or directory

Workaround:

To Googlecode download Http://msinttypes.googlecode.com/files/msinttypes-r26.zip, extract will get three files, put Inttypes.h and stdint.h to VC's include directory on it.

I installed the VS2008, installed to the default location, so the include path is:
C:\Program Files\Microsoft Visual Studio 9.0\vc\include


3 How to view GPU status

Download Gpuz


At last, although this machine with Nvidia is very old and old, with GPU acceleration, it's faster than running on my Y470.

GPU, very good very powerful.


Theano (Deep learning Tool) uses GPU for accelerated configuration and use

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.