Caffe Python Interface Learning (4) Mnist instance handwritten digit recognition

Last Update:2018-05-03 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

The following is mainly excerpt Denny blog content, more content everyone to see the original author it

A data preparation

Prepare a list of training set and test set pictures;

Two import Caffe Library, set file path

#-*-Coding:utf-8-*-ImportCaffeFrom CaffeImportLayers as l,params as P,proto,to_proto#Save path to set file root=‘/home/xxx/‘#Root directory train_list=root+‘Mnist/train/train.txt‘# training picture List Test_list=root+mnist/test/test.txt "#< Span style= "COLOR: #008000" > test picture list Train_proto=root+ '  Mnist/train.prototxt "# Training profile Test_proto=root+ "mnist/test.prototxt ' # test configuration file Solver_proto=root+ ' mnist/solver.prototxt "# parameter file

where Train.txt and test.txt files already have, the other three files that we need to write ourselves.

Note here: The General Caffe program is the first to convert the picture into a Lmdb file, but this is a bit cumbersome. So I do not convert, I directly with the original image to operate, the difference is directly with the picture operation, the mean value is difficult to calculate, so you can not reduce the mean.

Three build configuration Files

The configuration file is actually some txt document, but the suffix name is prototxt, we can write it directly into the editor, or it can be generated in code. Here, I use Python to build.

#Write a function that generates a configuration file Prototxtdef Lenet (img_list,batch_size,include_acc=False):#The first layer, the data entry layer, enters the ImageData format, label = L.imagedata (Source=img_list, Batch_size=batch_size, ntop=2, Root_folder=root, transform_param=dict (scale= 0.00390625))#Second layer: Convolution layer conv1=l.convolution (data, kernel_size=5, stride=1,num_output=20, Pad=0,weight_filler=dict (type=‘Xavier‘))#Pool Layer pool1=l.pooling (CONV1, Pool=p.pooling.max, kernel_size=2, stride=2)#Convolution layer conv2=l.convolution (pool1, kernel_size=5, stride=1,num_output=50, Pad=0,weight_filler=dict (type=‘Xavier‘))#Pool Layer pool2=l.pooling (Conv2, Pool=p.pooling.max, kernel_size=2, stride=2)#Fully connected layer fc3=l.innerproduct (pool2, Num_output=500,weight_filler=dict (type=‘Xavier‘))#Activate function layer Relu3=l.relu (FC3, in_place=True)#Full connection Layer FC4 = L.innerproduct (RELU3, Num_output=10,weight_filler=dict (type=‘Xavier‘))#Softmax Layer loss =L.softmaxwithloss (FC4, label)If INCLUDE_ACC:#The test phase requires a accuracy layer ACC =L.accuracy (FC4, label)Return To_proto (loss, ACC) else: return To_proto (loss) def  Write_net (): # Write Train.prototxt with open (Train_proto,  ' w" ) as F:f.write (str (Lenet (train_list,batch_size=64# write Test.prototxt with open (Test_proto, Span style= "COLOR: #800000" > ' w " ) as F:f.write (str (Lenet (test_list,batch_size=100, Include_acc=true)))

The configuration file is what we call the network. The network I generated here may not be the same as the original lenet, but it doesn't make much difference.

Quad Build Solver file

Similarly, you can write directly in the editor, or you can use code generation.

#Write a function to generate the parameter fileDefGen_solver (solver_file,train_net,test_net): s=Proto.caffe_pb2. Solverparameter () S.train_net =Train_net s.test_net.append (test_net) S.test_interval = 938#60000/64, Test interval parameter: Once all the pictures are trained, one Test s.test_iter.append (100)#10000/100 test iterations, need to iterate 100 times to complete the test of all data once s.max_iter = 9380#epochs, 938*10, maximum training times S.BASE_LR = 0.01#Basic Learning Rate S.momentum = 0.9#Momentum S.weight_decay = 5e-4#Weight decay term S.lr_policy =‘Step‘#Learning Rate Change Rule s.stepsize=3000#Learning Rate Change frequency S.gamma = 0.1#Learning Rate Change Index S.display = 20# screen display interval S.snapshot = 938 # Save Caffemodel interval s.snapshot_prefix =root+ ' mnist/lenet< Span style= "COLOR: #800000" > ' #caffemodel prefix s.type = ' sgd " # optimization algorithm S.solver_mode = proto.caffe_pb2. Solverparameter.gpu # acceleration #  write Solver.prototxt with open (Solver_file,  ' w ' ) as F:f.write (str (s))

Five-start training model

During the training process, the tests are also constantly being tested.

# Start training def Training (Solver_proto):    caffe.set_device (0)    Caffe.set_mode_gpu ()    solver =  Caffe. Sgdsolver (Solver_proto)    solver.solve ()

Finally, you can call the above function.

'__main__': Write_net () gen_solver (Solver_proto,train_proto,test_proto) Training (Solver_proto)

Six completed Python files

mnist.py

View Code

I put this file under the Mnist folder in the root directory, so you can use the following code to execute

sudo python mnist/mnist.py

During the training process, some Caffemodel are saved. How often to save, save how many times, can be set in the Solver parameter file.

I set the training to epoch,9000 multiple times, the test accuracy can reach 99%

Caffe Python Interface Learning (4) Mnist instance handwritten digit recognition

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Caffe Python Interface Learning (4) Mnist instance handwritten digit recognition

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

Caffe Python Interface Learning (4) Mnist instance handwritten digit recognition

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support