SVM-based data classification prediction-Italian wine classification identification, svm Italy

Last Update:2014-08-14 Source: Internet

Author: User

Tags svm

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

SVM-based data classification prediction-Italian wine classification identification, svm Italy

Wine data comes from the UCI database and records the chemical composition of wine 13 of different varieties in the same region of Italy, so as to achieve automatic wine Classification through scientific methods.

The data of this classification has a total of 178 samples, each of which has 13 attributes, and provides a correct classification for each sample, which is used to verify the accuracy of SVM classification.

First, we can draw a data visualization diagram:

% Load Test Data wine, including the data in the matrix of classnumber = 3, wine: 178*13, and the column vector load chapter_WineClass.mat of wine_labes: 178*1; % plot the box visualization map of test data figure; boxplot (wine, 'orientation', 'horizontal ', 'labels', categories); title ('box visualization map of wine Data ', 'fontsize', 12); xlabel ('attribute value ', 'fontsize', 12); grid on; % plot the dimension chart figuresubplot (, 1) of the test data ); hold onfor run = 1: 178 plot (run, wine_labels (run), '*'); endxlabel ('samples', 'fontsize', 10 ); ylabel ('category label', 'fontsize', 10); title ('class', 'fontsize', 10); for run = subplot (, run ); hold on; str = ['B B', num2str (run-1)]; for I = 1: 178 plot (I, wine (I, run-1 ), '*'); end xlabel ('samples', 'fontsize', 10); ylabel ('Property value', 'fontsize', 10); title (str, 'fontsize ', 10); end

(Figure 1)

(Figure 2)

Figure 1 shows the box visualization of wine data, and Figure 2 shows the box diagram of wine. It is difficult to tell which type of wine each type is. Next we will try to use SVM for classification.

Data preprocessing

% Selected training set and Test Set % use 1-30 of the first class, 60-95 of the second class, And 131-153 of the third class as the training set train_wine = [wine ,:); wine (60: 95, :); wine (131: 153, :)]; % the labels of the corresponding training set must also be separated. train_wine_labels = [wine_labels (); wine_labels (60: 95); wine_labels (131: 153)]; % convert 31-59 of the First Class, 96-130 of the second class, test_wine = [wine (31: 59, :); wine (96: 130, :); wine (154: 178, :)]; % The labels of the corresponding test set should also be separated. test_wine_labels = [wine_labels (31: 59); wine_labels (96: 130); wine_labels (154: 178)]; <strong> % data preprocessing </strong> % data preprocessing: normalize the training set and test set to the [0, 1] interval [mtrain, ntrain] = size (train_wine); [mtest, ntest] = size (test_wine); dataset = [train_wine; test_wine]; % mapminmax is the built-in normalization function of MATLAB [dataset_scale, ps] = mapminmax (dataset ', 0, 1 ); dataset_scale = dataset_scale '; train_wine = dataset_scale (1: mtrain, :); test_wine = dataset_scale (mtrain + 1) :( mtrain + mtest ),:);

SVM network creation, training, and Prediction

<Span style = "font-size: 12px;"> % SVM network training model = svmtrain (train_wine_labels, train_wine, '-c 2-g 1 '); % SVM network prediction [predict_label, accuracy, dec_value1] = svmpredict (test_wine_labels, test_wine, model); </span>

Result Analysis

% Result Analysis % actual classification and prediction classification chart of the test set % the chart shows that only one test sample is the correct figure; hold on; plot (test_wine_labels, 'o'); plot (predict_label, 'r * '); xlabel ('test set samples', 'fontsize', 12); ylabel ('category label ', 'fontsize', 12); legend ('actual Test Set category', 'prediction Test Set category'); title ('actual classification of Test Set and prediction category ', 'fontsize', 12); grid on;

The svm classification accuracy reaches 98.8764%, and only one of the 89 test samples is incorrectly classified. It can be seen that SVM is powerful in data classification!

END

Identification of Italian wines by svm

Abstract: In order to solve the problem that some products cannot use objective analysis methods for accurate quality identification in actual production, and to make up for the shortcomings of manual perception identification methods, a SVM-based product quality identification method is proposed, the product's chemical components are used for quality classification and identification. The experiment uses the wine data for simulation calculation. The results show that this method can effectively identify the quality of wine with high accuracy.

The dimensions of matlab SVM classification prediction parameters are inconsistent.

Inconsistent dimensions

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More