"Bi thing" data mining algorithms--verification of accuracy

Source: Internet
Author: User

In the original: "Bi thing" data mining algorithms--verification of accuracy

Accuracy Validation Example 1 :--based on kingdoms Database

Data preparation:

Mining Model:
In order: Naive Bayes algorithm, cluster analysis algorithm, decision tree algorithm, neural network algorithm, logistic regression algorithm, correlation algorithm

Lift Chart:

Rank to:
1. Neural Network algorithm (92.69% 0.99)
2. Logistic regression algorithm (92.39% 0.99)
3. Decision Tree Algorithm (91.19% 0.98)
4. Correlation algorithm (90.6% 0.98)
5. Clustering analysis Algorithm (89.25% 0.96)
6. Naive Bayes algorithm (87.61 0.96)

Naive Bayes algorithm--Classification matrix

Description
The 538 samples of other classes had 482 predictions correctly, 32 were divided into military advisers, 24 were divided into generals, and the prediction accuracy was 89.59%.
20 samples of the adviser have 13 predictions correctly, 7 are divided into other classes, the prediction accuracy rate is 65%;
The general 112 samples had 92 predictions correctly, 16 were divided into other classes, 4 were divided into advisers, and the prediction accuracy was 82.14%.
Clustering analysis algorithm--Classification matrix

Description
The 538 samples of other classes had 536 predictions correctly, 2 were divided into general categories, and the prediction accuracy was 99.63%.
20 samples of the adviser have 0 predictions correctly, 20 are divided into other classes, the prediction accuracy rate is 0%;
The general 112 samples had 62 predictions correctly, 50 were divided into other classes, and the prediction accuracy rate was 55.36%.
Decision Tree algorithm--Classification matrix

Description
The 538 samples of the other classes had 538 predictions correctly, and the prediction accuracy was 100%;
20 samples of the adviser have 0 predictions correctly, 20 are divided into other classes, the prediction accuracy rate is 0%;
The general 112 samples had 73 predictions correctly, 39 were divided into other classes, and the prediction accuracy rate was 65.18%.
Neural Network algorithm--classification matrix

Description
The 538 samples of other classes had 524 predictions correctly, 5 were divided into military advisers, 9 were divided into generals, and the prediction accuracy was 97.4%.
20 samples of the adviser have 5 predictions correctly, 15 are divided into other classes, the prediction accuracy rate is 25%;
The general 112 samples had 92 predictions correctly, 20 were divided into other classes, and the prediction accuracy rate was 82.14%.
Logistic regression algorithm--Classification matrix

Description
The 538 samples of other classes had 526 predictions correctly, 6 were divided into military advisers, 6 were divided into generals, and the prediction accuracy was 97.77%.
20 samples of the adviser have 5 predictions correctly, 15 are divided into other classes, the prediction accuracy rate is 25%;
The general 112 samples had 88 predictions correctly, 24 were divided into other classes, and the prediction accuracy rate was 78.57%.
Correlation algorithm--classification matrix

Description
The 538 samples of other classes had 519 predictions correctly, 19 were divided into advisers, and the prediction accuracy was 96.47%.
20 samples of the adviser have 0 predictions correctly, 20 are divided into other classes, the prediction accuracy rate is 0%;
The general 112 samples had 88 predictions correctly, 24 were divided into other classes, and the prediction accuracy rate was 78.57%.
Classification Matrix--prediction of correct rate summary analysis:

Other

Consigliere

General

Neural Network algorithm

97.4%

25%

82.14%

Logistic regression algorithm

97.77%

25%

78.57%

Decision Tree algorithm

100%

0%

65.18%

Correlation algorithm

96.47%

0%

78.57%

Clustering Analysis algorithm

99.63%

0%

55.36%

Naive Bayes algorithm

89.59%

45g

82.14%

It can be seen that the naive Bayes algorithm has the highest accuracy in predicting the status of a strategist, reaching 65%, decision Tree Algorithm, association algorithm, clustering algorithm 0%, neural network algorithm and logistic regression algorithm 25%;
The decision tree algorithm has the highest accuracy in predicting other identities, reaching 100%.
Neural network algorithm and Naive Bayes algorithm are used to predict the accuracy of general identity, and reach 82.14%.

Accuracy Verification example 2:--based on stock data
Data preparation:



The mining model is in turn:
Stockclustering Clustering Analysis algorithm
Strockdecisiontrees Decision Tree Algorithm
Stockneuralnetwork Neural Network algorithm
Stocklogistic Logistic regression algorithm

Lift Chart:

rank to:
1. Logistic regression algorithm (49.73% 0.52)
2. Neural network algorithm (49.63% 0.53)
3. Clustering analysis Algorithm (48.13% 0.51)
4. Decision Tree Algorithm (47.28% 0.50)
Cluster analysis algorithm--Classification matrix:

Description
The flat 114 samples had 0 predictions correctly, 91 were divided into fall, 23 were divided into up, the prediction accuracy was 0%;
The 443 samples of the fall have 340 predictions correctly, 103 errors are up, and the forecast accuracy rate is 76.75%;
The 380-up sample has 111 predictions correctly, 269 errors fall, and the forecast accuracy rate is 29.21%.

Decision Tree algorithm--Classification matrix:

Description
The flat 114 samples have 0 predictions correctly, 114 mistakes fall, and the forecast accuracy rate is 0%;
443 of the 443 samples dropped were correctly predicted and the correct rate was 100%;
The 380-up sample has 0 predictions correctly, 380 errors fall, and the forecast accuracy rate is 0%.

Neural Network algorithm--Classification matrix:

Description
The flat 114 samples had 0 predictions correctly, 60 were divided into fall, 54 were divided into up, the prediction accuracy was 0%;
The 443 samples of the fall have 277 predictions correctly, 166 errors are up, and the forecast accuracy rate is 62.53%;
The 380-up sample has 188 predictions correctly, 192 errors fall, and the forecast accuracy rate is 49.47%.

Logistic regression algorithm--Classification matrix:

Description
The flat 114 samples had 0 predictions correctly, 89 were divided into fall, 25 were divided into up, the prediction accuracy was 0%;
The 443 samples of the fall have 380 predictions correctly, 63 errors are up, and the forecast accuracy rate is 85.78%;
The 380-up sample has 86 predictions correctly, 294 errors fall, and the forecast accuracy rate is 22.63%.

Classification Matrix--prediction of correct rate summary analysis:

Flat

Fell

Rose

Logistic regression algorithm

0%

85.78%

22.63%

Neural Network algorithm

0%

62.53%

49.47%

Clustering Analysis algorithm

0%

76.75%

29.21%

Decision Tree algorithm

0%

100%

0%

"Bi thing" data mining algorithms--verification of accuracy

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.