Author: finallyliuyu Note: Please indicate the source for data usage Download Test Data Resources include the total accuracy rate of cross-validation in the case that the dataset size is, and, and the feature dimensions are 10, 20, and respectively. The file named textcategorization_0_100_10 indicates that the size of the document set is 200 (100 articles in one category ).Article). The current feature dimension is 10. Linear. (In my experiment, libsvm uses linear kernels) Feature Word SelectionAlgorithmImpact on text classification accuracy (I): discusses whether the feature selection algorithm is effective The Influence of Feature Word Selection Algorithm on text classification accuracy (2): Describes the prerequisites and basics of this experiment and the intermediate data format. Influence of Feature Word Selection Algorithm on text classification accuracy (3): The number of feature words is discussed. The VSM model dimension has an influence on classification accuracy. The larger the number of feature words, the higher the VSM model dimension, more accurate classification Influence of Feature Word Selection Algorithm on text classification accuracy (4): classification accuracy in classical probability model (this model is used in textbooks) Influence of Feature Word Selection Algorithm on text classification accuracy (5): classification accuracy in probability model designed by layman like me |