山西大学大数据科学与产业研究院

Generalization Performance of Pure Accuracy and Its Application in Selective Ensemble Learning

Authors: Jieting Wang, Yuhua Qian, Feijiang Li, Jiye Liang, Qingfu Zhang

Abstract:

Abstract—The pure accuracy measure is used to eliminate random consistency from the accuracy measure. Biases to both majority and minority classes in the pure accuracy are lower than that in the accuracy measure. In this paper, we demonstrate that compared with the accuracy measure and F-measure, the pure accuracy measure is class distribution insensitive and discriminative for good classifiers. The advantages make the pure accuracy measure suitable for traditional classification. Further, we mainly focus on two points: exploring a tighter generalization bound on pure accuracy based learning paradigm and designing a learning algorithm based on the pure accuracy measure. Particularly, with the self-bounding property, we build an algorithm-independent generalization bound on the pure accuracy measure, which is tighter than the existing bound of an order O(1/√N) (N is the number of instances). The proposed bound is free from making a smoothness or convex assumption on the hypothesis functions. In addition, we design a learning algorithm optimizing the pure accuracy measure and use it in the selective ensemble learning setting. The experiments on sixteen benchmark data sets and four image data sets demonstrate that the proposed method statistically performs better than the other eight representative benchmark algorithms.

Keywords： Pure Accuracy, Linear-fractional Measure, Generalization Performance Bound, Selective Ensemble Learning

Generalization_Performance_of_Pure_Accuracy_and_its_Application_in_Selective_Ensemble_Learning (1).pdf

Tue Aug 22 09:00:00 CST 2023