Kraeva E.M., Masich I.S.

Siberian State Aerospace University

named after academician M. F. Reshetnev, Russia

 

The criterion function and its modification in construction of patterns for a classification model

 

A modification of the criterion function of the optimization model in the method of logical analysis of data to increase different rules in the classification model is proposed. The empirical confirmation of the advisability of modifying the criterion function is given.

In [1] we have considered the optimization model, the decision of which is patterns in the data to construct the classifier.

Denote Ωs+ and Ωs are sets of n-dimensional binary vectors of positive and negative classes constituting the training data set.

The basis of this approach is the notion of a pattern. A positive pattern called a subcube of the space of Boolean variables B2t, which intersects the set Ωs+ and has a limited number of elements in common with the set Ωs. Negative pattern is defined similarly.

A positive ω-pattern for ω{0,1}t is a pattern containing a point ω.

We define corresponding subcube using variables yj:

 

 

Optimization model has the following form.

 

                                                           (1)

 for every , .       (2)

 

According to the criterion (1), we seek to maximize the coverage of the resulting pattern, i.e. the number of objects captured with the pattern. Each subsequent pattern maximizes its coverage, capturing objects that are representative of the class, and atypical class objects remain uncovered and in the final rule there are no patterns that take into account the objects of the class. Therefore, we obtain a set of similar patterns for the class, thereby reducing the quality of the classification. To obtain a classification model with higher different patterns, which can allocate significantly different subsets of objects, we propose to modify the criterion (1) as follows:

 

                                          (3)

 

where Kσ – is weight of the positive observations σΩs+, which is reduced when this observation is covered, thereby reducing its priority to participate in the formation of the next pattern in favor of uncovered observations.

In order to use the criterion (3) for the formation of patterns, it is necessary to set the initial weight for all objects and rule changes weights for objects that participated in the formation of the current pattern. We offer the initial weights is set to 1 for each object in the training set. Rule of changes for the weight of the object, which took part in the formation of this pattern:

 

,

where Ki, Ki+1 are weights of the covered object to the formation the current pattern and the next pattern, Nmax is parameter defined by the researcher, which means the maximum number of patterns that can cover the training sample object in the model.

Parameter Nmax for each class is given in the range from 1 to the maximum number of patterns constructed for this class. If Nmax is set to close or equal to the maximum number of patterns for this class, then the value of the criterion (3) tends to the value the criterion (1) for each pattern in the model, therefore, the new model works similarly to the previous classification. If Nmax tends to 1, then the number of patterns with a value of the criterion (3) greater than 0 is reduced, because initially all objects are captured and their weights are reset. In the new classification model, insufficient number of patterns is presented, which eventually is not capable to classify new incoming objects, i.e. classification quality decreases.

Thus, using the criterion (3), we obtain the patterns that cover substantially different subset of the objects.

According to the results obtained it can be noted that the use of an optimization model with the criterion (3) allows to simplify the classification model, reducing the number of patterns in the model in 2-3 times compared to the full set for a specific task. The accuracy of classification is not decreased or is decreased only slightly. When the optimization model with the criterion (3) is used, average degree of patterns becomes smaller, which means that the patterns themselves become clearer and easier to interpret.

The work is supported by the Grant of President of Russian Federation MK-1371.2013.8.

 

References

1.                 Masich I.S. Pattern search in data for solving practical problems of recognition. Materiály VI²I mezinárodní vědecko - praktická konference «Zprávy vědecké ideje - 2012». – Díl 21. Matematika. Fyzika: Praha. Publishing House «Education and Science», p. 14-15.