Affine-Gradient Based Local Binary Pattern Descriptor for Texture Classiffication

05/19/2017 ∙ by You Hao, et al. ∙ Institute of Computing Technology, Chinese Academy of Sciences 0

We present a novel Affine-Gradient based Local Binary Pattern (AGLBP) descriptor for texture classification. It is very hard to describe complicated texture using single type information, such as Local Binary Pattern (LBP), which just utilizes the sign information of the difference between the pixel and its local neighbors. Our descriptor has three characteristics: 1) In order to make full use of the information contained in the texture, the Affine-Gradient, which is different from Euclidean-Gradient and invariant to affine transformation is incorporated into AGLBP. 2) An improved method is proposed for rotation invariance, which depends on the reference direction calculating respect to local neighbors. 3) Feature selection method, considering both the statistical frequency and the intraclass variance of the training dataset, is also applied to reduce the dimensionality of descriptors. Experiments on three standard texture datasets, Outex12, Outex10 and KTH-TIPS2, are conducted to evaluate the performance of AGLBP. The results show that our proposed descriptor gets better performance comparing to some state-of-the-art rotation texture descriptors in texture classification.

READ FULL TEXT VIEW PDF
POST COMMENT

Comments

There are no comments yet.

Authors

page 4

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Introduction

Texture is the most fundamental information on which the majority of all living organisms base their visual cognition and is a key component of computer vision system

[8]. Basically, all the digital images can be regarded as texture. Texture analysis has been applied to many visual problems such as material categorization, surface inspection, medical image analysis, object recognition, image segmentation, pedestrian detection, face analysis and so on.

Over the years, lots of texture descriptors have been proposed [9, 18, 21, 17]. Among these descriptors, local patterns have achieved good performance in most texture applications [15, 12, 1]. In particular, LBP is an efficient descriptor for describing local structures [15]. LBP descriptors have already demonstrated powerful discriminative capability, low computational complexity, and low sensitivity to illumination variation. For further improving the discrimination of LBP, a large number of LBP variants have been proposed [11]. Most of these changes make efforts on the following three directions.

First is to utilize different forms of information from the original textures. Guo et al. proposed Complete LBP which utilized the sign and magnitude information of local neighborhood in the descriptor [6]. Some other methods concentrate on the local derivative information respected to a local region, such as LDP [23], CLDP [22], LDDP [5], POEM [20] and so on. Second is rotation invariance, which is an important topic in texture classification. Many methods have been proposed to achieve rotation invariance, such as SRP [10, 19], SIFT [12] and so on. Third is feature selection. The exponential increasing in the number of features with the patch size is a limitation for the traditional LBP. The uniform LBP descriptor proposed by Ojala et al. [15] is the first attempt to solve this problem.

The main contributions of the paper are threefold. Firstly, we propose the Affine-Gradient based method to describe texture information. Affine-Gradient (AG) has some properties that Euclidean-Gradient (EG) does not have, which will be elaborated detailedly in the following. Secondly, an improved method for determining the local reference direction is proposed to reach rotation invariance, which is fast to compute and effective for the rotation transformations. Finally, we propose a simple but effective feature selection method considering both the distribution of patterns and the intraclass variance on the training datasets. Experiments show that the proposed feature selection method not only increases the discriminative power but also reduce the dimension of descriptor effectively.

2 Affine-Gradient based Local Pattern Descriptor

In this section we elaborate our approach in detail. First, we give a brief review of LBP. Second, we discuss how to make full use of multi-information, especially Affine-Gradient (AG), for texture classification. The properties of AG are discussed in detail. Then we discuss the method we proposed to achieve the rotation invariance. Finally, the criterion for feature selection are discussed.

2.1 Overview of LBP Method

Th traditional LBP operator extracts information that is invariant to local gray-scale variations in the image. It is computed at each pixel location, considering the values of a small circular neighborhood around the central pixel . Then, the LBP is defined as following:

(1)

where is the central pixel and are the values of its neighbors. is the index of the neighbor, is the radius of the circular neighborhood and is the number of pixels in the neighborhood. Then the histogram of these patterns is used to describe the texture of the image.

There are three obvious disadvantages of LBP. First, it has no rotation invariance. Second, it is just 1-th order sign information used in the descriptor. Third is the exponentially length increasing with the parameter . The proposed method has been improved in these three direction.

2.2 Affine-Gradient based Descriptors

In here, we propose the method based on the AG information to increase the discrimination of the descriptor. The Euclidean Gradient (EG) can de defined as . It is 2-norm of gradient in Euclidean space that remains invariant only under Euclidean transformation.

Olver et al. [16] proposed that there are two basic relative affine differential invariant of 2-order in two-dimensional affine spaces as following:

(2)
(3)

All other 2-order differential invariants can be made up of these two expressions. And their ratios constitute absolute invariant of differential in affine space. The affine gradient magnitude () can be defined as equation (4). In order to avoid the calculation fault of zero-denominator, we can make some changes to the definition as .

(4)

The Affine-Gradient is superior than Euclidean-Gradient (EG), because AG is invariant for the affine transformation, and the EG just remains invariant under Euclidean transformation. Using the AG information can improve the robustness of descriptor for the geometric transformation. Ge et al. constructed a new descriptor using the AG to replace the EG in SIFT, which get much better performance than the original SIFT [2]. The gradient and AG information are shown in Fig. 1.

Figure 1: The EG and AG information of image example: (a) image example; (b) EG magnitudes of example; (c) AG of example range in (0-0.2); (d) AG of example range in (0.2-1).

In Fig. 2 and 2, we can see that the histogram of EG is much more continuous and smooth than that of AG. In fact, the range of AG is from 0 to 162, not limited to 0 to 1 corresponding to Fig.2. It’s just more sparse where the value bigger than 1. But the distribution of EG just ranges form 0 to 763 corresponding to Fig. 2

. So intuitively, the information of AG ranging (0,1) probably corresponding to that of EG as shown in Fig.

1 and 1. And there are some local extreme information in the AG as shown in Fig. 1.

Figure 2: The histogram of EG and AG: (a) histogram of the gradient; (b) histogram of the AG.

For further verification of the validity of AG, experiments are conducted on Outex12 dataset. The Local Gradient Pattern (LGP) and Local Affine-Gradient Patter (LAGP) can be defined as

(5)
(6)

The function is defined in equation (1). The Multi-Information based descriptor MI-G, can be defined as the concatenation of LGP and LBP. Similarly, MI-AG is the concatenation of LAGP and LBP. Then the experimental results are listed in Table 1.

Problem form
Outex12 55.26 58.04 58.69
71.37 73.49 79.28
56.98 58.03 60.02
65.09 77.62 77.65
Table 1: Results of Multi-Information based descriptors on Outex12

From the results, we can see that the Multi-Information descriptor based on Affine-Gradient get the best performance in all scenarios. It was demonstrated that the AG information can substantially increase the discriminative power of the descriptors.

2.3 Rotation Invariance

Metha et al. [13] proposed a method that quantizing the directions into discrete values, then make direction with the maximum magnitude of the difference as the reference direction. But this definition discard the sign information of the magnitude and will assign the opposite directions into the same one. In this paper, we take both the sign and magnitude of the discrete directions into consideration. The reference direction can be defined as:

(7)

where is the sign function defined in equation (1). The proposed descriptor is computed by rotating the weights with respect to the reference direction. The rotation invariance LBP (roLBP) can be defined as

(8)

Applying the reference direction selection method to the LAGP descriptor. We can get the rotation invariant descriptor roLAGP as following:

(9)

Then the final descriptor AGLBP can be defined as the concatenation of roLBP and roLAGP.

(10)

2.4 Feature Selection

It is observed the dimensionality of descriptors also increases exponentially with the number of neighboring pixels. In [13], proposed a method depending on the distribution of patterns in the training dataset. Besides, some patterns may be negative to the final classification result. So in our method, the intraclass variance of training datasets is also chosen as the evaluation for feature selection.

In the statistical description, variance is defined as, where is mean value of the array. The distribution of the intraclass variance of all patterns are computed from the training dataset, as shown in Fig. 3.

(a)
(b)
Figure 3: The intraclass variance distribution for roLBP on Outex12 dataset: (a) The variance distribution of roLBP in Outex12 training dataset; (b) The variance distribution of roLAGP in Outex12 training dataset.

The bins of the histogram are sorted in descending order. Then there will be two method for feature selection. One selects the top patterns in the ordered list, the other selects bins which is less than a threshold as the final descriptor. The final patterns selected depend on the threshold parameter or and the training datasets. The final dimensionality of the descriptor is not constant. It varies across different datasets. The accuracy-parameter curve of the two method for roLBP on Outex12 dataset are plotted in Fig. 4.

(a)
(b)
Figure 4: The accuracy-parameter curve for roLBP on Outex12 dataset: (a) the accuracy-N curve of roLBP on Outex12 dataset; (b) the accuracy- curve of roLBP on Outex12 dataset.

It can be observed in Fig. 4(b) that the classification accuracy reach the peak with the threshold value almost between 1.6-2.0, just over the peak of distribution corresponding to Fig. 3(a) . This values results in a significant reduction of the dimensionality.

Thus, the proposed approach consider both the statical frequency and the intraclass variance of the training textures, which not only reduces the dimensionality of descriptors, but also improves the classification accuracy. The effective of the proposed approach will be demonstrated in next section.

2.5 Classification method

Some state-of-the-art methods, such as artificial neural network (ANN), SVM, AdaBoost, can achieve outstanding classification performance, but these methods require complex learning procedure and may influence analysis of discriminative capabilities of features. To make a fair comparison with some other approaches, the Nearest Neighbor (NN) classifier based on the Chi-Square distance was performed as our classification method. The effectiveness of the Chi-Square distance for classification is demonstrated in

[4, 5].

3 Experiments

To evaluate the proposed descriptor (AGLBP), three experiments are conducted on texture datasets: Outex10, Outex12 and KTH-TIPS2. Outex10 and Outex12 datasets are for rotation invariant texture classification with rotation and illumination deformations. The KTH-TIPS2 is for material categorization and includes scale and viewpoints variations. The parameter of proposed method is set to 2 in all our experiments.

3.1 Outex12

Outex is a framework for empirical evaluation of texture classification algorithms[14]. First we conduct experiment on the Outex12 dataset. It consists of 9120 images, which are separated into 24 different texture classes captured with different illuminations and rotations. This dataset contains 20 training images and 360 (2*9*20) testing images under two different illumination and 9 different orientation for each class. In experiment, following two problem proposed in the dataset[14], problem 000 and 001. Considering the length of the final descriptor is depending on the parameter (R,P), we use a conservative setting of the parameter as (1,8),(2,12),(3,16). All the LBP-based methods were performed and the results are shown in Table 2.

Problems (R,P)
Outex10 (1,8) 50.20 57.44 82.78 74.38 72.03 91.40 63.72
(2,12) - 59.62 91.48 86.74 90.52 92.18 95.43
(3,16) - 61.35 95.76 88.92 97.03 94.37 99.22
Outex12-000 (1,8) 54.21 55.81 72.26 65.93 70.85 76.41 61.99
(2,12) - 57.85 86.78 82.66 88.49 86.80 93.31
(3,16) - 58.56 93.50 83.98 91.08 90.85 97.84
Outex12-001 (1,8) 56.32 58.15 70.39 64.26 77.24 77.08 67.50
(2,12) - 57.08 84.77 75.86 91.34 84.09 94.83
(3,16) - 59.49 92.97 79.63 92.40 84.76 97.38
KTH-TIPS2 (1,8) 90.97 85.85 83.65 82.78 88.73 78.98 81.28
(2,12) - 87.92 89.75 87.95 90.87 83.00 95.23
(3,16) - 91.95 94.36 91.52 91.85 85.10 97.12
Table 2: Experiment results of LBP based methods on different datasets

Among these methods, the proposed method with setting (3,16) has achieved the highest accuracy of 97.84% for problem 000 and 97.38% for problem 001. For further analysis, we compare our method with some other state-of-the-art methods. The results are shown in Table 3. It can be seen that the proposed descriptor achieves the best result, the close second is , which get the accuracy 97.15% for problem 000 and 95.37% for problem 001.

3.2 Outex10

Then experiment is conducted on the Outex10 dataset, which includes 4320 images of 24 different classes. These images are captured under the same illumination but rotated at nine different angles. There are 20 images at each angle for each class. Following the problem proposed in the dataset[14], 480 images captured at angle are taken as the training set and the rest 3840 images captured at other angles used for testing.

The results with various setting are shown in Table 2. For further analysis, AGLBP are compared with some other state-of-the-art approaches. The result of these methods are also shown in Table 3. It can be observed that AGLBP performs well under various rotation deformations. Among all, our method with setting (3,16) has achieved the highest accuracy 99.22%, just a little improvement on the results to the 99.19%, which achieved by .

3.3 KTH-TIPS2 Dataset

Experiment on the KTH-TIPS2 dataset has also been conducted for material classification. The KTH-TIPS2 database contains 11 texture classes with different materials. For each class, the images are captured from 4 different samples of materials. And for each sample, 9 different scales with 4 different illumination and 3 different poses are conducted for the imaging. In this experiment, following problem proposed in most research[7, 3], images of one random sample are selected from each class are taken as the training dataset, images from the other samples are taken as the testing dataset.

All the methods were performed and the results are shown in Table 2. As the same, AGLBP is also compared with some other state-of-the-art approaches. The result of these methods are shown in Table 3. The proposed descriptor outperforms all other descriptors again. It can be concluded that our method is effective for texture classification.

Problems
Outex10 95.76 73.16 74.12 97.03 94.37 93.59 92.00
Outex12-000 93.50 63.48 70.16 91.08 90.85 91.34 90.46
Outex12-001 92.97 68.48 68.48 92.40 84.76 92.83 91.74
KTH-TIPS2 94.36 92.74 92.15 91.85 85.10 93.50 95.46
Problems
Outex10 96.64 95.34 98.68 - - 99.19 99.22
Outex12-000 82.79 93.96 88.38 - - 97.15 97.84
Outex12-001 90.08 89.94 92.17 - - 95.37 97.38
KTH-TIPS2 - - - 98.50 96.41 96.78 97.12
Table 3: Experiment results of descriptors on different datasets

4 Conclusion

In this paper we have proposed an Affine-Gradient based Local Binary Pattern (AGLBP) descriptor for texture classification. Affine-Gradient is different from the Euclidean-Gradient and has been proved to have a good improvement for texture classification. In addition, we have proposed an improved method for determining the local reference direction to reach rotation invariance. Importantly, the dimension increasing bringing by multi-information is also alleviated by proposed feature selection method, which considering both the statistical frequency and the intraclass variance of the training texture. Three extensive experiments have been conducted on texture datasets including rotating, scaling and viewpoint deformations. The results demonstrate that the AGLBP performed better than some state-of-the-art approaches for texture classification. The AGLBP utilize the Affine-Gradient which has been demonstrated robust for the viewpoint deformation. For further research, information invariant for projective transformation should be utilized to enhance the robustness to viewpoint deformation.

References

  • [1]

    Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. vol. 1, pp. 886–893. IEEE (2005)

  • [2] Ge, J., Cao, W., Zhou, W., Gong, M., Liu, L., Li, H.: A local feature descriptor under color affine trans-formation. Journal of Computer-Aided Design & Computer Graphics 25(1), 25–33 (2013), [In Chinese]
  • [3] Guo, Y., Zhao, G., Pietikäinen, M.: Texture classification using a linear configuration model based descriptor. In: BMVC. pp. 1–10 (2011)
  • [4] Guo, Y., Zhao, G., Pietikäinen, M., Xu, Z.: Descriptor learning based on fisher separation criterion for texture classification. In: Asian Conference on Computer Vision. pp. 185–198. Springer (2010)
  • [5] Guo, Z., Li, Q., You, J., Zhang, D., Liu, W.: Local directional derivative pattern for rotation invariant texture classification. Neural Computing and Applications 21(8), 1893–1904 (2012)
  • [6] Guo, Z., Zhang, L., Zhang, D.: A completed modeling of local binary pattern operator for texture classification. IEEE Transactions on Image Processing 19(6), 1657–1663 (2010)
  • [7] Guo, Z., Zhang, L., Zhang, D.: Rotation invariant texture classification using lbp variance (lbpv) with global matching. Pattern recognition 43(3), 706–719 (2010)
  • [8] Haindl, M., Filip, J.: Visual texture: Accurate material appearance measurement, representation and modeling. Springer Science & Business Media (2013)
  • [9] Haralick, R.M., Shanmugam, K.: Textural features for image classification. IEEE Transactions on systems, man, and cybernetics 3(6), 610–621 (1973)
  • [10] Liu, L., Fieguth, P., Clausi, D., Kuang, G.: Sorted random projections for robust rotation-invariant texture classification. Pattern Recognition 45(6), 2405–2418 (2012)
  • [11] Liu, L., Fieguth, P., Guo, Y., Wang, X., Pietikäinen, M.: Local binary features for texture classification: Taxonomy and experimental study. Pattern Recognition 62, 135–160 (2017)
  • [12] Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International journal of computer vision 60(2), 91–110 (2004)
  • [13] Mehta, R., Egiazarian, K.: Dominant rotated local binary patterns (drlbp) for texture classification. Pattern Recognition Letters 71, 16–22 (2016)
  • [14] Ojala, T., Maenpaa, T., Pietikainen, M., Viertola, J., Kyllonen, J., Huovinen, S.: Outex-new framework for empirical evaluation of texture analysis algorithms. In: Pattern Recognition, 2002. Proceedings. 16th International Conference on. vol. 1, pp. 701–706. IEEE (2002)
  • [15] Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on pattern analysis and machine intelligence 24(7), 971–987 (2002)
  • [16] Olver, P.J., Sapiro, G., Tannenbaum, A.: Affine invariant detection: edge maps, anisotropic diffusion, and active contours. Acta Applicandae Mathematicae 59(1), 45–77 (1999)
  • [17] Porter, R., Canagarajah, N.: Robust rotation-invariant texture classification: wavelet, gabor filter and gmrf based schemes. IEE Proceedings-Vision, Image and Signal Processing 144(3), 180–188 (1997)
  • [18] Qian, X., Liu, G., Guo, D., Li, Z., Wang, Z., Wang, H.: Object categorization using hierarchical wavelet packet texture descriptors. In: Multimedia, 2009. ISM’09. 11th IEEE International Symposium on. pp. 44–51. IEEE (2009)
  • [19] Skibbe, H., Reisert, M., Schmidt, T., Brox, T., Ronneberger, O., Burkhardt, H.: Fast rotation invariant 3d feature computation utilizing efficient local neighborhood operators. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(8), 1563–1575 (2012)
  • [20]

    Vu, N.S., Caplier, A.: Enhanced patterns of oriented edge magnitudes for face recognition and image matching. IEEE Transactions on Image Processing 21(3), 1352–1365 (2012)

  • [21]

    Wu, W.R., Wei, S.C.: Rotation and gray-scale transform-invariant texture classification using spiral resampling, subband decomposition, and hidden markov model. IEEE Transactions on Image Processing 5(10), 1423–1434 (1996)

  • [22] Yin, S., Dai, X., Ouyang, P., Liu, L., Wei, S.: A multi-modal face recognition method using complete local derivative patterns and depth maps. Sensors 14(10), 19561–19581 (2014)
  • [23] Zhang, B., Gao, Y., Zhao, S., Liu, J.: Local derivative pattern versus local binary pattern: face recognition with high-order local pattern descriptor. IEEE transactions on image processing 19(2), 533–544 (2010)