Fractal and Multi-Scale Fractal Dimension analysis: a comparative study of Bouligand-Minkowski method

01/16/2012 ∙ by André Ricardo Backes, et al. ∙ Universidade de São Paulo 0

Shape is one of the most important visual attributes to characterize objects, playing a important role in pattern recognition. There are various approaches to extract relevant information of a shape. An approach widely used in shape analysis is the complexity, and Fractal Dimension and Multi-Scale Fractal Dimension are both well-known methodologies to estimate it. This papers presents a comparative study between Fractal Dimension and Multi-Scale Fractal Dimension in a shape analysis context. Through experimental comparison using a shape database previously classified, both methods are compared. Different parameters configuration of each method are considered and a discussion about the results of each method is also presented.



There are no comments yet.


page 3

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Introduction

In pattern recognition and image analysis, shape is one of the most important visual attributes to characterize objects. It provides the most relevant information about an object in order to perform its identification and classification. It is a classical problem, and literature presents a large amount of techniques to extract information related to shape geometric aspect, allowing to separate and to label different parts of an image [11, 8].

An approach widely used in shape analysis applications is the study of shape through its complexity. In this analysis, complexity is straight related to the irregularity pattern presented by the shape under analysis and, respectively, the amount of the space the shape occupies [6, 1].

An interesting way to estimate the complexity of an object is using the Fractal Dimension [12]. Different of topological dimension, which is an integer number, the Fractal Dimension is a fractionary value that describes how irregular is an object and how much of the space it occupies. Bouligand-Minkowski method is one of the most accurate methods to compute Fractal Dimension. Is is based on the study of the shape influence area computed by shape dilation [16, 15].

An alternative method for estimating shape complexity is the Multi-Scale Fractal Dimension. It consists in estimating a curve that represents the changes in complexity shape as we change the visualization scale. Different of Fractal Dimension, which is a numeric value, this approach produces a curve which performs a more accurate shape discrimination [16, 9, 10, 13].

In this paper, we propose to evaluate both Fractal Dimension and Multi-Scale Fractal Dimension in a shape analysis context. For this, a shape database is used and experiments are performed considering different configurations of each method. This paper is organized as follows: Section 2 describes the Fractal Dimension method while section 3 shows how to compute the Multi-Scale Fractal Dimension. In section 4 is showed how to use the Multi-Scale Fractal Dimension as a shape signature. The signature making process is based on Fourier analysis and it is described in section 5. Section 6 describes the methodology adopted by this work. Results and conclusion are finally given in sections 7 and 8, respectively.

2 Fractal Dimension

Fractal Dimension is a measure of how fragmented a fractal object is, and it may be understood as a characterization of its self-similarity [12]. It is a non-integer number that quantifies the density of fractals in the metric space and it is a way to identify how complex a fractal is, in order to compare it with another. Fractal geometry has various approaches to compute the Fractal Dimension of an object. These approaches can be classified as belonging to the Hausdorff-Besicovitch Dimension (like the BoxCounting and Dividers methods) or to the Bouligand-Minkowski Dimension (Minkowski Fractal Dimension method), where the last is the one which produces the most accurate and consistent results for Fractal Dimension [16, 15, 7]. Bouligand-Minkowski Fractal Dimension method is based on the study of the influence area created by shape dilation using a disc of radius (Figure 1). Small modifications on the shape produce modifications in the computed influence area [16, 15].

Figure 1: Dilation of a shape with a disc of radius .

Consider the shape under analysis, the dilation of , , is defined as the set of points in such distance from is smaller than or equal to :

This dilation can also be defined as:

where is a disc of radius . The influence area, , and the radius, , follows the relation:

So, the Fractal Dimension can be estimated as:

where is the Fractal Dimension estimated by the Bouligand-Minkowski method. Through line regression of log-log curve , it is possible to calculate a line with slope, where is the Fractal Dimension of the shape using the Bouligand-Minkowski method [16].

Computing the influence area of an object is a task of high processing cost. One possibility to optimize this task is using the Euclidean Distance Transform (), which attributes to the pixels of a binary image the minimum distance among pixels from the image object to the pixels from the image background. The distance function considered during the computing is the Euclidean distance, due its rotation invariance (Figure 2) [4, 14].

Figure 2: Euclidean Distance Transform () applied over an binary image.

3 Multi-Scale Fractal Dimension

The main problem with Fractal Dimension in nature shape characterization is that they are not fractals (or self-similar). All objects have a finite size, and it implies that their complexities go to zero as long as observation scale increases. An interesting interpretation of the fractal behavior of a shape is how the dilation, using the Bouligand-Minkowski method, occurs in different points of the shape. Some shapes allow their points to be freely dilated while in other points this dilation is saturated in some radius [16, 9].

This behavior gives to the Bouligand-Minkowski log-log curve a richness of details that can not be expressed by just a numeric value, as performed using line regression. Using the derivate, it is possible to find a function that binds the Fractal Dimension changes to the dilation radius changes (Figure 3) [9, 10, 13]. This function is called Multi-Scale Fractal Dimension (), and it is defined as:

where is the derivative of log-log curve calculated by Bouligand-Minkowski method.

(a) (b)
Figure 3: (a) log-log curve from Bouligand-Minkowski method. (b) Multi-scale Fractal Dimension.

In order to compute the , it is necessary to calculate the derivative of . This is performed using the derivative property of the Fourier Transform. This property allows to compute the derivative curve in the spectrum and it has a better performance when compared with numeric methods, once it considers all data points during the derivative computing [9, 13]. An important detail that requires attention is that derivative methods have a tendency to emphasize high frequency noise. So, it is necessary to use a low pass filter, like Gaussian filter, in order to reduced the this noise [7, 3]. The derivative based on Fourier Transform can be expressed as:


where and are, respectively, the logarithm of radius and the influences area from Bouligand-Minkowski method, is the frequency, is the imaginary number and

is the Gaussian function with standard deviation

[9, 10, 13].

Although the advantages of the derivative property of the Fourier Transform, some important aspects must be considered when computing the curve. These aspects regards to the log-log curve behavior and they are necessary to achieve a good shape discrimination. At first, it is necessary to provide a curve with a good sampling and uniform interval. This is performed when the initial points of the curve are not considered, once they present low sampling (Figure 4

), followed by a linear interpolation, which is performed by filling the space between each two points of the sampled curve by its average point.

Figure 4: In light gray: low sampling region where the points are not considered.

Another problem present in the derivative property of the Fourier Transform is the discontinuity of the method in the curve limits (Figure 5). This phenomenon is known in literature as Gibbs phenomenon [3]. This phenomenon is due to the fact of the Fourier transform do not converge uniformly in discontinuities.

Figure 5: Gibbs phenomenon in the curve limits.

An effective solution for this problem is to use a scheme of duplication and reflexion of the curve, so that, it is possible to make it continuous (Figure 6). This scheme provides a continuous curve for the interval , where is the length of the original curve.

Figure 6: Curve duplication and reflexion scheme to provide a continuous curve.

4 Multi-Scale Fractal Dimension as a shape signature

Image signature is defined as a simplified function or matrix that is able to represent or characterize the original image. In general, techniques used for this task apply a or

transformation, where the computed feature vector represents the original image in a simplified way. More theory and examples can be found in

[10, 5].

This paper presents a study about the possibility of using the technique as a shape complexity signature. Different of true fractals, images from nature are not self-similar. They may look different whether we change the visualization scale and, consequently, their Fractal Dimension is also dependent on the used scale. The technique allows to study the shape behavior through scales. As a result, a 1-D function is calculated for a given shape, where this function describes how the Fractal Dimension of that shape changes as we change the scale (Figure 7).

Figure 7: curve as a shape signature.

5 Fourier Descriptors

The Fourier transform is a useful technique that allows to study the behavior of a signal through its frequency spectrum. It is widely used in pattern recognition tasks, once it presents a great number of advantages, such: noise tolerance, analysis of a signal into different groups of frequencies and easy data normalization, yielding data which is invariant to rotation, translation and scale [3, 2].

When it is applied over a signal , the Fourier transform obtains the complex components :

which are the signal representation in the frequency spectrum. This approach allows to split the original signal into different groups of frequencies, each one with distinct signal features.

Lower frequency coefficients, for example, are associated to the portion of the spectrum that describes the most relevant information about the signal behavior, whereas higher frequency coefficients possess information about noise and details presented in the signal [7]. For the , only the lower frequency coefficients are considered. The Fourier descriptors, , are computed from the magnitude of the selected low frequency coefficients:

A normalization is also performed in such descriptors, and it is done as follows:

This normalization process increases the tolerance of the descriptors to disturbances in the original signal, such as changes in scale, translation and rotation.

6 Experiment

Considering the possibility of using the as a shape signature, an experiment based on image classification is performed to evaluate its efficiency.

An image database with 1352 images previously classified into 26 classes of 52 images is employed. Each class corresponds to an uppercase letter of the occidental alphabet (26 letters latin alphabet). Images from each class are grouped into 4 groups of 13 images, where each group corresponds to a different random noise level. This allows to evaluate the noise tolerance of the method.

Figure 8: Example of noise levels: (a) Original Shape; (b) Level 1; (c) Level 2; (d) Level 3; (e) Level 4.

In order to achieve the best performance of the method, various parameters values have been tested. These parameters refers to the dilation radius used in Bouligand-Minkowski method and the smoothing level used in the calculation. For each different configuration, a new curve is computed for each sample in the image database. An average curve is also computed for each class using 12 samples of it (3 samples for each noise level), and these used curves are discarded in order to not affect the classification results. So, the test database is compound of 1040 image, divided into 26 classes of 40 images. Each class is also divided into 4 noise levels, each one with 10 images. Classification process is performed by Euclidean distance, where it is verified the distance among the remainders curves and the average .

7 Results and Discussion

In this section some properties of curve are discussed, as well as its performance according to its parameter. Besides, we also compare the results from and Fractal Dimension, and the advantages and disadvantages of using Fourier Descriptors from curve for shape classification.

7.1 Method parameters analysis

In order to attain the best method performance, various parameters values have been tested. The tested parameters values are and . The results for each different configuration showed the characteristics and properties present in curve.

We note an increase in success rate as increases (Figure 9). However, the relation between success rate and the dilation radius presents a log-curve behavior. So, from an specific radius value, the success rate becomes constant. The explanation for this behavior lies in the fact that as increases, the dilation produced by Bouligand-Minkowski method makes the shape aspect becomes more similar to a point, whose Fractal Dimension is equal to zero. This behavior is represent in curve as its tendency to zero according to increase. From an specific , none relevant information about the shape is added to curve, except for its zero-tendency, what keeps the success rate stable. We also note that this behavior do not depends on value (Figure 10).

Besides parameter, which refers to dilation radius, we realize that value also carry out some influence over the success rate. The parameter refers to the smoothing level of curve and it is responsible for the presence of more or less details in the curve (Figure 11). We note higher values for this parameter produce a excessive smoothing and it suppresses important details of the curve. As a consequence, the success rate decreases, independent of the value of parameter (Figure 12).

In this experiment, the best performance is achieved when used and .

Figure 9: success rate as radius increases.
(a) (b)
(c) (d)
Figure 10: Example of computed for different dilation radius , using : (a) ; (b) ; (c) ; (d) .
(a) (b)
(c) (d)
Figure 11: Example of computed for different values, using : (a) ; (b) ; (c) ; (d) .
Figure 12: success rate for different values.
A 38 2
B 40
C 40
D 40
E 38 1 1
F 39 1
G 40
H 40
I 40
J 36 1 1 2
K 36 4
L 1 35 4
M 40
N 30 10
O 40
P 40
Q 40
R 6 34
S 40
T 11 27 2
U 1 39
V 40
W 40
X 38 2
Y 1 39
Z 4 1 35
Table 1: Confusion Matrix for Multi-Scale Fractal Dimension method, considering and . Success rate = 94.62 %.
A 11 7 19 3
B 14 16 4 6
C 24 6 6 4
D 1 11 3 18 3 2 2
E 1 7 5 4 1 10 12
F 39 1
G 8 1 9 18 2 2
H 4 13 10 6 1 2 2 2
I 40
J 30 5 5
K 3 3 3 3 1 20 6 1
L 19 9 9 3
M 36 4
N 4 4 7 1 8 10 6
O 2 22 2 2 4 4 4
P 13 14 12 1
Q 3 37
R 1 8 1 1 17 5 5 2
S 2 2 17 3 16
T 5 8 19 8
U 4 2 6 20 8
V 6 2 2 30
W 2 38
X 6 1 33
Y 1 9 30
Z 23 3 1 3 10
Table 2: Confusion Matrix for Fractal Dimension method, considering . Success rate = 47.40 %.

7.2 Fractal Dimension versus Multi-Scale Fractal Dimension

As previously discussed, the main problem with Fractal Dimension in shape characterization is that it does not work properly with real world restrictions. As we increase the visualization scale, the Fractal Dimension of the object goes to zero. In the other hand, curve emphasizes the details present in log-log curve computed from Bouligand-Minkowski method. It allows to produce a more detailed representation of how complexity changes according to the scale. This representation is much more consistent for analysis and less dependent on noise interferences than a simple numeric value as calculated by line regression.

Figure 13: and success rates as dilation radius increases.

Figure 13 shows the classification results for both methods, Fractal Dimension () and Multi-Scale Fractal Dimension (). For this experiment, various radius values have been tested and, in case, an was considered.

We note a higher success rate for when compared with , independent of radius values used. In method, only an numeric value is calculated from and used for shape characterization. This becomes the method more sensitive to small variations or noise in the shape. The results from are also more unstable than . The explanation for this lies in the zero-tendency presented in the Bouligand-Minkowski method. As the dilation radius increases the Fractal Dimension of any object goes to zero, what becomes the classification process more difficult and inaccurate.

Tables 1 and 2 show, respectively, the confusions matrix for and when considered all images in the test database and .

7.3 Noise Tolerance

An important characteristic to be evaluated in any method is its capacity to work properly with samples that present sort of distortion or noise. In general, images under analysis may present distortions from measuring errors (measures from physical world), or noise added to original data (e.g., interference added during data transmission), that difficults the analysis and classification processes.

By analysis of the results from curve, we note this method present a good noise tolerance. This tolerance comes from using a Gaussian low filter during the computing of curve. The Gaussian filter acts reducing the importance of the informations in the high frequency region of the spectrum. Once noise is a typical high frequency information, it is removed from the original data through this data filtering process. We also note this filtering process presents a better performance on images that present a intermediate noise level, i.e., levels 2 and 3 of the used noise (Figures 8c and 8d, respectively) . In other hand, we also realize a stressed decrease in the performance of the method as the noise exceed these noise levels. High noise levels change the geometric patterns in the shape, what difficults the classification process and, as a consequence, decreases the success rate of the method (Figure 14).

Figure 14: success rate at different noise levels and dilation radius.

7.4 Analysis using Fourier descriptors

As previously discussed, the main advantage of curve in pattern recognition is its capacity to represent an object by a curve that binds its complexity along the visualization scale. Nevertheless, the curve presents a high cost to be evaluated once its number of descriptors increase as the radius increases (Figure 15). Besides that, not all information in the curve is relevant for shape classification.

Figure 15: Number of descriptors in the curve for the radius used.

In order to simplify the analysis process the Fourier transform is applied over curve, so descriptors that represent the main behavior of the curve are computed, as mentioned in Section 5.

One important point to be considered is the number of descriptors to be computed. The number of descriptors have been defined through analysis of the Euclidean distance between the original curve and its respective set of descriptors. Figure 16 shows the variation of this distance as we increase the number of Fourier descriptors. We realize as we increase the number of descriptors smaller becomes the distance between the curve and its descriptors. However, this distance becomes stable when the number of descriptors is higher than 50. This shows that most of the main information lies in the 50 first descriptors.

Figure 16: Distance between and its Fourier descriptors as the number of computed descriptors increase.

Once it was defined the number of descriptors that best represent the curve behavior, the performance of these descriptors was compared with the original curve in the shape classification experiment. For this, we considered the curve calculated for different dilation radius, , and . Figure 17 shows the success rate for both Fourier descriptors and curve.

Figure 17: and its Fourier descriptors success rate.

Using Fourier descriptors is a way to simplify the analysis step, once it reduces the amount of information to be analyzed. Otherwise, suppressing part of curve information, specially high frequency information, produces a smoothing effect over the curve, which decreases the distance between classes and, consequently, decrease the success rate. In spite of the large amount of information discarded during the process of Fourier descriptors computing, the decrease in success rate is minimum for . This range of values is where the method has presented its best results, and it becomes evident the great capacity of curve, in association with Fourier descriptors, to shape classification.

8 Conclusion

This paper have presented an experimental comparison between Fractal Dimension and Multi-Scale Fractal Dimension in a shape analysis context. Fourier descriptors have also been performed over Multi-Scale Fractal Dimension curve in order to simplify the analysis step. A previously classified shape database was employed and different configurations of each method have been tested.

Results have shown that Multi-Scale Fractal Dimension performs a more accurate discrimination of the shapes. It also presents a ”good” noise tolerance, what is corroborated by experimental results. Experiments also have shown that Fourier descriptors computed from Multi-Scale curve hold the main information of the curve, with minimal loss of information, while it reduces drastically the number of descriptors necessary to shape classification.


  • [1] G. D. and Lange and W. B. Marks. Fractal methods and results in cellular morphology - dimensions, lacunarity and multifractals. Journal of Neuroscience Methods, 69(2):123–136, November 1996.
  • [2] R. N. Bracewell. The Fourier Transform and Its Applications. McGraw-Hill, New York, 3rd edition, 2000.
  • [3] E. Oran Brigham. The Fast Fourier Transform and its applications. Prentice Hall, Englewood Cliffs, NJ, USA, 1988.
  • [4] Odemir Martinez Bruno and Luciano da Fontoura Costa. A parallel implementation of exact euclidean distance transform based on exact dilations. Microprocessors and Microsystems, 28(3):107–113, 2004.
  • [5] Kenneth R. Castleman. Digital Image Processing. Prentice Hall, 1996.
  • [6] B. B. Chaudhuri and Nirupam Sarkar. Texture segmentation using fractal dimension. IEEE Trans. Pattern Anal. Mach. Intell, 17(1):72–77, 1995.
  • [7] L. da F. Costa and R. M. Cesar, Jr. Shape Analysis and Classification: Theory and Practice. CRC Press, December 2000.
  • [8] R. da S. Torres, A.X. Falc ao, and L. da F. Costa. A graph-based approach for multiscale shape analysis. Pattern Recognition, 37:1163–1174, 2003.
  • [9] C. W. Emerson, N. N. Lam, and D. A. Quattrochi. Multi-scale fractal analysis of image texture and patterns. Photogrammetric Engineering and Remote Sensing, 65(1):51–62, January 1999.
  • [10] Rafael C. Gonzalez and Richard E. Woods. Digital Image Processing. Prentic-Hall, New Jersey, 2nd edition, 2002.
  • [11] S. Loncaric. A survey of shape analysis techniques. Pattern Recognition, 31(9):983–1001, 1998.
  • [12] Benoit Mandelbrot. The fractal geometry of nature. Freeman & Co., 2000.
  • [13] R. O. Plotze, J. G. Padua, M. Falvo, M. L. C. Vieira, G. C. X. Oliveira, and O. M. Bruno. Leaf shape analysis by the multiscale minkowski fractal dimension, a new morphometric method: a study in passiflora l. (passifloraceae). Canadian Journal of Botany-Revue Canadienne de Botanique, 83:287–301, 2005.
  • [14] Toyofumi Saito and Jun ichiro Toriwaki. New algorithms for euclidean distance transformation of an n-dimensional digitized picture with applications. Pattern Recognition, 27(11):1551–1565, 1994.
  • [15] Manfred Schroeder. Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise. W. H. Freeman, 1996.
  • [16] C. Tricot. Curves and Fractal Dimension. Springer-Verlag, 1995.