Abrupt skin lesion border cutoff measurement for malignancy detection in dermoscopy images
BMC Bioinformatics volume 17, Article number: 367 (2016)
Abstract
Background
Automated skin lesion border examination and analysis techniques have become an important field of research for distinguishing malignant pigmented lesions from benign lesions. An abrupt pigment pattern cutoff at the periphery of a skin lesion is one of the most important dermoscopic features for detection of neoplastic behavior. In the current clinical setting, the lesion is divided into a virtual pie with eight sections. Each section is examined by a dermatologist for abrupt cutoff and scored accordingly, which can be tedious and subjective.
Methods
This study introduces a novel approach to objectively quantify the abruptness of pigment patterns along the lesion periphery. In the proposed approach, first, the skin lesion border is detected by the density-based lesion border detection method. Second, the detected border is gradually scaled through vector operations. Then, along the gradually scaled borders, pigment pattern homogeneities are calculated at different scales. Through this process, statistical texture features are extracted. Moreover, different color spaces are examined for their efficacy in texture analysis.
Results
The proposed method has been tested and validated on 100 (31 melanoma, 69 benign) dermoscopy images. The results indicate that the proposed method is effective at malignancy detection. More specifically, we obtained a specificity of 0.96 and a sensitivity of 0.86 for malignancy detection in a certain color space. The F-measure, the harmonic mean of recall and precision, of the framework is reported as 0.87.
Conclusions
The use of texture homogeneity along the periphery of the lesion border is an effective method to detect malignancy of the skin lesion in dermoscopy images. Among the different color spaces tested, the blue channel of the RGB color space is the most informative channel for detecting malignancy in skin lesions. It is followed by the Cr channel of the YCbCr color space, which is closely followed by the green channel of the RGB color space.
Background
Skin cancer is one of the most prevalent cancer types in the United States, and its prevalence is increasing dramatically [1]. Each year the number of patients being diagnosed rises compared to the previous year. The most common skin cancer type for young adults between 25 and 29 years old is melanoma [1], which can lead to metastatic disease, serious complications from surgery such as scarring and deformity, and death.
Dermoscopy is a method in which the features of pigmented and nonpigmented skin lesions are examined by health professionals with a handheld device known as a dermatoscope. Dermoscopy is also known as dermatoscopy, skin-surface microscopy, or epiluminescence microscopy (ELM) [2, 3]. The ultimate goal of dermoscopy is the early diagnosis of malignant lesions, especially melanomas, by distinguishing them from benign lesions. In order to differentiate malignant skin lesions from benign ones, the abnormal structural features and the borderline of nonpigmented skin lesions need to be taken into account.
Because the microstructure of the epidermis, the dermo-epidermal junction, and the papillary dermis is not detectable by the naked eye, dermoscopy is an invaluable asset for diagnosing pigmented skin lesions in particular. According to Vestergaard et al. [4], in the diagnosis of melanoma, the use of dermoscopy provides more accurate results than clinical evaluation by the naked eye alone.
Potential signs and symptoms of melanoma can be identified by taking the ABCD (Asymmetry, Border, Color, Dermoscopic features) rule [5] into account during examination of a skin lesion. The ABCD rule assesses geometric and morphologic variables such as asymmetry, color, and border of a given melanocytic lesion [5]. Each of the A, B, C, D features has a preassigned clinical weight factor (e.g. Border's weight factor is 0.1). B stands for border, and it indicates abrupt cutoff in the region of interest (a virtual pie, see Fig. 1). An abrupt cutoff is a region where the lesion has sharp circumscription. A virtual pie refers to sections (pie pieces) of the lesion circumference. Figure 1 illustrates a virtual pie and marks (e.g. with asterisks) the pie pieces with sharp lesion circumscription. For instance, B scores 2 × 0.1 in Fig. 1, since there are two pie pieces marked with abrupt cutoff (asterisks). A lesion's B value is counted if abrupt cutoff exists over at least one quarter of the lesion circumference. Owing to its ease of application, the ABCD rule is also recommended for use by clinicians who are not fully trained in dermoscopic observation [6]. The reliability of the ABCD rule [7] has been tested by Nachbar et al. [5], who reported that diagnostic accuracy for melanoma was 80.0 % using the ABCD rule, compared to 64.4 % by the naked eye. The assessment of asymmetry, border, color, and differential structure in the ABCD rule is used to build a total dermoscopy score (TDS). Each of these features contributes to the TDS according to its preassigned clinical weight (A's weight is 1.3, B's weight is 0.1, C's weight is 0.5, and D's weight is 0.5). A TDS value of less than 4.75 indicates that the lesion is benign; a value between 4.8 and 5.45 marks the lesion as suspicious for malignancy; and a TDS greater than 5.45 indicates that the lesion is malignant [7].
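The TDS computation above can be sketched as follows. The weights and thresholds are taken from the text; the sub-scores in the example are hypothetical values for illustration. Note the text leaves the 4.75 to 4.8 interval unassigned; this sketch treats it as suspicious.

```python
# Hedged sketch of the ABCD total dermoscopy score (TDS).
# Weights per the text: A = 1.3, B = 0.1, C = 0.5, D = 0.5.

def tds_score(asymmetry, border, color, dermoscopic):
    """Combine the four ABCD sub-scores with their clinical weights."""
    return 1.3 * asymmetry + 0.1 * border + 0.5 * color + 0.5 * dermoscopic

def classify_tds(tds):
    """Thresholds per the text: < 4.75 benign, up to 5.45 suspicious,
    > 5.45 malignant (the 4.75-4.8 gap is treated as suspicious here)."""
    if tds < 4.75:
        return "benign"
    if tds <= 5.45:
        return "suspicious"
    return "malignant"

# Example: two pie pieces with abrupt cutoff give B = 2 (as in Fig. 1);
# the other sub-scores are hypothetical.
score = tds_score(asymmetry=2, border=2, color=3, dermoscopic=2)
print(score, classify_tds(score))  # -> 5.3 suspicious
```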
A clinical study conducted on 44,258 histopathologically examined skin neoplasms [8] shows that sensitivity in the clinical diagnosis of malignant melanoma is 70.1 %. Even though there have been improvements in the clinical diagnosis of melanoma, melanomas have different clinical accuracies based on their subtypes. For instance, a study by Lin et al. [9] shows that melanoma subtypes of superficial spreading melanomas (SSM), acral lentiginous melanomas (ALM), nodular melanomas (NM), and desmoplastic melanomas (DM) have different clinical diagnosis sensitivity rates. Sensitivity rates of these melanoma subtypes are reported as 77 % for SSM, 73 % for ALM, 41 % for NM, and 21 % for DM.
Moreover, the diagnostic accuracy of a dermatologist depends significantly on the examiner's degree of experience. Thus, interobserver variability in the diagnosis of pigmented skin lesions amongst dermatologists is an important aspect of clinical diagnostic accuracy. Tan et al. [10] report kappa value differences of up to 0.59 between dermatologists diagnosing the same melanoma cases. In the same study [10], the interobserver difference reaches a kappa value of up to 0.39 for benign lesions. The kappa value is a statistical measure of interrater agreement: perfect agreement between two dermatologists on the diagnosis of the same cases results in a kappa value of 1.0, whereas agreement no better than chance results in a kappa value of 0.0.
One of the criteria for detecting melanoma is abrupt cutoff (abrupt edge). In current clinical practice, in order to detect the abrupt cutoff feature for malignancy detection, the lesion is segmented into a virtual pie with eight pieces (see Fig. 1), each of which a dermatologist examines for abrupt cutoff and scores. This process is manual, tedious, and oversimplified, and it is subjective, depending on the experience of the dermatologist examining the lesion. To address this challenge, in this study we offer a novel approach to quantify abrupt cutoff along the periphery of the skin lesion border. Instead of analyzing eight virtual pie pieces, we scan the whole inner region along the lesion border and quantify each region. To do that, the first step is to accurately define the lesion border.
There are different lesion border detection methods in the literature [11]. We use our density-based lesion detection method (DBLD) [12, 13] for skin lesion border detection. Then the Freeman chain code [14] is used to extract extreme border pixels from the detected lesion border. After that, shrinking (contraction) and shifting operations are applied to the detected border to identify an inner border contour. This contour (the shadow border) is needed for texture homogeneity analysis; a shadow border is a contracted border of the lesion. Shadow borders are found by using vector operations and dynamic scaling operations. Then, along the shadow borders, the pigment patterns' statistical texture measures and statistical texture features, especially homogeneity, are computed at various scales in various color spaces. Different color models and color channels are used to investigate which color model is more effective in dermoscopy image analysis. The reader is referred to [15] for details on the analysis of color models and color channels in biomedical image processing.
Textures in an image refer to visual patterns with different characteristics such as color, brightness, slope, size, uniformity, roughness, regularity, randomness, granulation, fineness, and coarseness. Texture analysis methods are mainly structural, statistical, model-based, or transform-based [16]. Structural methods define textures at the micro- and macro-structural levels, with no clear distinction between the two; they fit texture synthesis better than texture analysis. Statistical methods attempt to capture the non-deterministic properties of an image; more specifically, they capture the distributions and relationships of grey levels. The most popular statistical texture analysis is based on the co-occurrence matrix, and three of the most common statistical texture analysis methods are based on the Gibbs random field, the Gaussian random field, and the Markov random field (MRF) models [17, 18]. Model-based methods use fractal and stochastic models; however, parameter estimation in both fractal- and stochastic-based models is a major bottleneck. Transform-based methods use transformation functions such as the Fourier, Gabor, and wavelet transforms; in this group of methods, the image is represented in a different coordinate space. Among these, the Fourier transform performs poorly because spatial localization information is lost after the transform. Gabor filters provide better spatial localization but need multiple filter resolutions to capture spatial structures such as natural textures, which makes their use impractical in most cases. Wavelets, on the other hand, represent textures at varying spatial resolutions with varying wavelet functions; however, the wavelet transform is not translation invariant.
In a recent study [19], investigators used 95 texture descriptors for melanoma detection in dermoscopy images. However, they obtained at most 83 % specificity and 53 % sensitivity on grayscale images. They also found that the image's uniformity is the best descriptor, with a standalone accuracy of 70 %.
Methods
Dermoscopy image analysis
Preprocessing
We describe the essential image processing operations that make the dermoscopy image and its mask (the whole lesion, represented as black pixels in the mask) ready for the subsequent steps. These operations include color space transformations, lesion border detection with the density-based lesion detection method [12, 13], the complement operation, clearing the region remaining outside of the rectangle (region of interest), and morphological opening operations. The flow diagram of the preprocessing phase is shown in Fig. 2.
In the color transformation phase, the RGB dermoscopy image is transformed into different color spaces: grayscale, YCbCr, and HSV, respectively (see Fig. 3). In the next step, in order to obtain the mask from the dermoscopy image, the density-based lesion detection method [12, 13] is used. These steps are illustrated in Fig. 4.
Moreover, since the mask is a negative version of a regular standardized mask, the complement operation is applied to the mask to generate a standardized mask. Some of the images' masks include noisy pixels outside the region of interest. Thus, we clear the region outside the region of interest (outside the rectangle that encapsulates the lesion). To find this rectangular region, we use equations (Eqs. 1.1 to 1.4) that incorporate a percentage-clipping parameter for the left, right, top, and bottom of the mask.
In the same manner, the pixel locations of the right, top, and bottom boundaries are determined by these equations. The percentage-of-edges-to-clip parameter is determined empirically for our dataset, based on the observation that the lesion always lies at the center of dermoscopy images. Thus, a boundary rectangle tightly encapsulating the lesion can be drawn around the region, as seen in Fig. 5. The region remaining outside this rectangle is discarded from further processing. This provides a twofold benefit: first, it reduces overall computation time; second, it removes noise and eventually increases the accuracy of the results. Here, noise refers to clustered regions that are separated from the lesion.
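Since Eqs. 1.1 to 1.4 are not reproduced here, the following sketch illustrates one plausible form of the edge-clipping step; the `clip_pct` parameter is a stand-in for the paper's empirically chosen percentage-of-edges-to-clip value.

```python
import numpy as np

def clip_rectangle(mask, clip_pct=0.05):
    """Return (top, bottom, left, right) bounds after clipping a fixed
    percentage of the mask's edges; clip_pct is a hypothetical value."""
    h, w = mask.shape
    ch, cw = int(h * clip_pct), int(w * clip_pct)
    return ch, h - ch, cw, w - cw

def discard_outside(mask, clip_pct=0.05):
    """Zero out everything outside the clipped rectangle (treated as noise)."""
    t, b, l, r = clip_rectangle(mask, clip_pct)
    out = np.zeros_like(mask)
    out[t:b, l:r] = mask[t:b, l:r]
    return out

# Hypothetical 100x100 binary mask; 5 % is clipped from each edge.
mask = np.ones((100, 100))
clipped = discard_outside(mask, 0.05)
```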
The final step of the preprocessing phase is the morphological opening operation [20], which aims to remove the remaining noisy pixels from the mask. Opening simply removes small objects from the foreground of the image and puts them back into the background. The mathematics behind the morphological opening operation is described by the following equation:

$$A\circ B=\left(A\ominus B\right)\oplus B$$
where A denotes an image, B a structuring element, and the two operation symbols an erosion (⊖) and a dilation (⊕), respectively. Dilation is a morphologic operation that causes objects to dilate or grow in size, whereas erosion causes objects to shrink. The amount of growth or shrinkage depends on the choice of the structuring element, which is what differentiates dilation/erosion from a classical low-pass filter. Two well-known common structuring elements (given a Cartesian grid) are the 4-connected and 8-connected sets, N_{4} and N_{8} respectively. The morphological opening operation is applied to the mask; this opening after the edge clipping operation (given in Eqs. 1.1 to 1.4) is illustrated in Fig. 5. Noise is caused by artifacts such as hair and water in dermoscopy images. When the image is segmented by DBLD, some of these artifacts are also clustered as connected or disconnected parts of the lesion. In order to remove these regions from further processing, we use a commonality of every dermoscopy image: the lesion lies approximately at the center of the image. The largest cluster including the vicinity of the image center is our region of interest; the other clusters are marked as noise and discarded from further processing. To discard the noisy regions, edge clipping and morphological operations are used.
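A minimal NumPy implementation of binary erosion, dilation, and opening with the N_8 structuring element, written from the standard definitions rather than from the authors' code:

```python
import numpy as np

def erode(img, se):
    """Binary erosion: output 1 where the structuring element fits entirely."""
    h, w = img.shape
    sh, sw = se.shape
    padded = np.pad(img, ((sh // 2, sh // 2), (sw // 2, sw // 2)))
    out = np.zeros_like(img)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.all(padded[i:i + sh, j:j + sw][se == 1] == 1)
    return out

def dilate(img, se):
    """Binary dilation: output 1 where the element overlaps any foreground."""
    h, w = img.shape
    sh, sw = se.shape
    padded = np.pad(img, ((sh // 2, sh // 2), (sw // 2, sw // 2)))
    out = np.zeros_like(img)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.any(padded[i:i + sh, j:j + sw][se == 1] == 1)
    return out

def opening(img, se):
    """A opened by B: erosion followed by dilation, (A ⊖ B) ⊕ B."""
    return dilate(erode(img, se), se)

# 8-connected structuring element N8 (a full 3x3 neighborhood).
N8 = np.ones((3, 3), dtype=np.uint8)

# Hypothetical mask: a 5x5 lesion block plus one isolated noise pixel.
img = np.zeros((10, 10), dtype=np.uint8)
img[2:7, 2:7] = 1
img[9, 0] = 1
res = opening(img, N8)  # the isolated pixel is removed, the block survives
```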
Boundary pixels detection
The chain code was first proposed by Herbert Freeman [14]; thus, it is also known as the Freeman code or Freeman chain code in the literature. He proposed that an arbitrary geometric curve can be represented on a rectangular grid. The idea behind the chain code is to identify a binary object representation by encoding its boundary. In a continuous curve, consecutive points are adjacent to each other, so each point depends on the previous one, which limits the next point's location. Therefore, the next point can occupy only 8 possible locations, encoded as the numbers 0 to 7 on the rectangular grid. Each number refers to a transition direction between two consecutive points. As seen in the rectangular grid (see Fig. 6), the numbers increase in the counterclockwise direction.
To interpret the numbers, one walks along the pixels of the object's boundary, starting and ending at the same pixel. For instance, for the digital boundary given in Fig. 6, walking around the shape in the clockwise direction results in the number sequence {0,7,6,6,5,4,3,3,1,1}. This sequence is simply an encoded form of the boundary.
The goal of this stage is to determine the extreme pixels of the lesion border. The idea behind this approach is to first scan through all pixels and record those belonging to the foreground. Then, the minimum row (in the vertical direction of the image mask) is found among the recorded pixels. With the help of this minimum row value, the lowest column value among the pixels in that row is detected in the image mask. This gives the row and column of the starting pixel for the chain code. Once the starting pixel is determined, the chain code generates a list of clockwise or counterclockwise adjacent pixels. The starting pixel is shown in Fig. 6b. By applying the chain code, the boundary of the region in a dermoscopy image is captured as depicted in Fig. 7.
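The Freeman encoding described above can be sketched as follows, assuming the boundary has already been traced into an ordered list of 8-connected pixels; the small square example is hypothetical.

```python
# Freeman 8-direction offsets in (row, col) image coordinates;
# code 0 points east and codes increase counterclockwise, as in Fig. 6.
DIRECTIONS = {
    (0, 1): 0, (-1, 1): 1, (-1, 0): 2, (-1, -1): 3,
    (0, -1): 4, (1, -1): 5, (1, 0): 6, (1, 1): 7,
}

def freeman_chain_code(boundary):
    """Encode an ordered, closed list of 8-connected boundary pixels.
    boundary: list of (row, col) tuples; consecutive pixels must be adjacent."""
    codes = []
    # Pair each pixel with its successor, wrapping back to the start pixel.
    for (r0, c0), (r1, c1) in zip(boundary, boundary[1:] + boundary[:1]):
        codes.append(DIRECTIONS[(r1 - r0, c1 - c0)])
    return codes

# Hypothetical 2x2 square walked clockwise from the top-left pixel.
square = [(0, 0), (0, 1), (1, 1), (1, 0)]
print(freeman_chain_code(square))  # -> [0, 6, 4, 2]
```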
Feature extraction
For feature extraction, we developed three different approaches: texture feature extraction along the lesion border (detailed in section 2.1), along the inner border found by utilizing vector operations towards the center of the lesion's mass (detailed in section 2.2), and along the inner aligned/shifted border found by utilizing a scaling factor (detailed in section 2.3), respectively. From these three methods, we extract statistical measures and statistical texture features of dermoscopy images and compare each method's accuracy for malignancy detection. The statistical measures considered here are the mean and standard deviation [21], whereas the Gray-Level Co-occurrence Matrix (GLCM) [20] serves as the texture descriptor in the texture analysis.
The gray-level co-occurrence matrix (GLCM) [20] is a statistical method that scrutinizes texture characteristics relying on the spatial relationship between pixels. The GLCM is also referred to as the gray-level spatial dependence matrix or co-occurrence distribution. The texture of an image can be represented with GLCM functions, where the frequencies of specific pairs of pixel values are computed and the spatial relationships arising in the image are captured by the GLCM. Thus, statistical measures can be extracted from the GLCM. Mathematically, the gray-level co-occurrence matrix C can be described over an image for specific offset values, as given in Eq. 2.1:

$$C_{\Delta x,\Delta y}\left(i,j\right)=\sum_{p=1}^{n}\sum_{q=1}^{m}\begin{cases}1, & \text{if } I\left(p,q\right)=i \text{ and } I\left(p+\Delta x,\;q+\Delta y\right)=j\\ 0, & \text{otherwise}\end{cases} \qquad \left(2.1\right)$$
where C is the co-occurrence matrix, I is the image of size n × m, i and j are image intensity values, p and q are spatial coordinates in the image, and (∆x, ∆y) is an offset parameter that is a function of the direction θ and the distance d. The co-occurrence matrix is sensitive to rotation due to the offset parameters. In order to create the GLCM, the graycomatrix function [22] is used, which constructs the gray-level co-occurrence matrix. The GLCM is a square matrix whose numbers of columns and rows equal the number of gray levels in the image. The matrix element C_{Δx,Δy}(i, j) corresponds to the relative frequency of two pixels with intensity values i and j, respectively, separated by the pixel offset (∆x, ∆y).
Other spatial relationships can also be defined between the two pixels, such as ∆x = 2, ∆y = 1. The frequency with which pixel intensity value i occurs with respect to pixel intensity value j, under the particular spatial relationship, builds up each element (i, j) of the GLCM. The size of the GLCM is determined by the number of gray levels in the image.
Moreover, numerous statistical features can be acquired from the GLCM. Homogeneity measures how similar the gray levels are in the spatial distribution of the image. The homogeneity of an M × N image can be expressed with the equation below:

$$Homogeneity=\sum_{i}\sum_{j}\frac{C_{\Delta x,\Delta y}\left(i,j\right)}{1+\left|i-j\right|} \qquad \left(2.2\right)$$

In Eq. (2.2), the co-occurrence matrix is normalized by the number of pixel pairs, (M − ∆x)(N − ∆y), where M denotes the number of pixels in the vertical direction and N the number of pixels in the horizontal direction of the image.
In most cases, the spatial relationship between two pixels is defined as the pixel of interest and its right-adjacent pixel, which means ∆x = 1, ∆y = 0. For instance, suppose ∆x = 1, ∆y = 0 is the spatial relationship between two pixels of an exemplary 5 × 5 image as given below:
IMAGE (5 × 5 with given intensity values)
Then the corresponding cooccurrence matrix will be a 4 × 4 matrix as given below:
Co-occurrence matrix (4 × 4; there exist 4 different intensities in the image)
For instance, the (0,0) element of the co-occurrence matrix corresponds to the frequency with which two neighboring pixels in the entire image both have intensity value 0, under the defined spatial relationship ∆x = 1, ∆y = 0. This corresponds to the number of times that a 0-intensity pixel's right-hand neighbor also has intensity value 0. This relation never occurs in the sample 5 × 5 image; thus, the co-occurrence matrix's (0,0) element will be 0. The term 1/20 comes from (M − ∆x)*(N − ∆y) in Eq. 2.2. For the given 5 × 5 image example, M = 5, N = 5, ∆x = 1, and ∆y = 0.
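A minimal sketch of Eqs. 2.1 and 2.2, using a hypothetical 5 × 5 image (the original example image is not reproduced in the text) and normalizing by (M − Δx)(N − Δy) as discussed above:

```python
import numpy as np

def glcm(image, dx=1, dy=0, levels=4):
    """Normalized gray-level co-occurrence matrix for offset (dx, dy).
    Entry (i, j) is the relative frequency of a pixel with intensity i
    whose (dx, dy)-neighbor has intensity j (Eq. 2.1)."""
    rows, cols = image.shape
    C = np.zeros((levels, levels))
    for r in range(rows - dy):
        for c in range(cols - dx):
            C[image[r, c], image[r + dy, c + dx]] += 1
    # Normalize by the number of pixel pairs, (M - dx)(N - dy): 20 for 5x5.
    return C / ((rows - dy) * (cols - dx))

def homogeneity(C):
    """GLCM homogeneity: large when co-occurring intensities are similar."""
    i, j = np.indices(C.shape)
    return float(np.sum(C / (1.0 + np.abs(i - j))))

# Hypothetical 5x5 image with 4 intensity levels (0-3).
img = np.array([[0, 1, 2, 3, 0],
                [1, 1, 2, 2, 3],
                [0, 2, 2, 3, 3],
                [3, 3, 2, 1, 0],
                [0, 1, 1, 2, 2]])
C = glcm(img, dx=1, dy=0)
print(C.shape, round(homogeneity(C), 3))
```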
1^{st} method: texture feature extraction on the boundary
The goal of this approach is to extract the features mentioned in the previous sections from the border of dermoscopy images by employing circular masks at different scales. In order to place circular masks on the border of dermoscopy images, the radius and the coordinates of the circle's center must first be determined. Hence, for the first circle's center, we pick the upmost pixel on the lesion border. This is illustrated in Figs. 8, 9, and 10.
The next step is feature extraction from where the circular mask, illustrated in Fig. 11, is located. Feature extraction is the computation of statistical features of a particular region. In our case, in order to obtain the region from the image, an element-by-element matrix multiplication is performed between a color channel of the image and the circular mask, as expressed in the following equation:

$$Circular\_region={I}_{channel}\circ Circular\_mask \qquad \left(2.3\right)$$

where ∘ denotes element-by-element multiplication.
The result of this element-by-element matrix multiplication is shown in Fig. 12. The obtained Circular_region can now be used for the feature extraction operations: mean, standard deviation, and homogeneity with the help of the GLCM.
In essence, this feature extraction cycle is performed in a particular image channel for each circle along the lesion border. The process continues for each circle until the cycle is completed, that is, until it returns to where it started (the upmost pixel). This method is applied to all 100 images with varying circle radii. Algorithm 1 summarizes the entire process below.
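Independently of Algorithm 1 (not reproduced here), the per-circle region extraction can be sketched as follows; the filled-circle mask construction is a straightforward implementation, not the authors' code.

```python
import numpy as np

def circular_mask(shape, center, radius):
    """Binary mask that is 1 inside a circle of the given radius."""
    rr, cc = np.indices(shape)
    inside = (rr - center[0]) ** 2 + (cc - center[1]) ** 2 <= radius ** 2
    return inside.astype(float)

def extract_circular_region(channel, center, radius):
    """Element-by-element product of a color channel and the circular
    mask, as in the region extraction step described above."""
    return channel * circular_mask(channel.shape, center, radius)

# Hypothetical 20x20 channel of ones; circle of radius 3 centered at (10, 10).
channel = np.ones((20, 20))
region = extract_circular_region(channel, (10, 10), 3)
```

The returned region can then be fed to the GLCM-based homogeneity, mean, and standard deviation computations.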
2^{nd} method: texture feature extraction on the inner border shifted by vector scaling
The purpose of this method, summarized in Algorithm 2, is to describe how the vector scaling operation and polygon techniques identify the intersecting polygon region and extract it from a dermoscopy image. First, the centroid of the segmented skin lesion is located and marked on the finalized dermoscopy image mask. A vector is defined between the starting point on the boundary of the mask and the centroid of the segmented skin lesion. The unit vector is then computed from this vector using Eq. (2.4). This unit vector is used to perform a vector operation, namely vector scaling, and to shift boundary points towards the centroid of the segmented skin lesion, as shown in Eqs. (2.5) and (2.6), respectively:

$$\overrightarrow{u}=\frac{centroid\left(x,y\right)-boundary\left(x,y\right)}{\left\Vert centroid\left(x,y\right)-boundary\left(x,y\right)\right\Vert } \qquad \left(2.4\right)$$

where centroid(x, y) is the center of mass of the skin lesion, boundary(x, y) is the starting point (upmost pixel) on the boundary of the skin lesion, and \( \overrightarrow{u} \) is the unit vector at the boundary directed towards the center. The vector scaling is given by

$$\overrightarrow{u}\cdot r \qquad \left(2.5\right)$$

where \( \overrightarrow{u} \) is the unit vector directed towards the center and r is the radius (a scalar). Furthermore, to perform the shifting operation on the boundary, the result of the vector scaling operation, \( \overrightarrow{u}\cdot r \), is simply added to the original boundary coordinates, boundary(x, y):

$$shifted\_boundary\left(x,y\right)=boundary\left(x,y\right)+\left(r\cdot {u}_x,\;r\cdot {u}_y\right) \qquad \left(2.6\right)$$

where r denotes the radius, and u_x and u_y refer to the x and y components of the unit vector \( \overrightarrow{u} \). Figure 13 illustrates how to create the vector between centroid(x, y) and boundary(x, y) and find a unit vector along that direction. Applying the shifting operation (which shrinks/contracts the boundary towards the centroid) to each point of the original boundary creates the dotted boundary illustrated in Fig. 14.
The next step is to investigate texture features of the circular regions along the contracted new boundary. As seen in Fig. 14, there are two boundaries, the actual lesion boundary and the contracted boundary, whose in-between distance is always r. We find the texture homogeneities and statistics between these two boundaries in circular regions whose centers are placed on the inner boundary. Those circular regions are equally spaced: the distance between circle centers is set to 3/2·r (i.e., r scaled 1.5-fold), measured using the chain code. For instance, if r is 2, then the next center will be 3 pixels away along the chain code. After locating all circular region centers on the inner boundary, we apply a polygon intersection operation between each circular region and the actual boundary. This helps us remove noisy data coming from pixels outside the actual boundary, which would reduce the automated diagnosis accuracy. Note that the intersecting region must lie inside the outer boundary, because the outside of the boundary is considered noise or outliers and is not incorporated into the intersecting region; such regions would insert noise into the region of interest and reduce the accuracy of the results. An exemplary polygon intersection operation between a circular region and the boundary is illustrated in Figs. 15 and 16. A mask of the intersecting polygon is generated for the further steps of the work; this mask is illustrated in Fig. 17.
In the last step of this method, in order to extract the intersecting polygon region, an element-by-element matrix multiplication is performed between the actual image and the intersecting polygon mask. This extracts the texture information of the corresponding region of interest, i.e., the pixels located at the coordinates of the mask's foreground elements. The extracted region is illustrated in Fig. 18. This matrix operation is described by the following equation:

$$Extracted\_region={I}_{channel}\circ Intersecting\_polygon\_mask \qquad \left(2.7\right)$$
Then, the statistical texture measures (mean and standard deviation) and the statistical texture features, particularly homogeneity, are computed over this polygon (i.e., the pixels where the mask and the image intersect). Figures 19 and 20 illustrate intersecting polygon feature extraction at different scales (different radii).
Additionally, some images have characteristics for which this method (shrinking the border towards the center of mass) simply fails while determining the intersecting polygon region. A sample case is shown in Fig. 21, where the extracted region falls entirely outside of the boundary. This is due to the imbalance of the mask, where the unit vector's definition becomes invalid. Another reason is that when the border is contracted towards the center of mass, many points may be mapped to a single point (the mapping is not necessarily one-to-one). This is why we developed a 3^{rd} method to overcome these deficiencies of the 2^{nd} method.
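The contraction step of Eqs. (2.4) to (2.6) can be sketched as follows; note that a boundary point coinciding with the centroid (a zero-length vector) would reproduce the degenerate case described above, so the sketch assumes no such point.

```python
import numpy as np

def contract_boundary(boundary, centroid, r):
    """Shift each boundary point a distance r toward the centroid:
    unit vector toward the center (Eq. 2.4), scaled by r and added to
    the original coordinates (Eqs. 2.5-2.6)."""
    boundary = np.asarray(boundary, dtype=float)
    centroid = np.asarray(centroid, dtype=float)
    vecs = centroid - boundary                      # vectors toward the centroid
    norms = np.linalg.norm(vecs, axis=1, keepdims=True)
    units = vecs / norms                            # unit vectors (Eq. 2.4)
    return boundary + r * units                     # scale and shift (Eqs. 2.5-2.6)

# Hypothetical square boundary around centroid (0, 0), contracted by r = 1.
pts = [(5.0, 0.0), (0.0, 5.0), (-5.0, 0.0), (0.0, -5.0)]
inner = contract_boundary(pts, (0.0, 0.0), 1.0)
print(inner)  # each point moves 1 unit toward the origin, e.g. (4, 0)
```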
3^{rd} method: texture feature extraction on the inner border shifted by dynamic scaling
The aim of this method is to scale down the boundary by means of a dynamic scaling approach, so that the original boundary is shifted inwards; with the help of circular regions, the polygon intersection is then determined, and features are calculated over the intersecting polygon region.
First of all, the centroid of the segmented lesion is located and marked on the finalized dermoscopy image mask. The novelty of this method lies in how the scaling factor is obtained. In the proposed approach, the segmented lesion and the scaled region are assumed to be perfect circles in a coordinate system, as shown in Fig. 22 on the right. Under this assumption (with the centroid taken as the origin, so that the scaled point is (sx, sy) = (s·x, s·y)), the radius is known, and the distance between the starting point (x, y) and the scaled point (sx, sy) can be written by means of the following Eq. (2.8):

$$r=\sqrt{{\left(sx-x\right)}^2+{\left(sy-y\right)}^2}=\left|s-1\right|\sqrt{x^2+{y}^2} \qquad \left(2.8\right)$$

Solving the distance Eq. (2.8) for the scaling factor gives (2.13), and the negative root (2.14) is taken because the operation is a scale-down (taking the inner border):

$$s=1\pm \frac{r}{\sqrt{x^2+{y}^2}} \qquad \left(2.13\right)$$

$$s=1-\frac{r}{\sqrt{x^2+{y}^2}} \qquad \left(2.14\right)$$

The next step is applying this scaling factor to each and every pixel of the lesion border.
Scaling [23] is a transformation by which the image size gets bigger or smaller. We are interested in scaling down the original image's lesion boundary, since the area outside the lesion boundary is considered noise/outliers (i.e., it reduces diagnostic accuracy). This transformation is expressed by Eqs. (2.15) and (2.16):

$${x}^{\prime }={s}_x\cdot x \qquad \left(2.15\right)$$

$${y}^{\prime }={s}_y\cdot y \qquad \left(2.16\right)$$

where x′ and y′ are the scaled-down x and y coordinates of a corresponding boundary pixel, and s_x and s_y are the scaling factors along the x and y axes in 2D.
This can also be written in matrix form with homogeneous coordinates, where the last column represents the translation (used for aligning the centers of the actual lesion border and the newly produced inner lesion border):

$$\left[\begin{array}{c}{x}^{\prime }\\ {y}^{\prime }\\ 1\end{array}\right]=\left[\begin{array}{ccc}{s}_x& 0& {t}_x\\ 0& {s}_y& {t}_y\\ 0& 0& 1\end{array}\right]\left[\begin{array}{c}x\\ y\\ 1\end{array}\right] \qquad \left(2.17\right)$$
After obtaining the scale factor, the new boundary coordinates boundary(x′, y′) can be calculated using Eq. (2.17), as illustrated in Fig. 23. Since the goal is to extract the regions inside the original boundary, the new boundary(x′, y′) has to be shifted (via the translation column of the transformation matrix in Eq. 2.17) from the new center to the old boundary's center. For that reason, a difference vector \( \overrightarrow{d}=\left\{dx, dy\right\} \) is defined between the centroids of the two boundaries, as demonstrated in Fig. 24. This vector is expressed in the following equation:

$$\overrightarrow{d}=centroid\left(x,y\right)-centroid\left({x}^{\prime },{y}^{\prime }\right) \qquad \left(2.18\right)$$
In the next step, the \( \overrightarrow{d} \) vector is utilized to locate the inner shifted boundary with the help of Eq. 2.19:

$$n\_boundary\left(nx, ny\right)=boundary\left({x}^{\prime },{y}^{\prime }\right)+\left(dx, dy\right) \qquad \left(2.19\right)$$

The inner shifted boundary n_boundary(nx, ny) is displayed in Fig. 24.
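A sketch of the scaling-plus-translation pipeline of Eqs. (2.15) to (2.19), with a hypothetical square boundary; uniform scaling s_x = s_y = s is assumed.

```python
import numpy as np

def scale_with_translation(points, s, t):
    """Apply the homogeneous transform of Eq. 2.17: scale by s and
    translate by t = (tx, ty)."""
    T = np.array([[s, 0.0, t[0]],
                  [0.0, s, t[1]],
                  [0.0, 0.0, 1.0]])
    pts = np.asarray(points, dtype=float)
    homog = np.column_stack([pts, np.ones(len(pts))])  # rows of (x, y, 1)
    return (T @ homog.T).T[:, :2]

# Hypothetical diamond boundary centered at (10, 10), scaled down by s = 0.8.
boundary = np.array([(2.0, 0.0), (0.0, 2.0), (-2.0, 0.0), (0.0, -2.0)]) + 10.0
scaled = scale_with_translation(boundary, 0.8, (0.0, 0.0))
d = boundary.mean(axis=0) - scaled.mean(axis=0)        # difference vector (Eq. 2.18)
inner = scaled + d                                      # re-aligned inner boundary (Eq. 2.19)
```

After the shift by d, the inner boundary shares the original centroid but has a smaller radius.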
Moreover, a circle can be located and drawn on the inner-shifted boundary as depicted in Fig. 25. In order to determine the intersecting polygon regions among the two boundaries and the circle, polygon set operations are applied to all of the circular regions. The outcome, the intersecting polygon region, is given in Fig. 26. After obtaining the intersecting polygon region, its mask is obtained with the help of polymask operations; the resulting mask is shown in Fig. 27.
Moreover, the region extraction operation is carried out between the image channel and the polymask. This is essentially an element-by-element matrix multiplication that can be computed with Eq. 2.20:

$$Extracted\_region={I}_{channel}\circ polymask \qquad \left(2.20\right)$$
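The authors use polygon set operations; the following self-contained sketch approximates the intersection with binary masks and then performs the element-by-element extraction of Eq. 2.20.

```python
import numpy as np

def intersect_and_extract(channel, lesion_mask, circle_mask):
    """Approximate the polygon-intersection step with binary masks:
    keep only circle pixels that also lie inside the lesion mask, then
    multiply element-by-element with the image channel (Eq. 2.20)."""
    poly_mask = np.logical_and(lesion_mask, circle_mask)
    return channel * poly_mask

# Hypothetical 10x10 channel; lesion occupies the left half, the circular
# region (here simplified to the top half) overlaps it in the top-left quadrant.
channel = np.full((10, 10), 2.0)
lesion = np.zeros((10, 10), dtype=bool)
lesion[:, :5] = True
circle = np.zeros((10, 10), dtype=bool)
circle[:5, :] = True
region = intersect_and_extract(channel, lesion, circle)
```

Everything outside the intersection is zeroed, so the subsequent mean, standard deviation, and homogeneity computations see only in-lesion pixels.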
Finally, the statistical measures (mean and standard deviation) and the statistical texture feature, particularly homogeneity, are calculated over these masks. Intersecting polygon feature extraction at different scales (different radii) is shown in Figs. 28 and 29.
All of these processes are carried out on the dermoscopy images in our dataset, considering 10 different color channels along with different scales. Algorithm 3 summarizes all of the steps mentioned in this section.
Experiments and data analysis
A set of 100 dermoscopy images obtained from the Edra Interactive Atlas of Dermoscopy [24] is used as a test bed to perform experiments and validate the proposed approach. These dermoscopy images are 24-bit RGB color images with dimensions ranging from 577 × 397 to 1921 × 1285 pixels.
Initially, the structure of features is created based on the following variations:

- 100 dermoscopy images
- Color spaces and channels: 10 base colors (Gray, Red, Green, Blue, Y, Cb, Cr, Hue, Saturation, Value)
- Circle radius sizes: 4 different scales (5, 7, 10, 15)
- Average and minimum of:
  ○ Homogeneity
  ○ Mean
  ○ Standard deviation
A sample of the features is illustrated in Table 1. For each circle radius size, the average homogeneity, minimum homogeneity, average mean, minimum mean, average standard deviation, and minimum standard deviation are listed; there are 4 different circle radius sizes under the red color channel of the RGB color space. Each row refers to a dermoscopy image, and there are 6 rows in the given sample table.
Radius sizes were determined through trial and error on dermatologist-drawn lesion borders. We observed that homogeneity drops drastically right at the lesion border. Thus, starting from radius size 2, we increased the radius size by one and checked whether that radius size captures the drastic homogeneity change. We continued increasing the radius up to the point where it no longer captures the homogeneity of an abrupt change at the border. We chose 4 different radius sizes because they captured sudden homogeneity changes at the lesion border. For instance, the results obtained for radius size 4 were almost the same as for radius sizes 5 or 6, so we picked only one of them, radius 5. Since the lesion border cutoff happens close to the lesion border, we observed that for radius sizes >15 these cutoffs are missed. Thus, we stopped at a radius size of 15 pixels.
One of the major preparation steps is to normalize all features to the range [−1, 1]. As described above, there are 10 distinct base colors, each circle radius has 6 unique texture feature types, and 4 different scales are considered for this experiment. All of these make a total of 240 features per dermoscopy image, and we extract them for all 100 dermoscopy images. With these in mind, the challenging question is which features are more significant; in other words, which features will yield better results when the classification experiment is performed with the Support Vector Machine (SVM) [25] classifier. Hence, in order to select significant features, the SVM Recursive Feature Elimination (SVM-RFE) [26] algorithm, which iteratively works backward from the given set of features, is employed. In each cycle, it sorts the features according to their weights in the SVM classifier and discards the features with the lowest weights. Once SVM-RFE is applied to all features, it yields an order based on rank (R) and sorts the features accordingly; the higher the R value, the more significant the feature. The color based sorting is illustrated in Table 1 along with the individual rank (R) scores. After generating the ranking of these features, the most informative color-based feature is selected, in which each color channel (24 features per channel) is ranked with the following equation,
\( Scor{e}_c={\sum}_{k\in c}{R}_k \) (3.1)
where k is the feature number and R is the ranking value. Eq. (3.1) generates a total score for each color channel c by summing up the R values in each color set. The ranking of total scores is shown in Table 2. What can be interpreted from Table 2 is that the sum of the ranks of all Blue-based features is 3910, which ranks at the top of the list. The second best color is Cr, and Green is the third. Thus, the two sets of features listed below are considered for the classification experiments.
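The per-channel scoring described above is a simple accumulation; a minimal sketch (the rank values are made up for illustration — real R values come from SVM-RFE over all 240 features):

```python
# Hypothetical (channel, rank R) pairs; in the paper each channel has
# 24 features, each with an SVM-RFE rank. These numbers are made up.
ranks = [("Blue", 240), ("Blue", 236), ("Green", 239), ("Cr", 238), ("Cr", 150)]

# Eq. (3.1): total score per color channel = sum of R over its features.
totals = {}
for channel, r in ranks:
    totals[channel] = totals.get(channel, 0) + r

best = max(totals, key=totals.get)  # channel with the highest total score
```

With the real ranks, this accumulation is what puts Blue (total 3910) at the top of Table 2.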

- Blue, Cr, Green (top three features according to the rank result)
- Blue, Cr, Green, and fractals by Kockara et al. [27]
As seen from the scoring results in Table 2, the blue color channel scored the highest for detecting malignancy in dermoscopy images, followed by Cr and then the green channel of the RGB color space. This indicates that malignant lesions are more distinctive in blue. There are studies investigating the effects of phototherapy with visible blue and green light that may explain why the blue channel is more informative for malignancy detection.
Phototherapy with visible light (specifically blue light, which ranges from 430 to 490 nm) has been commonly used as a treatment for certain skin diseases such as acne and psoriasis [28, 29]. Red light penetrates deeper into tissue than blue light [30]. It is shown in [31] that the combination of blue (415 ± 5 nm, irradiance 40 mW/cm², 48 J/cm²) and red (633 ± 6 nm, 80 mW/cm², 96 J/cm²) light produces an overall decrease in the melanin level. It is also shown that helium-neon laser irradiation with visible light significantly enhances the attachment of melanocytes to type IV collagen and stimulates migration and proliferation in melanocytes [32, 33]. This stimulates nerve growth factor, a major paracrine maintenance factor responsible for melanocytes' survival in the skin [34]. It is noted in [35] that the blue and green light (488 and 514 nm, respectively) of the argon laser is especially absorbed by melanin, which is produced by melanocytes.
In the classification scheme of this study, SVM [25] is used. The aim of the classification is to correctly classify dermoscopy images as malignant or non-malignant. The following parameters of SVM are varied to select the best classifier: SVM type (ν-SVM, C-SVM), kernel (polynomial, radial basis, linear), C (penalization parameter), R (coefficient), D (degree), and G (gamma parameter in the kernel function).
These parameters are determined after several experiments. We used three classification assessment methods in order to find the best set of features: leave-one-out (LOO), 10-fold cross validation (10-CV), and model accuracy. These are standard assessment methods in machine learning and information-gain related studies. Model accuracy gives a sense of how the problem is posed in the feature space (e.g., linearly separable, polynomial, or radial basis) and how we should approach it. LOO and 10-CV are similar to each other and assess how a classification model reacts to a new data point or dataset. The advantage of these methods is that they provide significant information about the classification models we build. On the other hand, in LOO and 10-CV assessments, one or more samples must be left out of the original dataset so that they can be tested, which means the training set shrinks to some extent. In the case of LOO, this issue is minimized because only one sample is removed from the dataset before the training of the classification model.
For our case, LOO uses 99 dermoscopy images for building a model and training the SVM classifier; the remaining dermoscopy image is then used for testing. In 10-CV, 90 of the dermoscopy images are used for building a model and training the SVM classifier, whereas 10 of the dermoscopy images are used for testing the classifier. This process is repeated 10 times for different randomly selected test sets of 10 images. For model accuracy, the entire set of dermoscopy images is used for building a model and training.
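The two resampling schemes amount to index partitions; the splitting logic can be sketched as follows (a hand-rolled sketch for illustration only — with scikit-learn one would typically use `LeaveOneOut` and `KFold` instead):

```python
def leave_one_out(n):
    """Yield (train, test) index lists: n models, each tested on 1 sample."""
    for i in range(n):
        yield [j for j in range(n) if j != i], [i]

def k_fold(n, k):
    """Yield k (train, test) splits of roughly equal size (no shuffling)."""
    fold = n // k
    for i in range(k):
        test = list(range(i * fold, (i + 1) * fold if i < k - 1 else n))
        train = [j for j in range(n) if j not in test]
        yield train, test

# For 100 images: LOO trains on 99 / tests on 1, a hundred times;
# 10-fold CV trains on 90 / tests on 10, ten times.
splits = list(k_fold(100, 10))
```

Each split would feed one train-and-test cycle of the SVM classifier, and the reported accuracy is the average over all cycles.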
Results and discussion
In the assessment of classification accuracy, we used the standard metrics precision, recall, and F-measure. Recall is the percentage of positive (malignant) labeled instances that are predicted as positive, given by TP/(TP + FN). Precision is the percentage of positive predictions that are correct, given by TP/(TP + FP). TP denotes true positives; in our case, TP corresponds to the melanoma lesions correctly classified as melanoma. TN denotes true negatives, the non-melanoma lesions correctly classified as non-melanoma. FP denotes false positives, the non-melanoma lesions incorrectly classified as melanoma. FN denotes false negatives, the melanoma lesions incorrectly classified as non-melanoma.
F-measure is the harmonic mean of precision and recall, given as:
\( F=2\times \frac{precision\times recall}{precision+ recall} \)
A high F-measure score indicates that the classifier successfully finds the targeted lesions without compromising precision or recall.
The graph in Fig. 30 is generated using the best three feature sets, the Blue-, Cr-, and Green-based features; each color channel has 24 possible features, so the 3 channels make a total of 72 features. As seen in the graph, 72 is the starting point of the x axis. The graph is obtained as follows. First, the SVM classifier runs with all 72 features, and the best achieved accuracy is marked on the graph. In the next step, the worst feature is discarded, and the remaining 71 features are used to train and test the SVM; the resulting value is marked on the graph at 71 on the x axis. This process of discarding one feature at a time is repeated until a single feature is left. By doing so, the graph is formed and finalized as seen in Fig. 30.
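The curve-building loop just described can be sketched as follows (`train_and_score` and `rank` stand in for the SVM training and SVM-RFE ranking used in the paper; the toy accuracy values are made up):

```python
def elimination_curve(features, train_and_score, rank):
    """Record accuracy while discarding the lowest-ranked feature each
    cycle: start with all features, end with one."""
    feats = list(features)
    curve = []  # (number of features, accuracy) pairs
    while feats:
        curve.append((len(feats), train_and_score(feats)))
        feats.remove(min(feats, key=rank))  # drop the worst feature
    return curve

# Toy run: per-feature "accuracy" doubles as the ranking score here.
acc = {"blue_h": 0.9, "cr_h": 0.8, "noise": 0.1}
curve = elimination_curve(acc,
                          lambda fs: sum(acc[f] for f in fs) / len(fs),
                          lambda f: acc[f])
# Accuracy improves as the noisy feature is eliminated.
```

Plotting `curve` (feature count on the x axis, accuracy on the y axis) reproduces the shape of the graphs in Figs. 30 and 32.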
Results presented in Fig. 30 are for the third method that we developed (the inner-shifted boundary by dynamic scaling, detailed in Section 3.4.5). As expected, the other two methods, given in Sections 3.4.3 (74 % accuracy) and 3.4.4 (83 %), did not produce higher accuracies for malignancy detection. This is due to noise (the region outside the lesion border) incorporated in these methods. Since the model accuracy is 100 %, the proposed model for the third method (Section 3.4.5) can accurately distinguish malignant lesions from benign lesions. This means that the proposed model is an accurate approach for classification purposes.
According to the graph in Fig. 30, it can be inferred that 13 features yield the optimal result. The feature size was determined through supervised feature selection with SVM-RFE [26]; the subset of 13 features was selected from 240 candidates. This implies that 13 features are sufficient to model and train the SVM classifier. In the 13-feature scenario in Fig. 31, the leave-one-out (LOO) methodology achieves up to 96 % classification accuracy, and 10-fold cross validation (10-CV) accomplishes up to 90 %. It should be noted that, based on the graph, fewer than 6 features cannot represent the proposed model well.
In Fig. 32, the graph is formed by using the best three feature sets plus the fractal features, for a total of 83 features. The idea behind generating this graph is the same as in the first case. In each training and testing cycle, the worst feature, based upon its ranking score, is eliminated from the 83-feature set; training and testing are applied to the remaining features, and the obtained classification accuracy is recorded and plotted on the graph. Again, this routine is carried out until one feature is left. Thus, the graph is built up as illustrated in Fig. 32.
With the same analytical approach, as seen from the graph, 16 features stand out with a promising result: 94 % correct classification accuracy for the LOO method and 91 % for 10-CV. A smaller number of features can also be considered to represent the model.
Conclusions
Accurate detection and objective evaluation of abrupt pigment pattern cutoff at the perimeter of a skin lesion is one of the important criteria for malignancy detection. However, it remains a challenging task and is biased by a dermatologist's experience level. In this article, a novel approach is proposed in which the abruptness of the pigment pattern along the lesion perimeter is measured. The boundary of the skin lesion is determined with the help of the density based lesion border detection technique. Using two different methods, the detected boundary is scaled down step by step. Then, along the scaled-down borders, pigment pattern homogeneities are extracted and computed in various color channels and at different scales.
The proposed method has been tested and validated on a test bed of 100 dermoscopy images (69 benign and 31 melanoma). The results show that the proposed approach is highly effective for detecting malignancy in dermoscopy images. More specifically, up to 96 % classification accuracy, a specificity of 0.96, and a sensitivity of 0.86 are achieved for malignancy detection in a particular color space. Finally, an F-measure of 0.87 is attained for malignancy detection.
References
Cancer Facts and Statistics 2015  Research  American Cancer Society. [Online]. Available: http://www.cancer.org/research/cancerfactsstatistics/cancerfactsfigures2015/. [Accessed: 23 Mar 2016].
Binder M, Schwarz M, Winkler A, Steiner A, Kaider A, Wolff K, Pehamberger H. Epiluminescence microscopy. A useful tool for the diagnosis of pigmented skin lesions for formally trained dermatologists. Arch Dermatol. 1995;131(3):286–91.
Stanganelli I, Pizzichetta MA, [Online]. Available: http://emedicine.medscape.com/article/1130783overview. Accessed 5 Sept 2016.
Vestergaard ME, Macaskill P, Holt PE, Menzies SW. Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a metaanalysis of studies performed in a clinical setting. Br J Dermatol. 2008;159(3):669–76.
Nachbar F, Stolz W, Merkle T, Cognetta AB, Vogt T, Landthaler M, Bilek P, BraunFalco O, Plewig G. The ABCD rule of dermatoscopy. High prospective value in the diagnosis of doubtful melanocytic skin lesions. J Am Acad Dermatol. 1994;30(4):551–9.
Dermoscopy tutorial. [Online]. Available: http://www.dermoscopy.org/atlas/base.htm. [Accessed: 23 Mar 2016].
Stolz W, Riemann A, Cognetta AB, et al. ABCD rule of dermatoscopy: a new practical method for early recognition of malignant melanoma. 1994. [Online]. Available: https://scholar.google.com/scholar_lookup?title=ABCD+rule+of+dermatoscopy%3A+a+new+practical+method+for+early+recognition+of+malignant+melanoma&author=W.+Stolz&author=A.+Riemann&author=A.+B.+Cognetta+et+al.&publication_year=1994. [Accessed: 23 Mar 2016].
Wolf IH, Smolle J, Soyer HP, Kerl H. Sensitivity in the clinical diagnosis of malignant melanoma. Melanoma Res. 1998;8(5):425–9.
Lin MJ, Victoria M, Catriona ML, Rory W, Kelly JW. Diagnostic accuracy of malignant melanoma according to subtype. Australas J Dermatol. 2014;55(1):35–42.
Tan E, Oakley A, Soyer HP, Haskett M, Marghoob A, Jameson M, Rademaker M. Interobserver variability of teledermoscopy: an international study. Br J Dermatol. 2010;163(6):1276–81.
Norton KA, Iyatomi H, Celebi ME, Ishizaki S, Sawada M, Suzaki R, Kobayashi K, Tanaka M, Ogawa K. Threephase general border detection method for dermoscopy images using nonuniform illumination correction. Skin Res Technol. 2012;18(3):290–300.
Mete M, Kockara S, Aydin K. Fast densitybased lesion detection in dermoscopy images. Comput Med Imaging Graph. 2011;35(2):128–36.
Suer S, Kockara S, Mete M. An improved border detection in dermoscopy images for density based clustering. BMC Bioinformatics. 2011;12(10):1–10.
Freeman H. On the encoding of arbitrary geometric configurations. IRE Trans Electron Comput. 1961;EC-10(2):260–8.
Celebi ME, Kingravi HA, Celiker F. Fast colour space transformations using minimax approximations. IET Image Process. 2010;4(2):70–80.
Materka A, Strzelecki M. Texture analysis methods—a review. COST B11 technical report. Technical University of Lodz; 1998. pp. 873–87.
Geman S, Geman D. Stochastic relaxation, Gibbs distribution and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell. 1984;6(11):721–41.
Hassner M, Sklansky J. The use of Markov random fields as models of texture. In: Rosenfeld A, editor. Image Modelling. New York: Academic Press; 1981. pp. 185–98.
Udrea A, Popescu D, Miron C. An analysis on the reliability of a series of texture and shape descriptors for melanoma diagnosis. UPB Sci Bull Series C. 2016;78(2):23–134.
Gonzalez R, Woods R. Digital Image Processing. 3rd ed. [Online]. Available: http://www.academia.edu/7310640/Digital_Image_Processing_3rd_ed.__R._Gonzalez_R._Woods. [Accessed: 23 Mar 2016].
Young IT, Gerbrands JJ, Van Vliet LJ. Fundamentals of image processing. The Netherlands: Delft University of Technology Delft; 1998.
Properties of gray-level co-occurrence matrix — MATLAB graycoprops. [Online]. Available: http://www.mathworks.com/help/images/ref/graycoprops.html. [Accessed: 24 Mar 2016].
Jähne B. Digital Image Processing. Berlin/Heidelberg: SpringerVerlag; 2005.
Argenziano G, Soyer HP, De Giorgi V. Dermoscopy: A Tutorial. Milan: Edra Medical Publishing & New Media; 2002.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.
Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1–3):389–422.
Kockara S, Mete M, Halic T, Yuruk N, Ercan M, Lawrence A. Fractals for malignancy detection in dermoscopy images. In: 2015 International Conference on Healthcare Informatics (ICHI); 2015. pp. 115–21.
Cunliffe WJ, Goulden V. Phototherapy and acne vulgaris. Br J Dermatol. 2000;142(5):855–6.
Avci P, Gupta A, Sadasivam M, Vecchio D, Pam Z, Pam N, Hamblin MR. Lowlevel laser (light) therapy (LLLT) in skin: stimulating, healing, restoring. Semin Cutan Med Surg. 2013;32(1):41–52.
Aziz-Jalali MH, Tabaie SM, Djavid GE. Comparison of red and infrared low-level laser therapy in the treatment of acne vulgaris. Indian J Dermatol. 2012;57(2):128–30.
Lee SY, You CE, Park MY. Blue and red light combination LED phototherapy for acne vulgaris in patients with skin phototype IV. Lasers Surg Med. 2007;39(2):180–8.
Yu HS, Wu CS, Yu CL, Kao YH, Chiou MH. Helium-neon laser irradiation stimulates migration and proliferation in melanocytes and induces repigmentation in segmental-type vitiligo. J Invest Dermatol. 2003;120(1):56–64.
Lan CCE, Wu CS, Chiou MH, Chiang TY, Yu HS. Lowenergy heliumneon laser induces melanocyte proliferation via interaction with type IV collagen: visible light as a therapeutic option for vitiligo. Br J Dermatol. 2009;161(2):273–80.
Low Level Laser (Light) Therapy (LLLT) for Cosmetic Medicine and Dermatology. [Online]. Available: http://photobiology.info/Sawhney.html. Accessed 06 May 2016.
Scheinfeld NS, Goldberg DJ, Laser Treatment of Benign Pigmented Lesions, [Online]. Available: http://emedicine.medscape.com/article/1120359overview. Accessed 5 Sept 2016.
Acknowledgements
Not applicable.
Declarations
This article has been published as part of BMC Bioinformatics Volume 17 Supplement 13, 2016: Proceedings of the 13th Annual MCBIOS conference. The full contents of the supplement are available online at http://bmcbioinformatics.biomedcentral.com/articles/supplements/volume17supplement13.
Funding
This research and its publication was partially supported by Arkansas Science and Technology Association Award# 15B25 and by the Arkansas INBRE program, with an award# P20 GM103429 from the National Institutes of Health/the National Institute of General Medical Sciences (NIGMS).
Availability of data and material
Not applicable.
Authors’ contributions
SK made the overall design of the study. SKa and MB implemented the proposed method as graduate students under SK’s supervision. MM and TH helped in development of the general comparison testbed, performing data analysis, and statistical measurements. HF and HW provided their clinical expertise in dermatology and dermoscopy. HW and HF also interpreted results’ relevance to clinical studies. All of the authors read and approved the manuscript.
Competing interests
The authors declare that they have no competing interests with regard to this study.
Consent for publication
Not applicable.
Ethics approval and consent to participate
Not applicable.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Kaya, S., Bayraktar, M., Kockara, S. et al. Abrupt skin lesion border cutoff measurement for malignancy detection in dermoscopy images. BMC Bioinformatics 17 (Suppl 13), 367 (2016). https://doi.org/10.1186/s1285901612214
DOI: https://doi.org/10.1186/s1285901612214