Density-based parallel skin lesion border detection with webCL
BMC Bioinformatics volume 16, Article number: S5 (2015)
Dermoscopy is a highly effective and noninvasive imaging technique used in diagnosis of melanoma and other pigmented skin lesions. Many aspects of the lesion under consideration are defined in relation to the lesion border. This makes border detection one of the most important steps in dermoscopic image analysis. In current practice, dermatologists often delineate borders through a hand drawn representation based upon visual inspection. Due to the subjective nature of this technique, intra- and inter-observer variations are common. Because of this, the automated assessment of lesion borders in dermoscopic images has become an important area of study.
Fast density based skin lesion border detection method has been implemented in parallel with a new parallel technology called WebCL. WebCL utilizes client side computing capabilities to use available hardware resources such as multi cores and GPUs. Developed WebCL-parallel density based skin lesion border detection method runs efficiently from internet browsers.
Previous research indicates that one of the highest accuracy rates can be achieved using density based clustering techniques for skin lesion border detection. While these algorithms do have unfavorable time complexities, this effect could be mitigated when implemented in parallel. In this study, density based clustering technique for skin lesion border detection is parallelized and redesigned to run very efficiently on the heterogeneous platforms (e.g. tablets, SmartPhones, multi-core CPUs, GPUs, and fully-integrated Accelerated Processing Units) by transforming the technique into a series of independent concurrent operations. Heterogeneous computing is adopted to support accessibility, portability and multi-device use in the clinical settings. For this, we used WebCL, an emerging technology that enables a HTML5 Web browser to execute code in parallel for heterogeneous platforms. We depicted WebCL and our parallel algorithm design. In addition, we tested parallel code on 100 dermoscopy images and showed the execution speedups with respect to the serial version. Results indicate that parallel (WebCL) version and serial version of density based lesion border detection methods generate the same accuracy rates for 100 dermoscopy images, in which mean of border error is 6.94%, mean of recall is 76.66%, and mean of precision is 99.29% respectively. Moreover, WebCL version's speedup factor for 100 dermoscopy images' lesion border detection averages around ~491.2.
Dermoscopy is a prevalent method used by dermatologists in the diagnosis of melanoma and other pigmented skin lesions. Dermatologists use a handy, high-resolution imaging tool called dermatoscope to take dermatoscopic images. Dermoscopy is now a well-established diagnostic tool to improve the clinical recognition of a broad spectrum of various skin disorders. Skin cancer detection is the most important indication of dermoscopy. There is evidence that the use of dermoscopy has increased the accuracy of diagnosis . Carli et al. (2004)  showed that the examination of a pigmented skin lesion including melanoma using dermoscopy allows physicians to realize morphologic features which are otherwise not visible to the naked eye. This in turn, comparing to conventional non-dermoscopic examination, allows physicians to reach a more reliable diagnosis of skin lesions. Thus, recent melanoma guidelines promote the use of dermoscopy in skin cancer screening and diagnosis .
Skin cancer is the most common form of cancer in the US and over 3.5 million cases are diagnosed annually . The deadliest form of skin cancer is melanoma. Melanoma is a malignancy of melanocytes which are special cells in the skin located under the outer surface epidermis. 15% of melanoma cases are fatal [3, 4]. Women at 25-29 years of age are the most-commonly affected group . Although melanoma accounts for only 4% of all skin cancers , it is the cause of 75% of skin-cancer-related deaths . Even with the help of dermoscopy, 70% of melanoma claims are still a false-negative diagnosis . Melanoma rates are rising amongst all demographics . With early detection, melanoma can often be cured with a simple excision operation.
Dermoscopy is a set of techniques for optical magnification of a region-of-interest on skin which makes subsurface structures more visible than traditionally photographic techniques . The procedure measures many properties of a skin lesion, such as color, size, symmetry, border, and change over time. The odds of successful diagnosis between naked eye examination and Dermoscopy are 15-6 . Even when dermoscopic images examined by an expert, diagnosis rates are not completely accurate.
An expert system capable of processing dermoscopic images could provide an additional diagnosis tool to aid dermatologist. In many common manual methods of examining photographs of lesions, a border is drawn by a dermatologist, and this border is 'subjectively' analyzed by dermatologist to diagnosis if the lesion is malignant melanoma or melanocytic. Similarly, automated systems designed to processes dermoscopic images usually start with automatic border detection before examining the lesions for the features with diagnostic importance such as color, symmetry, etc. . There are many methods for detecting the border of a lesion. The blue color channel is typically examined because empirical evidence suggests it provides the most accurate results . Reader is referred to  for details on analysis of color models and color channels on biomedical image processing.
Border detection is usually the first stage of analysis of dermoscopic images. Our implementation presented here automatically delineates the lesion border by using density based clustering technique. In order to get a clear definition of the lesion, some preprocessing, such as color space transformations, contrast enhancement, and artifacts removal are typically applied to the image . Following this pre-processing, a partitioning of the image occurs in a process known as segmentation. These disjoint regions are then examined by computer algorithms and scanned for lesion data, and combined to detect the border of the entire lesion.
According to Celebi et al.  automated skin lesion border detection can be divided into four sections: pre-processing, segmentation, post-processing, and evaluation. The pre-processing step involves color space transformations , contrast enhancement  and artifacts removal . The segmentation step involves partitioning of an image into disjoint regions . The post-processing is used to obtain the lesion border . The evaluation involves the assessment of the border detection results by a dermatologist. At the first stage of dermoscopy image analysis, border detection is usually applied  to detect other features more accurately. An active contour model is also introduced to detect skin lesion borders in dermoscopy images . Many other approaches have been applied to dermoscopy images for accurately segmenting the melanocytic skin lesions. Color histogram thresholding is one of them [23, 21]. In Peruch et al.  in addition to thresholding, researchers incorporate cognitive process of dermatologists for accurate melanocytic skin lesion segmentation.
Density based clustering algorithms identify regions of high data density, and require a definition of how dense the data should be . Density based spatial clustering of applications with noise, or DBSCAN , as its name indicates, is a prominent density based clustering method. It is used for spatial data with noise and has the advantages of being able to find irregularly shaped clusters. DBSCAN also has a sense of border and noise data, and requires minimal knowledge of dataset . It requires two inputs: minimum points, a measurement of how many points need to be grouped; and epsilon (ε), a measurement of how close the points need to be grouped. In the context of this study, points refer to pixels. DBSCAN has many applications, including Internet traffic classification; war-game frontline prediction; and facial recognition [25–27].
In DBSCAN, a cluster is a group of points that the number of points is equal to or greater than the minimum number of points (MinPts) in certain neighborhood of core points. Different point (node) definitions of DBSCAN are illustrated in Figure 1. The core point is that the neighborhood of a given radius (ε) has to contain at least a minimum number of points (MinPts), i.e., the density in the neighborhood should exceed pre-defined threshold (MinPts). The definition of a neighborhood is determined by the choice of a distance function for two points p and q, denoted by dist(p,q). For instance, when the Manhattan distance is used in 2D space, the shape of the neighborhood would be rectangular (See Figure 2). Note that DBSCAN works with any distance function so that an appropriate function can be designed for some other specific applications. DBSCAN is significantly more effective in discovering clusters of arbitrary shapes. It was successfully used for synthetic dataset as well as earth science, and protein dataset [28–30]. Once the two parameters ε and MinPts are defined, DBSCAN starts to cluster data points (e.g. pixels for images) from an arbitrary point. If the neighborhood is sparsely populated, i.e. it has fewer than MinPts points in the region query, then that point is labelled as a noise. Points that are causing the cluster to grow called border points. In the context of this paper, terms "node" and "point" are used interchangeably with pixel. Pseudocode of the DBSCAN is given in Algorithm 1.
Algorithm 1 - DBSCAN
D = Set of all Points
ε = Max distance between nodes
MinPts = Density of nodes needed
Procedure DBSCAN(D, ε, MinPts)
Foreach unvisited point P in dataset D
P is visited
NeighborPoints = regionQuery(P, ε)
If(sizeof(neighborpoints) < MinPts)
P is noise
P = next cluster
In our previous study we introduced computationally more efficient version of DBSCAN  for the purpose of automatic border detection. In that version, we removed redundant computations that exist in DBSCAN by only expanding a cluster around a border region since cluster expands from borders. It was proven in  that this approach is computationally more efficient than the traditional DBSCAN.
Even computationally improved version of DBSCAN  requires a lot of time for very large datasets. The need for speeding up DBSCAN is better understood when considering high resolution dermoscopic images, which consist of millions of nodes (pixels). Even improved version of DBSCAN given in  takes some time for generating results. Reader is referred to Table 1 for the timing of density based skin lesion border detection method's serial version for different size dermoscopy images. Not only image size but also complexity and irregularity of the skin lesion also reduce the performance. Thus, in order to benefiting available ubiquitous high performance computing hardware resources found in today's computers, we implement our density based skin lesion border detection method in a high performance parallel computing model called WebCL. WebCL provides a significant speedup and to provide a high level of portability for our case of dermoscopic image processing.
This section summarizes WebCL and driving force behind WebCL. Computing capability of today's computers has been evolving with introduction of more computational cores in chips rather than faster computational cores. This is because of reaching the physical limits of silicon chip, excessive heat dissipation causing noise and high voltage requirement in increasing clock frequency of integrated circuits. As a result, we use multicore and multiprocessors that consists of many computing chips in one single integrated circuit (e.g. multicore CPUs or GPUs). This trend in chip advancement can also be seen in mobile platforms such as smart phones and tablets. Therefore, the performance improvement in any applications, especially applications with high demand in computation power, can be only realized with the use of parallel computation. Although application parallelism for desktop applications is achievable at some degree for quite a while, up until introduction of WebCL web-based client-side applications were not able to use available parallel hardware resources such as multicores and GPUs at the client-side.
Lack of parallelism creates the main limiting factors for many potential web applications. Web applications work solely on the user device and although the device has enough computing ability, application cannot utilize the underlying multi-core and multiprocessor device (e.g. such as IPhone). This is due to the limitations placed on the current Web browsers' designs and lack of any middleware software layer. But with the recent technology called WebCL, it is now possible to design and develop parallel algorithms that could effectively use the parallel hardware. Specifically, WebCL is a recently introduced (first version released in March 2014) open web based standards for heterogonous parallel computing. As it is an open specification, it has been gaining wide acceptance nationally and internationally. WebCL specification has been accepted and driven by Khronos Group (Khronos is an open consortium funded by major software and hardware companies such as Apple, Google, IBM, NVIDIA, Samsung, Qualcomm, PIXAR, Texas instrument etc.). WebCL mainly introduces a new software middleware layer aims at directly accessing to the parallel hardware within the web browsers.
With WebCL, It is now possible to develop high performance web applications such as data visualization, video processing, 3D gaming, interactive simulations, image processing and segmentation that would not be possible before. The Web applications that can now effectively use these computing capabilities of any mobile devices as tablets (e.g. IPad, Android, and Windows based tablets), smartphones (e.g. IPhone, Windows, Firefox OS based phones etc. ) and also prospective devices that will be available in everyday use such as smart watches, smart glasses, and smart devices in smart homes.
The significance of using WebCL is that it removes any device dependency and provides true platform independence. We can design and develop computation intensive web applications that are accessible by any device regardless of their hardware and software platform. This also enables WebCL application portable, mobile, and future compatible; meaning that the application can adopt computing capability of upcoming devices in the future.
Any algorithm developed with WebCL needs to adopt a single instruction multiple sata (SIMD) approach. In SIMD, there is a single execution task working on its own portion of data called kernel. Therefore, kernel can operate in multiple cores at the same time simultaneously and independently which significantly increases the performance of the application.
WebCL is fast and portable. It takes full advantage of parallel architecture in new computing devices. It currently runs on Firefox, Safari, and Chrome web browsers . It can provide instant high performance computing to a desktop, laptop, a tablet, or a smartphone. Compared to java-script, the most popular language for web-based computation, WebCL is 100x faster . In the future, this technology will be implemented on more mobile phones and tablets, allowing for fast computing to be available anywhere, loaded straight from the internet.
WebCL is a parallel programming environment that also takes advantage of general purpose processing for graphics processing units (GPGPU). A GPU has more transistors than a consumer level CPU, and can be considered a more powerful processor . GPUs also produce more FLOPS/watt than CPUs, making them energy efficient alternatives over CPUs . This makes GPUs find application areas in medicine . GPUs use a SIMD programming paradigm. It computes a kernel, or an algorithm, on a stream, or ordered sequence of the same type of data, in parallel. A kernel operates on an entire stream.
Nokia's implementation of WebCL is a Firefox extension. WebCL support can be added to Firefox by installing OpenCL drivers for a local device, and installing Nokia's WebCL add-on. Samsung's WebCL implementation runs on Safari, and is built on top of WebKit. It must be installed from source code, and also relays on OpenCL drivers for a local device. The two implementations agree upon a single API, which is currently a working draft. Both of these implementations are capable of supporting applications in their current state. For our purpose, we use Nokia's Firefox Add-on for WebCL due to ease of deployment, portability across platforms, and access to developer add-ons native to Firefox. Although WebCL has many benefits such as accessibility, portability, and performance, it requires significant design and development time, and more importantly expertise in parallel computation and web application development. This is due to the fact that WebCL can only benefit from the simultaneously execution of application tasks on multiple hardware computation cores. For any application a new novel parallel algorithm needs to be developed designated for WebCL execution scheme and parallel hardware.
Our implementation uses Manhattan Distance to calculate the distance between two pixels. It has two large advantages leading towards a small computational complexity: it maps easily to integer space which performs quickly on a GPU, and it maps naturally onto an image . Manhattan Distance is the change in × plus the change in Y. Distance based calculations are very common in density based clustering. Figure 3 illustrates a sample radius 4 Manhattan region with the region's center being 0. Next section introduces WebCL implementation details of the density based skin lesion border detection method.
To take full advantage of the acceleration available through WebCL, our previous density based skin lesion border detection method was implemented in data-parallel. Nodes are divided into geographic regions, with some redundancy in the area covered in order to limit the cross thread communication requirements. Each node is assigned to a thread, creating up to millions of threads, and the distance is calculated between nodes in the same previously created geographic regions. Using the transitive property, the output of each thread is combined using a tree reduction algorithm. A border is detected by noticing a drop in density amongst the nodes. After the image has been scanned, a line is drawn displaying the border of the lesion. After the image has been processed, it is drawn in an HTML5 canvas with border regions represented in a color that contrasts with the lesion image, thus delivering the border. This is illustrated in Figure 4.
A serial version of density based skin lesion border detection method was implemented in C++. Following that, a parallel WebCL version was designed and implemented, taking full advantage of the speedups made available by the WebCL framework. The serial C++ implementation reads a dermoscopy image, and generates every pixel's cluster id along with whether that pixel is border, core, or noise. The serial C++ version of the implementation uses the pseudocode mentioned in Algorithm 1.
The WebCL implementation accepts a dermoscopy image file as an input. The input file is divided into sizes capable of being stored inside the GPU. More specifically, a dermoscopy image partitioned on to a smaller pieces to be concurrently executed on the GPUs. For effective image partitioning available memory spaces in the GPUs are considered. Nokia's Firefox implementation currently has a tight memory limit. Thus, we had to incorporate this limitation along with the available memory space in the GPUs. Each image partition is then scanned using the parallel implementation of density based skin lesion border detection (see Algorithm 2). The output of each scan (Region Query in Algorithm 2) is merged in to a global memory in the GPU such a way as to combine clusters span that across multiple partitions/borders. How image data is partitioned and how borders on different image partitions are merged are explained in the following section. More specifically, next section explains how merge operation is designed and implemented. Due to the nature of independent concurrent threads running the image partitions, the same clusters falling in to different partitions may be labelled differently. In order to eliminate that problem after parallel threads executions on different image partitions, we need an additional operation called merging.
Algorithm 2 - Parallel version of DBSCAN for each image partition using density based clustering to find the data density around a certain location
N = Contains a location in an.Image Partition
Partition.Image = A 2D array, each location may be a node
Eps = The distance from the center of N to be searched
Neighbors = A list of all Nodes within Eps distance of N
For each Location T inside an Image Partition
If T is a node
If The distance between N and T <= Eps
1) Shingled partitioning
Many devices have memory limitations. Thus, partitioning an image is necessary if the image size is larger than the available memory space. This is better understood when virtual slides are considered. For instance, virtual slides with 10 GB size are quite common. For these very large images, partitioning the image and processing the partitioned image inside the GPU is necessary. Not only that, image partitioning is also important for scalability and portability. A modern day GPU typically can support 2 GB of data. Image partitioning increases the portability, because a device queue can be created to determine a maximum partition size. So that later data can be partitioned and processed in the GPU's device queue. This allows for an application to be optimized for high-end hardware, while still running on lower end or mobile hardware. So, when data is partitioned dynamically according to the available resources and data size, the application can also be dynamically scaled up or down. Algorithm 3 summarizes image partitioning data structure. Xoffset and Yoffset in Algorithm 3 are determined dynamically with an image size and the available resources in the GPU.
Algorithm 3 - Image Partitioning Data Structure
Image = 2D array, each position (pixel) is a node
Xoffset = Where this partition starts on the × axis of the whole image
Yoffset = Where this partition starts on the Y axis of the whole image
Because density based skin lesion border detection uses the density of a point's (a pixel) ε-neighborhood, true partitioning, wherein any given pixel must reside in one and only one partition, would not be satisfied. This is due to the fact that for a point to be properly categorized as a core node, a border node, or a noise node along with a proper cluster ID, the algorithm must be able to examine that point's neighborhood within a radius of ε. This is not a problem for the points falling inside the partition. However, there is a need for a special care for the points (pixels) lying on the edge of a partition (e.g. a neighbor of another partition). This special care is handled by examining the edge points with a large portion of their ε-neighborhood missing (residing in the adjacent partition). More specifically, to deal with the edges of a partition, it becomes necessary to have some redundancy in the border regions of partitions. We call these redundant regions together with the partition that owns the redundant region as shingles. More specifically, in our implementation, each shingle is composed of a core partition plus a lap region which overlays a portion of the core regions of the shingles to its right and bottom. For instance, core partition of partition 1 with the length of ε is illustrated as a red region in Figure 3. The lap region is 2ε+1 pixels, as needed to accurately measure the density of a point residing near a border.
For instance, Figure 3 illustrates shingle 1 as partition 1 along with the redundant zone (lap region) which is illustrated as a shaded area. Notice that, width of that shaded region is selected as ε for not to miss ε-neighbors of the edge points in the red area (core partition). So that, clusters in partition 1 including edge points/nodes ensured that they will have the same cluster IDs with partitions 2, 4, and 5. This happens when partition 1's edge points' ε-neighbor region query determines that points in partitions 2, 4, and 5 fall in to the same cluster.
Serial image partitioning implementation is a simple O(P) operation where P is the number of partitions. In our parallel implementation, partitioning is also done in concurrently. So that, each thread creates its own unique partition. Uniqueness is guaranteed by unique thread offset which is calculated from thread IDs. Creating partitioning concurrently in different threads reduces computation complexity for partitioning to O(1). The × and y offsets along with the number of rows and columns of partitions are calculated from the core partition size, while the total partition height and width are 2ε+1 pixels larger than their respective core sizes due to the inclusion of the lap region. Pseudocode for image partitioning is given in Algorithm 4. Once image is partitioned then next stage is processing these partitions concurrently with density based skin lesion border detection method.
Algorithm 4 - Method for creating list of partitions
Partitionwidth = width of a partition
Partitionheight = height of a partition
Height = height of image
Width = width of image
partitionList = List of Partitions
partsPerRow = width / partitionwidth
PartRows = height / Partitionheight
For each row in partsRow
For each partition in partsPerRow
xOffset = indexof partition in Row * PartitionWidth
yOffset = indexof row in partsRow * Height
partition = new Part(xOffset, yOffset)
2) Partition processing
Partition processing also occurs in parallel inside a WebCL device. Partition processing finds clusters within a single partition. In this stage each pixel of a partition in the image is given a single thread. The thread checks whether the node (pixel) is a core/border/noise node. Then every node in a cluster negotiates a cluster id, agreeing upon the smallest pixel id as a cluster id. The negotiation process is run iteratively until all elements agree. Partition processing summarized in Algorithm 5 and 6 accordingly.
Algorithm 5 - Partition processing. It is performed for every pixel in parallel. This scans for the minimum neighbor ID.
Partition = A partition to process, a global variable
NodeList = A list of all nodes in a partition
C=an integer value of the cluster ID
Procedure PartitionProcessing() //create 1 thread per pixel in parallel
Sum = 0 Sum2 = 1
Sum2 = Sum
Sum = 0
NodeList = Scan(Partition);//scans for minimum cluster ID among ε-neighbors
Foreach Node N in NodeList
Sum += N.C
Algorithm 6 - Scan operates on the data stream of an image, segmented and loaded as a partition. The partition is loaded into the constant memory.
Partition = A partition to search for clusters in, a global variable
minPts = A measure of minimum cluster density, a global variable
NodeList = A list of all nodes and an associated integer value C or cluster ID, a global variable.
Procedure Scan()//1 thread per pixel in parallel
Pixel n = partition.image[x][y];
If(n is a node)//core or noise or border
Neighbors = Region Query// ε-neighbors
If(Neighbors.size >= minPts)
n.C = Smallest C of all Neighbor's Cs
Else if(p is within the range of a core node)
n.C = Smallest C of all Core nodes in n's neighbors
Figure 5 illustrates an exemplary scanning process step by step for ε=2 and minpts = 2 in a binary image. Initially integers are pixel IDs. In each iteration pixel IDs are changed to the minimum cluster ID of the ε-neighbor. In each iteration, ε=2 neighbors' IDs changed to the minimum cluster ID.
The longest path of the merged region determines how many iterations of scan must be completed before each node in a cluster is associated with the same cluster ID. Figure 6 illustrates the longest path as a grey area for the clustered group of pixels in Figure 5. Figure 7 illustrates the worst case scenario.
3) Partition merging
Partition merging works by using the transitive property. Areas of the image that are on the border of a partition are scanned twice. If this node is found in two clusters, than the transitive property demonstrates that the two clusters are actually the same cluster. Nodes clustered in two partitions can be found on the border of a single partition, within 1 ε + 2 of an edge of a partition. This process is completed by merging each partition with a list of all partitions already merged. Algorithms 7 and 8 summarize the merge operation.
Algorithm 7 - shows the high level process of splitting an image into smaller pieces and combining the output of the smaller pieces.
GlobalNodeList - A list of all nodes in the entire image
PartitionList = CreateParititons()
For each partition in PartitionList
NodeList = ParitionProcessing(partition)
Algorithm 8 - Partition merge operation: shows how clusters spanning multiple partitions are detected.
NodeList - A list of all nodes in a partition
GlobalNodeList - A list of all nodes in the entire image
For each node in a cluster in Partition
For each node in GlobalNodeList Within Eps of a Parition border
if two nodes are in the same location
Merge the clusters associated with each node
Figure 8 illustrates how partitions processed by different threads are merged in later stages of the computation. In this specific example, regions with cluster IDs 4 and 0 are merged.
4) Finding the border
Density based clustering can classify nodes into three separate categories, core, border, and noise. A core node has high neighbor density that means that the node has more than MinPts number of neighbors. A border node is inside the neighborhood of a core node, but it does not have a neighbor density greater than MinPts. A noise node has sparse neighbor density, and is not neighbor with any core nodes. We use these definitions to determine the border of a skin lesion, and use it to handle noise values. After each node is classified, we can use the X, Y data of each node to draw the core and border nodes onto an HTML5 canvas. We draw border nodes using a color with high contrast to the color we draw border nodes with (see Figure 2 for an example). This allows for the border to be easily visible. Algorithm 9 demonstrates the step by step procedure of classifying each node into three categories: core, border, and noise.
Algorithm 9 - Node classification
GlobalNodeList = A list of all nodes in the image
MinPts = Minimum density to be determines a cluster
CoreNodeList = A list of nodes that are at the core of a cluster
BorderNodeList = A list of nodes that are at the border of a cluster
NoiseList = A list of nodes that are not in any cluster
For each Node N in GlobalNodeList
ClusterCount = Number of Nodes A where A.C = N.C
If ClusterCount < MinPts
Else if RegionQuery(N, GlobalNodeList).size < MinPts
Results and discussion
Amdahl's law  for a maximum theoretical speedup states that an algorithm can be accelerated by the portion of the algorithm that is parallelizable plus the portion of the algorithm that is serial. In the serial version of density based skin lesion border detection, each pixel needs to look up ε2 other pixels to determine density. Drawing the image is done in linear time big O(#pixels). The serial time is ε2*#pixels, or O(ε2). The speedup achievable according to Amdahl's law is ε / #pixels, or big O(ε). This turns the time complexity from quadratic based on ε and pixel count to a linear based on ε. This assumes that each pixel can be assigned a thread that is run concurrently. Our implementation does not achieve Amdahl's law because of communication overhead. However, the parallel WebCL version of density based skin lesion border detection algorithm achieves an average of around ~491.2 speedup over the serial version on 100 dermoscopy images. See Table 1 for a list of speedup factors for 100 dermoscopy images along with the resolution of each image. While parallel version has obtained considerable speedups, it had exactly the same accuracy ratios of the serial version given in our previous work  in which mean of border error is 6.94%, mean of recall is 76.66%, and mean of precision is 99.29% respectively. Schaefer et al.  is the first study that uses border error (XOR) measure for dermoscopic image analysis.
To determine that parallel version of the algorithm is generating the same accuracies with the serial version for the same images; we conduct controlled experiments on the same images. Controlled experiments mean that whatever the order of randomly selected pixels in the serial version is, we used the same order for the pixels in the parallel version. With these controlled experiments, we obtained the same accuracy ratios as given in Table 2 for both the serial and parallel versions. Table 2 shows precision, recall, and border error for 100 dermoscopy images. However, both serial and parallel versions of the algorithm each time randomly select the pixels for processing, then results may not have exactly the same accuracies. By the way, discrepancy of accuracies will be unnoticeable (e.g. 99.13089% vs. 99.13091%). This unnoticeable discrepancy is even true for the serial version; when serial version runs on the same image at different times (means that generates different order of randomly selected pixels). Thus, it cannot generate exactly the same results.
Table 1 summarizes all the results obtained for 100 dermoscopy images for both serial and WebCL parallel versions. It also shows speedup factors for each dermoscopy image. Figure 9 plots speedup factors of 100 dermoscopy images with varying resolutions. Figure 10 plots speedup factors of 100 dermoscopy images by the size of the lesion (number of pixels in the lesion) in order.
As seen from Figure 10, in some cases even though lesion size is smaller for some images, they have lower speedup factors or vice versa. In some cases; however, lesion size is large but speedup is large too. This is because: the lesion's shape is highly irregular so either causing many region queries or there are many merge operations in the parallel version. Depends on the shape of a lesion, many merge operations may occur even for small size lesions in parallel version. See Figure 11 for an artificially created example. For instance, in this case assume that there are 4 parallel sections which are illustrated by red lines in Figure 11. All of these parallel sections will run concurrently. Thus, each of these concurrent tasks will find too many clusters in their local partitions. However, as can be seen from Figure 11, all of these clusters eventually fall in to the same global cluster. Therefore, image resolution or lesion size may not always be a good indicator for a speedup.
Using a parallel version of density based skin lesion border detection can provide quick skin lesion boarder detection for dermoscopic images while keeping the accuracy of the serial version. Automated border detection can supplement expert dermatologist, and aid them in diagnosis of melanoma or other pigmented skin lesions. While WebCL is currently an emerging technology, a full adoption of WebCL into the HTML5 standard would allow for this implementation to run on a very large set of hardware and software systems through web browsers. WebCL takes full advantage of parallel computational resources on a local machine, and allows for compiled code to run directly from the Web Browser. This makes it a good candidate for computationally expensive algorithms to be placed in a web browser.
Carli P, De Giorgi V, Chiarugi A, Nardini P, Weinstock MA, Crocetti E, andGiannotti B: Addition of dermoscopy to conventional naked-eye examination in melanoma screening: A randomized study. Journal of the American Academy of Dermatology. 2004, 50 (5): 683-689. 10.1016/j.jaad.2003.09.009.
Plüddemann A, Heneghan C, Thompson M, Wolstenholme J, and Price CP: Dermoscopy for the diagnosis of melanoma: primary care diagnostic technology update. The British Journal of General Practice. 2011, 61 (587): 416-417. 10.3399/bjgp11X578142.
NL Hufford D: Etiological Factors in Skin Cancers: Environmental and Biological. In Cancer of the Skin. Edited by: R. DS. 2010, London: Elsevier, 2
AmericanCancerSociety: Cancer Facts&Figures. 2010, Available online (Accessed April 2015), [http://www.cancer.org/acs/groups/content/@nho/documents/document/acspc-024113.pdf]
CJ Riegel DS: Malignant melanoma: prevention, early detection, and treatment in the 21st century. CA Cancer J Clin. 2000, 5: 21-
AmericanCancerSociety: Melanoma Skin Cancer Overview. 2010, Available online (Accessed April 2015), [http://www.cancer.org/acs/groups/cid/documents/webcontent/003063-pdf.pdf]
Troxel DB: Pitfalls in the Diagnosis of Malignant Melanoma: Findings of a Risk management panel study. Am J Surg Pathol. 2003, 27: 1278-1283. 10.1097/00000478-200309000-00012.
David E: Skin cancer. Melanoma and other specific nonmelanoma skin cancers. Cancer. 1995, 75 (S1): 245-259. 10.1002/1097-0142(19950101)75:1+<245::AID-CNCR2820751310>3.0.CO;2-7.
Soyer HP, Kenet RO, Wolf IH, Kenet BJ, Cerroni L, et al: Clinicopathological correlation of pigmented skin lesions using dermoscopy. EJD. 2000, 10 (1): 22-28.
Vestergaard ME, Macaskill PHPM, Holt PE, Menzies SW: Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a meta-analysis of studies performed in a clinical setting. British Journal of Dermatology. 159 (3): 669-676.
Celebi ME, Iyatomi H, Schaefer G, Stoecker WV: Lesion border detection in dermoscopy images. Comput Med Imaging Graph. 2009, 33: 148-531. 10.1016/j.compmedimag.2008.11.002. Mar
Kerri-Ann Norton, Iyatomi Hitoshi, E Celebi Emre, Ishizaki Sumiko, Sawada Mizuki, Suzaki , Reiko , Kobayashi Ken, Tanaka Masaru, Ogawa Koichi: Three-phase general border detection method for dermoscopy images using non-uniform illumination correction. Skin Research and Technology. 2012, 18: 290-300. 10.1111/j.1600-0846.2011.00569.x.
Celebi ME, Kingravi HA, and Celiker F: Fast colour space transformations using minimax approximations. IET Image Processing. 2010, 4: 70-80. 10.1049/iet-ipr.2008.0172.
Abbas Qaisar, Celebi E. Emre, Garcia Irene Fondón, Rashid Muhammad: Lesion border detection in dermoscopy images using dynamic programming. Skin Research and Technology. 2011, 17: 91-100. 10.1111/j.1600-0846.2010.00472.x.
Celebi ME, Iyatomi H, Schaefer G, Stoecker WV: Lesion border detection in dermoscopy images. Comput Med Imaging Graph. 2009, 33: 148-53. 10.1016/j.compmedimag.2008.11.002.
Pratt WK: Digital image processing. PIKS Scientific inside. 2007, Hoboken, N.J.: Wiley-Interscience, 4
Gomez D. D, Butakoff C, Ersboll B. K, and Stoecker W: Independent histogram pursuit for segmentation of skin lesions. IEEE Trans Biomed Eng. 2008, 55: 157-61. Jan
Celebi ME, Kingravi HA, Iyatomi H, Aslandogan YA, Stoecker WV, Moss RH, Malters JM, Grichnik JM, Marghoob AA, Rabinovitz HS, and Menzies SW: Border detection in dermoscopy images using statistical region merging. Skin Res Technol. 2008, 14: 347-53. 10.1111/j.1600-0846.2008.00301.x. Aug
Sonka M, Hlavac V, Boyle R: Image processing, analysis, and machine vision. 1999, PWS publishing Pacific Grove, CA, 2:
Celebi ME, Aslandogan YA, Stoecker WV, Iyatomi H, Oka H, Chen X: Unsupervised border detection in dermoscopy images. Skin Res Technol. 2007, 13: 454-62. 10.1111/j.1600-0846.2007.00251.x. Nov
Celebi ME, Wen Q, Hwang S, Iyatomi H, Schaefer G: Lesion Border Detection in Dermoscopy Images Using Ensembles of Thresholding Methods. Skin Res Technol. 2013, 19 (1): e252-8. 10.1111/j.1600-0846.2012.00636.x.
Mete M, Sirakov NM: Lesion detection in demoscopy images with novel density-based and active contour approaches. BMC Bioinformatics. 2010, 11: S23-
Peruch F, Bogo F, Bonazza M, Cappelleri VM, Peserico E: Simpler, Faster, More Accurate Melanocytic Lesion Segmentation Through MEDS. IEEE Transactions on Biomedical Engineering. 2014, 61 (2): 557-565.
Martin Ester, Kriegel Hans-Peter, Sander J, Xu Xiaowe: A density-based algorithm for discovering clusters in large spatial databases with noise. Proceedings of 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96). 1996, 226-231.
Caihong Yang, Wang , Fei , Huang Benxiong: Internet traffic classification using DBSCAN. Information Engineering, 2009. 2009, ICIE'09. WASE International Conference, 163-166.
John Rushing, Tiller , John McDowell, Drew , Tanner Steve: Adaptive Artificial Intelligence for Next Generation Conflict. Final Tech Report, TILLER (JOHN) MADISON AL. 2004
Xueping Su, Peng , Jinye , Feng , Xiaoyi , Wu , Jun , Fan Jianping: Linking names and faces by person-based subset clustering. Proceedings of the Third International Conference on Internet Multimedia Computing and Service. 2011, 120-123.
Mete M, Tang F, Xu X, Yuruk N: Finding Functional Modules. In Systems Biology for Signaling Networks. 2010, Springer New York, 253-273.
Mete M, Tang F, Xu X, Yuruk N: A structural approach for finding functional modules from large biological networks. BMC Bioinformatics. 2008, 9: S19-
Ertoz L, Steinbach M, Kumar V: Finding Clusters of Different Sizes, Shapes, and Densities in Noisy, High Dimensional Data. In Proceedings of Second SIAM International Conference on Data Mining. 2003
Mete M, Kockara S, Aydin K: Fast density-based lesion detection in dermoscopy images. Comp Med Imag and Graph. 2011, 35 (2): 128-136. 10.1016/j.compmedimag.2010.07.007.
Tasneem Brutch, Bourges-Sevenier Mikael: Khronos WebCL: Accelerating Web Applications. 2013
David Luebke, Harris , Mark , Govindaraju , Naga , Lefohn , Aaron , Houston , Mike , Owens , John , Segal , Mark , Papakipos , Matthew , Buck Ian: GPGPU: general-purpose computation on graphics hardware. Proceedings of the 2006 ACM/IEEE conference on Supercomputing, ACM. 2006, 208-
Jeon Won, Gibbs Simon: WebCL for Hardware-Accelerated Web Applications. Tizen Developer Conference. 2012
Erik Smistad, Elster Anne C, Lindseth Frank: Fast Surface Extraction and Visualization of Medical Images using OpenCL and GPUs. The Joint Workshop on High Performance and Distributed Computing for Medical Imaging. 2011
Jeon W, Brutch T, Gibbs S: WebCL for hardware-accelerated web applications. WWW'12 Dev. 2012, Lyon, France
Massoud Pedram, Brooks , David , Pinkston , Timothy : Report for the NSF Workshop on Cross-layer Power Optimization and Management. NSF Workshop on Cross-Layer Power Optimization and Management. 2012
Stone John E, Gohara , David , Shi Guochun: OpenCL: A parallel programming standard for heterogeneous computing systems. Computing in science & engineering. 2010, 12 (3): 66-
Munshi Aaftab: The OpenCL Specification. Khronos OpenCL Working Group. 2011, 1.1: doc. revision 44
Brutch Tasneem: WebCL Overview Roadmap. DevCon 5. 2011
Wang Jigang, and Cooper, Leon N: Improving nearest neighbor rule with a simple adaptive distance measure. Pattern Recognition Letters. 2007, 28 (2): 207-213. 10.1016/j.patrec.2006.07.002.
Amdahl GM: Validity of the single processor approach to achieving large scale computing capabilities. AFIPS spring joint computer conference. 1967, 483-485.
Schaefer G, Rajab MI, Celebi ME, Iyatomi H: Colour and contrast enhancement for improved skin lesion segmentation. Computerized Medical Imaging and Graphics. 2011, 35: 99-104. 10.1016/j.compmedimag.2010.08.004.
This research was partially supported by Arkansas Science and Technology Association Award# 15-B-25 and NSF REU Award# 1062838: "REU Site: HIT@UCA: Applied Research in Health Information Technology". We also would like to thank Mr. Reigada from SUNY Binghampton and Mr. Kruger from Northeastern State University for being part of the NSF REU program.
This article has been published as part of BMC Bioinformatics Volume 16 Supplement 13, 2015: Proceedings of the 12th Annual MCBIOS Conference. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/16/S13.
The authors declare that they have no competing interests.
SK made the overall design of the study. MM provided insights and analysis of the density-based algorithms. JL implemented the WebCL version as a graduate student under SK's supervision. SK developed the general comparison testbed, performed data analysis, algorithm testing, and statistical measurements. JL made benchmarking. TH helped develop design parallel algorithm along with the help of implementing the algorithm in WebCL. JL, SK, and TH contributed to the writing of this manuscript. All of the authors read and approved the manuscript.
About this article
Cite this article
Lemon, J., Kockara, S., Halic, T. et al. Density-based parallel skin lesion border detection with webCL. BMC Bioinformatics 16 (Suppl 13), S5 (2015). https://doi.org/10.1186/1471-2105-16-S13-S5
- Core Node
- Serial Version
- Density Base Cluster
- Region Query
- Border Node