
An automated 3D modeling pipeline for constructing 3D models of MONOGENEAN HARDPART using machine learning techniques

Abstract

Background

Studying the structural and functional morphology of small organisms such as monogeneans is difficult due to the lack of three-dimensional visualization. One possible way to resolve this issue is to create digital 3D models, which may aid researchers in studying the morphology and function of monogeneans. However, developing 3D models is a tedious procedure, as the entire complicated modelling process must be repeated in a comprehensive 3D modelling software for every new target 3D shape. This study was designed to develop an alternative 3D modelling approach for building 3D models of monogenean anchors, which can be used to understand these morphological structures in three dimensions. This alternative approach aims to avoid repeating the tedious modelling procedure from scratch for every single target 3D model.

Results

An automated 3D modelling pipeline empowered by an Artificial Neural Network (ANN) was developed. This pipeline enables automated deformation of a generic 3D model of a monogenean anchor into another target 3D anchor. It successfully automated the generation of the eight target 3D anchor models (representing eight species: Dactylogyrus primarius, Pellucidhaptor merus, Dactylogyrus falcatus, Dactylogyrus vastator, Dactylogyrus pterocleidus, Dactylogyrus falciunguis, Chauhanellus auriculatum and Chauhanellus caelatus) from the respective 2D illustration inputs without repeating the tedious modelling procedure.

Conclusions

Despite some constraints and limitations, the automated 3D modelling pipeline developed in this study demonstrates a working application of machine learning to 3D modelling. This study has not only produced an automated 3D modelling pipeline but has also demonstrated a cross-disciplinary research design that integrates machine learning into a specific domain of study, namely 3D modelling of biological structures.

Background

Studying the structural and functional morphology of small organisms is difficult due to the lack of three-dimensional visualization. One viable way to resolve this is to create digital 3D models using 3D imaging techniques such as confocal microscopy [1, 2] and scanning electron microscopy (SEM) [3,4,5,6]. As highlighted by Galli et al. [1], digital 3D models of monogenean hard parts reconstructed from confocal microscopy contain morphological details which are not detectable in the corresponding 2D illustrations. However, the 3D imaging equipment used for such reconstruction is expensive and demands tedious specimen preparation prior to usage [7].

Hence, Teo et al. [8] demonstrated a polygonal modelling method that uses 2D illustrations as templates to construct 3D models of the haptoral parts of monogeneans in an existing commercial 3D modelling software, Autodesk 3ds Max. The resulting models were shown to be effective in offering 3D visualization of the spatial relationships of morphological characters in a monogenean haptor. However, Autodesk 3ds Max requires technical skill to create a decent 3D shape of a target specimen because of the high complexity of its user interface. Another approach is to construct a new digital 3D model from an existing one via a shape deformation technique [9], which obviates the need to repeat a complicated modelling process from scratch for each new 3D model. One limitation of this technique is that it requires a modeller to manually choose the vertices of a digital generic 3D model and align them with the landmark points of an input 2D image. If this step can be automated with minimal human intervention, the entire modelling procedure becomes much easier and more efficient.

Landmark point detection empowered by a machine learning algorithm could enable auto-detection of the landmark points on an input image. The landmark points predicted by the machine learning model can then be used to direct the deformation of a digital generic 3D model by automatically mapping its vertices to their corresponding landmark points, eventually producing the target 3D shape.

Landmark point detection has been studied extensively for tracking human facial feature points, with applications to human face recognition [10,11,12], facial expression analysis [13], facial landmark localization [14, 15], age estimation [16], gender classification [17] and 3D face modelling [18]. These studies share the common idea that facial feature points located at the eyes, nose, mouth and chin of a 2D facial image carry semantic meaning, and that these points can be detected automatically by a machine learning algorithm trained on annotated 2D sample images. The idea of landmark detection has also been applied in several other fields of study, such as geometric morphometrics for bioimages [19, 20], anatomical feature detection for medical diagnosis [21, 22] and human body pose detection [23].

One machine learning algorithm common to most landmark detection studies is the Artificial Neural Network (ANN). The popularity of ANNs in recent years is due to their effectiveness in image analysis tasks [24]. In some studies, an ANN was used along with a clustering algorithm to partition the training set into several subsets sharing common properties [25], or with an autoencoder to rectify outlier points [15]; such coupling approaches were able to enhance the prediction accuracy of the Convolutional Neural Network (CNN). An autoencoder was also used on its own to automatically identify human facial expressions from geometric features, with the feature data trained in an unsupervised manner. Chen et al. [12], on the other hand, used a Recurrent Neural Network (RNN) in their landmark detection study to deal with non-linear deformations of the human facial shape. To the best of our knowledge, there is no reported study on the application of ANNs for landmark point detection to enable automated construction of digital 3D models of biological specimens such as monogeneans, the target specimens of this study.

Monogeneans (Class: Monogenea, Phylum: Platyhelminthes) are parasitic flatworms which possess soft anatomical structures as well as hard, sclerotized parts in both the male and female copulatory systems. Their anatomical structures, such as the haptoral elements, are of diagnostic importance in the taxonomic description and identification of monogenean species [26,27,28,29,30,31]. However, studying these morphological structures is not easy because monogeneans are soft-bodied flatworms with a fragile integument, unable to withstand desiccation during long investigation under a microscope. Digital 3D models would help taxonomists to understand the morphology as well as the functions of these structures.

The main aim of this study is to develop an automated 3D modelling pipeline that uses 2D illustrations as input to generate a variety of digital 3D models of monogenean anchors without repeating the tedious 3D modelling process from scratch. The target models of this study are limited to the haptoral anchors of monogeneans from eight selected species: Dactylogyrus primarius Gusev, 1955; Pellucidhaptor merus Zaika, 1961; Dactylogyrus falcatus Wedl, 1857; Dactylogyrus vastator Nybelin, 1924; Dactylogyrus pterocleidus Gusev, 1955; Dactylogyrus falciunguis Achmerow, 1952; Chauhanellus auriculatum Lim, 1994; and Chauhanellus caelatus Lim, 1994. The pipeline is driven by an ANN that detects the locations of landmark points on an input 2D illustration and automatically aligns them with the vertices of a generic 3D model, enabling deformation of its 3D shape to produce the target 3D model.

Results

The results obtained from the development of the automated 3D modelling pipeline are presented in the following sections.

Synthetic 2D illustrations

The output of the data preparation comprises three sets of 1000, 2500 and 5000 synthetic 2D illustrations generated by the semi-automated augmentation program, together with the NoSQL database developed to store the details of the synthetic 2D illustrations. All three sets of synthetic illustrations of the monogenean anchor were synthesized based on eight different categories of morphological shape variants to cater for training the machine learning model to detect the landmark points on the input illustrations of all the selected monogenean anchors. Selected samples of the synthetic illustrations for each category of anchor shape are shown in Fig. 1a-h. The image size of all synthetic illustrations was standardized at 96 × 96 pixels. The synthetic 2D illustrations, as exemplified in Fig. 1a-h, are randomly scaled and rotated to ensure that a wide range of morphological shape variants is available for training a more generalized machine learning model.

Fig. 1

Selected samples of synthetic 2D illustrations. a Shape Category 1, (b) Shape Category 2, (c) Shape Category 3, (d) Shape Category 4, (e) Shape Category 5, (f) Shape Category 6, (g) Shape Category 7, (h) Shape Category 8

NoSQL database

In this study, a NoSQL database using MongoDB was developed to store the details of all the synthetic illustration training sets, such as the illustration name, the pixel values of each illustration, and the 2D coordinates of the 34 point primitives (Fig. 2). The details of the training sets are presented as five key-value pairs per record (Fig. 3a). The "_id" key is auto-generated by MongoDB whenever a new record is created. The "name" and "shape" keys hold the illustration name and the shape category as a string and a number, respectively. The 2D coordinates of the 34 point primitives are stored in the "landmark" key and represented as coordinate arrays (Fig. 3b). The pixel values of each illustration record are held by the "pixels" key and represented as an array of integers ranging from 0 to 255 (Fig. 3c).
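For illustration, a single record under this five-key schema might look like the following sketch (written here as a Python dictionary; all field values are hypothetical and the arrays are truncated):

    # Hypothetical example of one training record following the
    # five key-value structure described above; values are illustrative.
    record = {
        "_id": "5dd2...",                # auto-generated by MongoDB (shortened here)
        "name": "anchor_cat1_0001",      # illustration name (hypothetical)
        "shape": 1,                      # shape category (1-8)
        "landmark": [[12.5, 30.1], [14.2, 28.7]],  # first 2 of the 34 (x, y) pairs
        "pixels": [255, 254, 250],       # first 3 of the 9216 grayscale values (0-255)
    }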

Fig. 2

MongoDB Compass view showing details of the stored training sets in the NoSQL database

Fig. 3

Example of data stored in the NoSQL database. a Five key-value pairs, (b) 2D coordinates of point primitives represented as a coordinate array, (c) Pixel values of a synthetic illustration represented as an integer array

Evaluation of machine learning model based on three different datasets

The training, validation and test accuracies of the machine learning models trained on the three datasets of different sizes are presented in Table 1. The table shows that the model trained on 5000 samples yields the highest test accuracy. Hence, this model was adopted for landmark detection on the input illustrations of the eight monogenean species.

Table 1 Evaluation of the trained ANN models based on three different dataset sizes

Landmark point detection on the input illustrations of monogenean anchors using the machine learning model

The machine learning model trained on the 5000 samples was tested on the anchors of the eight selected monogenean species, each with different shape features, to examine its versatility in detecting landmark points on anchors of diverse shapes (Fig. 4). In general, Fig. 4 shows favorable landmark detection results, with most of the landmark points (red dots) localized along the edges of the input 2D illustrations of the monogenean anchors.

Fig. 4

Machine-detected landmarks on 2D illustrations of eight selected monogenean anchors from Gusev [43] and Lim [44]. a Dactylogyrus primarius (b) Pellucidhaptor merus (c) Dactylogyrus falcatus. d Dactylogyrus vastator (e) Dactylogyrus pterocleidus (f) Dactylogyrus falciunguis (g) Chauhanellus auriculatum (h) Chauhanellus caelatus

3D models of monogenean anchors derived from deformation of the generic 3D model

In this section, the 3D models of monogenean anchors derived from the automated 3D modelling pipeline are presented. Figure 5 shows the group of 3D models (Fig. 5b-q) produced by the pipeline. In general, the overall body shape of each resulting 3D model resembles the 2D shape presented in its corresponding 2D illustration. Nevertheless, the pipeline does not work very well on modelling the extrusion parts of a few anchors (Fig. 5k, o). A closer inspection of the shape discrepancy in those anchors is shown in Fig. 6. The pipeline shows its limitation in deriving the fan-like extrusion structure found in the anchors of D. pterocleidus and C. auriculatum.

Fig. 5

Generic 3D model and the input illustrations of anchors along with their corresponding 3D models (coloured). a Generic 3D anchor model. b-c Dactylogyrus primarius. d-e Pellucidhaptor merus. f-g Dactylogyrus falcatus. h-i Dactylogyrus vastator. j-k Dactylogyrus pterocleidus. l-m Dactylogyrus falciunguis. n-o Chauhanellus auriculatum. p-q Chauhanellus caelatus

Fig. 6

Discrepancy of 3D shape found in the extrusion part of (a) Dactylogyrus pterocleidus and (b) Chauhanellus auriculatum

Evaluation of the efficiency of the automated 3D modelling pipeline

The box plots show the distribution patterns of the control points and the predicted landmark points for each of the target monogenean anchors (Figs. 7, 8, 9, 10, 11, 12, 13 and 14). For each anchor, the distributions of the X coordinates (panel b of Figs. 7, 8, 9, 10, 11, 12, 13 and 14) and the Y coordinates (panel c of the same figures) are plotted separately. As shown in Figs. 7, 8, 9, 10, 11, 12, 13 and 14, all the box plots show similar distributions for the control points and the predicted landmark points.

Fig. 7

Comparison of control points and predicted points on 2D illustration of Dactylogyrus primarius. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 8

Comparison of control points and predicted points on 2D illustration of Pellucidhaptor merus. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 9

Comparison of control points and predicted points on 2D illustration of Dactylogyrus falcatus. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 10

Comparison of control points and predicted points on 2D illustration of Dactylogyrus vastator. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 11

Comparison of control points and predicted points on 2D illustration of Dactylogyrus pterocleidus. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 12

Comparison of control points and predicted points on 2D illustration of Dactylogyrus falciunguis. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 13

Comparison of control points and predicted points on 2D illustration of Chauhanellus auriculatum. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Fig. 14

Comparison of control points and predicted points on 2D illustration of Chauhanellus caelatus. a Positions of control points (small green markers) and predicted points (red markers) on 2D illustration. b Boxplot for control point-X and predicted point-X (c) Boxplot for control point-Y and predicted point-Y

Discussion

In this study, an automated 3D modelling pipeline empowered by a machine learning algorithm was developed. The box plots in Figs. 7, 8, 9, 10, 11, 12, 13 and 14 demonstrate the capability of the pipeline to predict landmark points whose distribution follows a similar pattern to that of the control points. These distribution patterns indicate that the predicted landmark points can be used to automate the deformation of the generic 3D model and obtain favorable 3D anchor shapes for the eight selected monogenean species (Fig. 5). The box plots thus corroborate the visual examination in Fig. 5, showing that the resulting 3D models generally fit most parts of the 2D outline presented in the input illustrations of the anchors.

The automated 3D modelling pipeline developed in this study presents a user-friendly way to automate the deformation of the generic 3D model into a target 3D anchor using a 2D illustration as input. The entire pipeline offers a very straightforward way to construct a target 3D model with minimal human intervention. The approach also improves on the 3D modelling work published in Teo et al. [8] by obviating the use of the comprehensive 3D modelling software, Autodesk 3ds Max, to create a 3D model from scratch. To the best of our knowledge, this is the first reported application of machine learning to landmark point detection for modelling a biological specimen.

However, the automated 3D modelling pipeline does not work very well on deriving 3D models of anchors with intricate structures, such as the extrusions found in D. pterocleidus and C. auriculatum and the hook in C. caelatus (Fig. 6). This distortion of shape arises because these morphological structures (e.g. extrusions and hooks) have a relatively small surface area controlled by a relatively large number of point primitives compared with other regions of the anchor. Consequently, a slight deviation of the predicted coordinates of those point primitives from the control points causes an obvious distortion of shape.

The machine learning model was trained on synthetic 2D illustrations of anchors that are laterally positioned and standardized in size and color to 96 × 96 pixels in grayscale. Hence, the model cannot effectively detect landmark points on an unprocessed "wild" image for auto-deformation of the generic 3D model. One must preprocess the input 2D illustration before it can be fed into the automated 3D modelling pipeline to derive a target 3D model. Moreover, using more than 5000 synthetic illustrations as training sets was not feasible in this study due to the limitations of the current infrastructure. As a result, the test accuracy of the trained model is limited to approximately 80%.
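A minimal sketch of such a preprocessing step, assuming the Pillow and NumPy libraries (the function name is ours, not part of the published system), could be:

    # Hedged sketch: normalize an input illustration to the format the
    # model was trained on (96 x 96 pixels, grayscale, values in [0, 1]).
    import numpy as np
    from PIL import Image

    def preprocess(path):
        """Load an illustration, convert it to grayscale, resize it to
        96 x 96 pixels, and flatten it for the network input layer."""
        img = Image.open(path).convert("L").resize((96, 96))
        return np.asarray(img, dtype=np.float32).reshape(1, 9216) / 255.0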

As part of future development, the entire project will be migrated to a cloud computing platform, where the growing number of training samples can be stored on a scalable cloud server and the powerful computing engines of the cloud platform can be used to build and deploy the machine learning model for the automated 3D modelling pipeline [32]. In the long term, migration to a cloud platform is an essential move to scale up the development of the pipeline presented in this study. Finally, a generic 3D model will be created for each of the monogenean hard parts, and the training set preparation and machine learning process will be repeated to train a more versatile model to detect landmark points on input 2D illustrations of the different monogenean hard parts.

Conclusions

In this study, an automated 3D modelling pipeline was created using a machine learning technique, namely an Artificial Neural Network. The aim of this pipeline is to deform a digital generic 3D model of a monogenean anchor into diverse desired 3D anchor forms in an automated way. Despite some constraints and limitations as discussed above, the pipeline demonstrates a working application of machine learning to 3D modelling. Moreover, its development was streamlined with methods that address typical issues encountered in machine learning-related studies, such as shortage of training data and the management of large data storage. In short, this study has not only developed an automated 3D modelling pipeline but has also demonstrated a cross-disciplinary research design that integrates machine learning into a specific domain of study, namely 3D modelling of biological structures.

Methods

Prior to developing the automated 3D modelling pipeline, a digital generic 3D model of an anchor was developed using the geometric 3D modelling method presented in a previous study [33]. This generic 3D model is used as a 3D template from which target 3D shapes are derived at a later stage. The development of the pipeline is presented in the following sub-sections.

Data preparation for machine learning

In this study, the training data for machine learning are 2D illustrations of monogenean anchors. A well-trained machine learning model requires a large amount of training data, ranging from hundreds to millions of samples, to minimize the possibility of learning misleading or irrelevant patterns in the training data [34]. However, the 2D illustrations of monogenean anchors that could be extracted from existing publications were limited in number, and manually extracting them to prepare the training set was tedious.

Hence, a semi-automated data augmentation approach was developed to generate sequences of synthetic 2D illustrations of monogenean anchors. The core of this approach is a 2D shape interpolation algorithm that generates a sequence of interpolated anchor shapes, used as the synthetic illustrations, by morphing a source shape into a user-defined target shape. Pseudocode based on this algorithm was drafted (see Additional file 1) and used as a blueprint to develop a data augmentation program (Fig. 15). The program was designed to generate 24 interpolated 2D shapes from one 2D source shape at a time.
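The pseudocode itself is provided in Additional file 1; as a rough illustration of the underlying idea, interpolating between a source and a target point set could be sketched as follows (a minimal reconstruction assuming straight linear blending, not necessarily the authors' exact algorithm):

    # Minimal sketch of 2D shape interpolation by linear blending of
    # point primitives; the actual pseudocode is in Additional file 1.
    import numpy as np

    def interpolate_shapes(source, target, steps=24):
        """Yield `steps` intermediate shapes between two (34, 2) arrays
        of point primitives by linearly blending their coordinates."""
        source, target = np.asarray(source), np.asarray(target)
        for i in range(1, steps + 1):
            t = i / (steps + 1)                  # interpolation parameter in (0, 1)
            yield (1.0 - t) * source + t * target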

Fig. 15

Data augmentation program to generate 2D illustrations of monogenean anchors

The procedure for using the data augmentation program to synthesize 2D illustrations is as follows:

  1. A 2D illustration of a monogenean anchor extracted from a selected publication was used as the source shape and loaded into the data augmentation program.

  2. 2D point primitives on the source shape were selected individually and moved to new positions in the 2D Cartesian coordinate space to obtain a morphological variant shape, which was used as the target shape (Fig. 16a).

  3. The "Generate 2D Samples" button was activated to morph the source shape into the target shape obtained in Step 2. While the morphing was in progress, a total of 24 interpolated 2D shapes were captured and displayed in the grid cells (Fig. 16b). An enlarged, cropped view of the 24 interpolated 2D shapes generated from the source shape is presented in Fig. 17.

  4. Next, the "Convert to SVG" button was activated to export the captured interpolated 2D shapes as SVG images, which were then converted into JPEG format (Fig. 16c).

  5. Finally, the "Record to Database" button was activated to store the pixel values, along with the associated 2D coordinates of the constituent point primitives of each synthetic 2D illustration, in a database (Fig. 16d).

Fig. 16

Workflow to generate synthetic 2D illustrations using data augmentation program. a Deform a source shape to a target shape, (b) Morph source shape to target shape, (c) Export 24 interpolated shapes to SVG images which will then undergo batch processing using Adobe Photoshop to convert them into JPEG format, (d) Store pixel values and associated 2D coordinates of each synthetic 2D illustration into database

Fig. 17

Twenty-four interpolated 2D shapes

The data augmentation procedure described above was repeated to generate three sets of synthetic 2D illustrations consisting of 1000, 2500 and 5000 samples, respectively. Each set was used to train a machine learning model following the supervised learning process described below, and the accuracy of each trained model was evaluated. This evaluation determines the dataset size that yields a model versatile enough to detect the landmark points on all types of target monogenean anchors in this study.

Data storage

A NoSQL database was developed using MongoDB (https://www.mongodb.com/) to store the training sets in this study. The development of the database began with the definition of a data model (Fig. 18) composed of three documents: the "2D illustration document", the "Pixels document" and the "Landmarks document". In the MongoDB context, a collection is a container for structurally or conceptually similar documents, whereas a document is the basic unit of data in MongoDB, encapsulating a group of related properties along with their associated values in a JSON object. In the data model, the "2D illustration document" was defined as a parent node holding all the relevant 2D illustration information in a JSON object with four properties: "name", "shapeID", "pixels" and "landmarks". The "name" property holds an illustration identifier of at most 100 characters, whereas "shapeID" is a numerical property holding the type of 2D shape. The "pixels" and "landmarks" properties hold the "Pixels document" and "Landmarks document" respectively; these can be considered two child nodes of the parent "2D illustration document". The "Pixels document" has a single property holding an integer array of pixel values, whereas the "Landmarks document" has two properties, "coordinate_X" and "coordinate_Y", holding arrays of 2D coordinates of the constituent point primitives that form a synthetic illustration.
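Under this data model, inserting one training record could look like the following sketch (assuming the pymongo driver; the database and collection names are hypothetical and the field values are dummies):

    # Hedged sketch of inserting one training record according to the
    # data model above, assuming the pymongo driver; values are dummies.
    from pymongo import MongoClient

    client = MongoClient("mongodb://localhost:27017/")
    collection = client["monogenean"]["illustrations"]  # hypothetical names

    document = {
        "name": "anchor_cat1_0001",          # illustration identifier (<= 100 chars)
        "shapeID": 1,                        # type of 2D shape
        "pixels": {"values": [0] * 9216},    # embedded "Pixels document"
        "landmarks": {                       # embedded "Landmarks document"
            "coordinate_X": [12.5] * 34,
            "coordinate_Y": [30.1] * 34,
        },
    }
    collection.insert_one(document)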

Fig. 18
figure 18

NoSQL data model for the illustration training set

Machine learning model

An Artificial Neural Network (ANN) was trained to enable landmark detection on an input illustration. The ANN was based on a multilayer perceptron architecture adapted from a landmark detection study [35]. The input layer consists of 9216 nodes, holding the array of 96 × 96 pixel values of each 2D illustration in the training set. The number of hidden layers and the number of nodes in each hidden layer were set by trial and error, starting with a single hidden layer and multiples of 128 nodes (128, 256 and 512) per layer. In the end, a network architecture comprising two hidden layers with 512 and 128 nodes, respectively, was adopted as it generally showed decent landmark detection performance. In the hidden layers, ReLU was used as the activation function because it is a popular choice for regression problems that predict positive continuous values [34, 36]. The output layer holds 68 positive continuous values denoting the predicted 2D coordinates (x, y) of the 34 point primitives. These predicted point primitives are regarded as the landmark points in the context of this study, and the trained network is aimed at detecting them on an input 2D illustration.
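The study adapted its setup from a Keras + TensorFlow tutorial [35]; in Keras terms, the architecture described above corresponds roughly to the following sketch (our reconstruction from the text, not the authors' published code):

    # Sketch of the described multilayer perceptron, assuming the Keras
    # API bundled with TensorFlow.
    from tensorflow.keras.layers import Dense
    from tensorflow.keras.models import Sequential

    model = Sequential([
        Dense(512, activation="relu", input_shape=(9216,)),  # hidden layer 1
        Dense(128, activation="relu"),                       # hidden layer 2
        Dense(68),  # 2D coordinates (x, y) of the 34 point primitives
    ])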

The ANN was trained via a supervised learning process that maps the pixel values of each synthetic illustration retrieved from the MongoDB database to its associated 2D coordinates of constituent point primitives (Fig. 19). Before training, all samples were divided into a training set (80% of samples) and a test set (20% of samples). The test set contains independent samples used to evaluate the accuracy of the trained model at a later stage. In addition, 30% of the samples in the training set were set aside for model validation during the training process.

Fig. 19

Supervised learning to map pixel values of illustrations (from four selected samples) to each of their associated 2D coordinates of constituent point primitives

The ANN was trained for 200 epochs in mini-batches of 20 samples, using Mean Squared Error (MSE) as the loss function and Stochastic Gradient Descent (SGD) as the optimizer to repeatedly update the weight parameters of the preliminary machine learning models. The following hyperparameters, adapted from [35], were used to configure the optimizer (a training sketch is given after the list):

  a. Learning rate = 0.01

  b. Momentum = 0.9

  c. Decay rate = 0

  d. Nesterov momentum = True
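Putting the split, loss, optimizer and hyperparameters together, the training step could be sketched as follows (a hedged reconstruction assuming `model` from the earlier sketch; scikit-learn's train_test_split stands in for the 80/20 division, and the dummy arrays stand in for data retrieved from the database):

    # Hedged sketch of the training configuration described above.
    import numpy as np
    from sklearn.model_selection import train_test_split
    from tensorflow.keras.optimizers import SGD

    # Dummy stand-ins: X holds (n, 9216) pixel arrays and y holds
    # (n, 68) landmark coordinates retrieved from the MongoDB database.
    X, y = np.random.rand(100, 9216), np.random.rand(100, 68)

    # 80% training / 20% test split.
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

    # SGD with the listed hyperparameters (a decay rate of 0 is the default).
    optimizer = SGD(learning_rate=0.01, momentum=0.9, nesterov=True)
    model.compile(optimizer=optimizer, loss="mean_squared_error")

    # 200 epochs, mini-batches of 20, and 30% of the training set held
    # out for validation during training.
    history = model.fit(X_train, y_train, epochs=200, batch_size=20,
                        validation_split=0.3)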

Formation of automated 3D modelling pipeline

In this step, a 3D modelling system composed of three modules was developed (Fig. 20). The first module reads and processes the input illustration, translating it into an array of normalized pixel values (Fig. 21a). The second module detects the 34 landmark points on the input illustration using the trained model (Fig. 21b). The third module deforms the digital generic 3D model by aligning each constituent point primitive of the generic model with the corresponding landmark point signified by the predicted coordinate values from the second module (Fig. 21c). These three modules, together with the trained machine learning model, work in concert to form an automated 3D modelling pipeline which can automatically deform the digital generic 3D model into a target 3D shape from a 2D illustration input.
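Conceptually, the three modules chain together as in the following sketch (a hedged reconstruction; `deform_generic_model` is a hypothetical stand-in for the deformation routine, and `ann_model` is the trained network):

    # Hedged sketch of the three-module pipeline: input processing,
    # landmark detection, and deformation of the generic 3D model.
    import numpy as np
    from PIL import Image

    def deform_generic_model(generic_model, landmarks):
        """Hypothetical stand-in: align the generic model's constituent
        point primitives with the predicted landmark coordinates."""
        ...

    def run_pipeline(illustration_path, ann_model, generic_model):
        """Derive a target 3D anchor model from a 2D illustration."""
        # Module 1: input processing - normalize to the trained format.
        img = Image.open(illustration_path).convert("L").resize((96, 96))
        pixels = np.asarray(img, dtype=np.float32).reshape(1, 9216) / 255.0
        # Module 2: landmark detection with the trained ANN.
        landmarks = ann_model.predict(pixels).reshape(34, 2)
        # Module 3: deform the generic 3D model towards the landmarks.
        return deform_generic_model(generic_model, landmarks)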

Fig. 20

3D modelling system design

Fig. 21

Automated 3D modelling pipeline. a Input processing, (b) Landmark localization, (c) Deformation of generic 3D model

Pseudocode featuring the operating logic of the automated 3D modelling pipeline was drafted (see Additional file 2) and used as a blueprint to develop the 3D modelling system, which is a full-stack web application (Fig. 22). The procedure for using the developed system to run the automated 3D modelling pipeline is as follows:

  1. A 2D illustration of a selected monogenean anchor was uploaded onto the system interface by activating the "Choose File" button and then selecting a 2D illustration as the target shape (Fig. 23a).

  2. Next, the "Deform Model" button was activated (Fig. 23b) to pass the input 2D illustration to the back-end components of the system, where it went through the entire 3D modelling pipeline (input processing, landmark detection and deformation of the generic model).

Fig. 22

The automatic 3D modelling system

Fig. 23

Implementation of the automated 3D modelling pipeline. a Upload 2D illustration, (b) Generate 3D model through the automated 3D modelling pipeline

The procedure described above was repeated to generate the 3D models of the eight selected monogenean anchors through the automated 3D modelling pipeline; the results are presented in the Results section.

3D model evaluation

The resulting 3D models were evaluated by a quantitative analysis in which box plots were plotted to compare the distributions of the control points and the predicted landmark points for each target monogenean anchor. The shape of the target 3D model produced by the automated 3D modelling pipeline depends on the landmark points predicted by the machine learning model that empowers the 3D modelling system (Fig. 24). Ideally, all predicted landmark points should be positioned along the edge of the 2D illustration, resulting in an optimal 3D shape that matches the target shape presented in the input 2D illustration.

Fig. 24

Dependency of the target 3D shape on predicted landmark point position

Based on this understanding, the following steps were carried out:

  1. A sequence of control points was manually annotated at the ideal positions along the edge of an input 2D illustration (Fig. 25a).

  2. The coordinates of the control points were extracted from the Properties Window in Adobe Photoshop and stored in a CSV file (Fig. 25b).

  3. The same input 2D illustration was fed into the 3D modelling system, and the landmark point coordinates predicted by the machine learning model were collected and stored in a CSV file (Fig. 25c).

  4. The X and Y coordinates of the control points and the predicted landmark points were loaded to create two box plots (Fig. 25d; see the sketch after this list). The disparity pattern between the control points and the predicted landmark points was examined and analyzed.

  5. Steps 1-4 were repeated for all eight input 2D illustrations used for creating the target 3D anchors.
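The study used Microsoft Excel for step 4; an equivalent sketch in Python could look as follows (the random arrays are dummy stand-ins for the two CSV files of 34 points each produced in steps 2 and 3):

    # Hedged sketch of step 4: box plots comparing the coordinate
    # distributions of control points and predicted landmark points.
    import matplotlib.pyplot as plt
    import numpy as np

    # Dummy stand-ins; in practice these would be loaded with
    # np.loadtxt from the CSV files produced in steps 2 and 3.
    control = np.random.rand(34, 2) * 96
    predicted = control + np.random.randn(34, 2)

    fig, (ax_x, ax_y) = plt.subplots(1, 2)
    ax_x.boxplot([control[:, 0], predicted[:, 0]])
    ax_x.set_xticklabels(["control X", "predicted X"])
    ax_y.boxplot([control[:, 1], predicted[:, 1]])
    ax_y.set_xticklabels(["control Y", "predicted Y"])
    plt.show()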

Fig. 25

Procedure for creating box plots to examine the disparity pattern between control points and landmark points. a Manual annotation of control points on an input 2D illustration. b Extract coordinates of control points from the Properties Window in Adobe Photoshop. c Collect predicted landmark points from the machine learning model. d Create two box plots using Microsoft Excel

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ANN:

Artificial Neural Network

CNN:

Convolutional Neural Network

RNN:

Recurrent Neural Network

SEM:

Scanning Electron Microscope

References

  1. Galli P, Strona G, Villa AM, Benzoni F, Doglia SM, Kritsky DC. Three-dimensional imaging of monogenoidean sclerites by laser scanning confocal fluorescence microscopy. J Parasitol. 2006;92(2):395–9.

  2. Hodová I, Sonneck R, Gelnar M. Architecture of Paradiplozoon homoion: a diplozoid monogenean exhibiting highly-developed equipment for ectoparasitism. PLoS One. 2018;13(2):e0192285.

  3. Fannes W, Vanhove MPM, Huyse T. Redescription of Cichlidogyrus tiberianus Paperna, 1960 and C. dossoui Douëllou, 1993 (Monogenea: Ancyrocephalidae), with special reference to the male copulatory organ. Syst Parasitol. 2017;94:133–44.

  4. Antonelli L, Quilichini Y, Marchand B. Biological study of Furnestinia echeneis Euzet and Audouin 1959 (Monogenea: Monopisthocotylea: Diplectanidae), parasite of cultured Gilthead Sea bream Sparus aurata (Linnaeus 1758) (Pisces: Teleostei) from Corsica. Aquaculture. 2010;307:179–86.

  5. Hadied MA, Khidr AA, Samak OA, Said A. Surface topography of the anterior adhesive apparatus of the gill monogenean parasite Diplectanum sp. Diesing, 1858, with some surface criteria. Fish Aquac J. 2015;6(4):150.

  6. Poddubnaya LG, Hemmingsen W, Gibson DI. Ultrastructural observations of the attachment organs of the monogenean Rajonchocotyle emarginata (Olsson, 1876) (Polyopisthocotylea: Hexabothriidae), a gill parasite of rays. Parasitol Res. 2016;115(6):2285–97.

  7. Kremer A, Lippens S, Bartunkova S, Asselbergh B, Blanpain C, Fendrych M, et al. Developing 3D SEM in a broad biological context. J Microsc. 2015;259(2):80–96.

  8. Teo BG, Sarinder KKS, Lim LHS. A novel alternative method for 3D visualization in parasitology: the construction of a 3D model of a parasite from 2D illustrations. Trop Biomed. 2010;27(2):254–64.

  9. Murakawa J, Yoon I, Hong T, Lank E. Parts, image, and sketch based 3D modelling method. In: Proceedings of the Third Eurographics Conference on Sketch-Based Interfaces and Modelling; 2006. p. 67–74.

  10. Ranjan R, Sankaranarayanan S, Castillo CD, Chellappa R. An all-in-one convolutional neural network for face analysis. In: Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017); 2017. p. 17–2.

  11. Zhang Z, Luo P, Loy CC, Tang X. Learning deep representation for face alignment with auxiliary attributes. IEEE Trans Pattern Anal Mach Intell. 2016;38(5):918–30.

  12. Chen Y, Yang J, Qian J. Recurrent neural network for facial landmark detection. Neurocomputing. 2017;219:26–38.

  13. Gondhi NK, Kour N, Effendi S, Kaushik K. An efficient algorithm for facial landmark detection using haar-like features coupled with corner detection following anthropometric constraints. In: Proceedings of the 2nd International Conference on Telecommunication and Networks (TEL-NET); 2017. p. 1–6.

  14. Fan H, Zhou E. Approaching human level facial landmark localization by deep learning. Image Vis Comput. 2016;47:27–35.

  15. Deng W, Fang Y, Xu Z, Hu J. Facial landmark localization by enhanced convolutional neural network. Neurocomputing. 2018;273:222–9.

  16. Zaghbani S, Boujneh N, Bouhlel MS. Age estimation using deep learning. Comput Electr Eng. 2018;68:337–47.

  17. Jia S, Cristianini N. Learning to classify gender from four million images. Pattern Recogn Lett. 2015;58:35–41.

  18. Ferrari C, Lisanti G, Berretti S. A dictionary learning-based 3D morphable shape model. IEEE Trans Multimedia. 2017;19(12):2666–79.

  19. Vandaele R, Aceto J, Muller M, Péronnet F, Debat V, Wang C-W, et al. Landmark detection in 2D bioimages for geometric morphometrics: a multi-resolution tree-based approach. Sci Rep. 2018;8(1):538.

  20. Bromiley PA, Lindner C, Thomson J, Wrigley M, Cootes TF. Multi-point regression voting for shape model matching. Procedia Comput Sci. 2016;90:48–53.

  21. Zhang J, Gao Y, Gao Y, Munsell BC, Shen D. Detecting anatomical landmarks for fast Alzheimer's disease diagnosis. IEEE Trans Med Imaging. 2016;35(12):2524–33.

  22. Prentašić P, Lončarić S. Detection of exudates in fundus photographs using deep neural networks and anatomical landmark detection fusion. Comput Methods Prog Biomed. 2016;137:281–92.

  23. Lovato C, Castellani U, Zancanaro C, Giachetti A. Automatic labelling of anatomical landmarks on 3D body scans. Graph Model. 2014;76:648–57.

  24. Krizhevsky A, Sutskever I, Hinton G. ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems, Volume 1; 2012. p. 1097–105.

  25. Wu Y, Hassner T, Kim K, Medioni G, Natarajan P. Facial landmark detection with tweaked convolutional neural networks. IEEE Trans Pattern Anal Mach Intell. 2017;40(12):3067–74.

  26. Lim LHS, du Preez LH. Sundapolystoma chalconotae n.g., n.sp. (Monogenea: Polystomatidae) from Rana chalconota (Schlegel) of Peninsular Malaysia. Syst Parasitol. 2001;49:223–31.

  27. du Preez LH, Lim LHS. Neopolystoma liewi sp. n. (Monogenea: Polystomatidae) from the eye of the Malayan box turtle (Cuora amboinensis). Folia Parasitol. 2000;47:11–6.

  28. Lim LHS. Neocalceostoma Tripathi, 1957 and Neocalceostomoides Kritsky, Mizelle & Bilqees, 1978 (Monogenea: Neocalceostomatidae n. fam.) from ariid fishes of peninsular Malaysia. Syst Parasitol. 1995;30:141–51.

  29. Lim LHS. Eight new species of Hamatopeduncularia Yamaguti, 1953 (Monogenea: Ancyrocephalidae) from Ariidae of peninsular Malaysia. Syst Parasitol. 1996;33:53–71.

  30. Lim LHS, Gibson DI. Diplectanocotyla Yamaguti, 1953 (Monogenea: Diplectanoidea) from Megalops cyprinoides (Broussonet) (Teleostei: Megalopidae) off peninsular Malaysia. Syst Parasitol. 2007;67:101–17.

  31. Yamaguti S. Systema Helminthum, Volume IV: Monogenea and Aspidocotylea. London: Interscience Press; 1963.

  32. Wright E. Microsoft Azure IaaS solutions. Sebastopol, CA: O'Reilly Media; 2018.

  33. Teo BG, Dhillon SK, Lim LHS. A deformable generic 3D model of haptoral anchor of monogenean. PLoS One. 2013;8(10):e77650.

  34. Chollet F. Deep learning with Python. New York, NY: Manning Publications; 2017.

  35. Achieving top 23% in Kaggle's Facial Keypoints Detection with Keras + TensorFlow. https://fairyonice.github.io/achieving-top-23-in-kaggles-facial-keypoints-detection-with-keras-tensorflow.html. Accessed 20 June 2018.

  36. Pal S, Gulli A. Deep learning with Keras: implement various deep-learning algorithms in Keras and see how deep-learning can be used in games. Birmingham, UK: Packt Publishing; 2017.


Acknowledgements

The authors are thankful to the late Prof. Susan Lim Lee Hong, who conceived the project idea.

About this supplement

This article has been published as part of BMC Bioinformatics, Volume 20 Supplement 19, 2019: 18th International Conference on Bioinformatics. The full contents of the supplement are available at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-20-supplement-19

Funding

No funding was obtained for this study.

Author information


Contributions

TBG designed the framework of the automated 3D modelling system. SKD refined the framework by including a validation mechanism to examine the effectiveness of the system. TBG drafted the manuscript and SKD did a substantial revision of the manuscript. Both authors have read and approved the final manuscript.

Corresponding author

Correspondence to Sarinder Kaur Dhillon.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

Pseudocode for 2D shape interpolation algorithm.

Additional file 2.

Pseudocode for the development of automatic 3D modelling system.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.


About this article


Cite this article

Teo, B.G., Dhillon, S.K. An automated 3D modeling pipeline for constructing 3D models of MONOGENEAN HARDPART using machine learning techniques. BMC Bioinformatics 20 (Suppl 19), 658 (2019). https://doi.org/10.1186/s12859-019-3210-x

