Computational Analysis and Deep Learning for Medical Care. Группа авторов
Чтение книги онлайн.

Читать онлайн книгу Computational Analysis and Deep Learning for Medical Care - Группа авторов страница 12

СКАЧАТЬ SENet, MobileNet V1/V2, and DenseNet. It also deals with the study of the parameters and components associated with the models in detail. The second section discusses the application of these models to segment IVD from the spine image. Finally, theoretically performance and experimental results of the state-of-art of the literature shows that 2.5D multi-scale FCN performs the best with the Dice Similarity Index (DSC) of 90.64%.

      Keywords: CNN, deep learning, intervertebral disc degeneration, MRI segmentation

      The concept of Convolutional Neural Network (CNN) was introduced by Fukushima. The principle in CNN is that the visual mechanism of human is hierarchical in structure. CNN has been successfully applied in various image domain such as image classification, object recognition, and scene classification. CNN is defined as a series of convolution layer and pooling layer. In the convolution layer, the image is convolved with a filter, i.e., slide over the image spatially and computing dot products. Pooling layer provides a smaller feature set.

      One major cause of low back pain is disc degeneration. Automated detection of lumbar abnormalities from the clinical scan is a burden for radiologist. Researchers focus on the automation task of the segmentation of large set of MRI data due to the huge size of such images. The success of the application of CNN in various field of object detection enables the researchers to apply various models for the detection of Intervertebral Disc (IVD) and, in turn, helps in the diagnosis of diseases.

      The details of the structure of the remaining section of the paper are as follows. The next section deals with the study of the various CNN models. Section 1.3, presents applications of CNN for the detection of the IVD. In Section 1.4, comparison with state-of-the-art segmentation approaches for spine T2W images is carried out, and conclusion is in Section 1.5.

      1.2.1 LeNet-5

      The LeNet architecture was proposed by LeCun et al. [1], and it successfully classified the images in the MNIST dataset. LeNet uses grayscale image of 32×32 pixel as input image. As a pre-processing step the input pixel values are normalized so that white (background) pixel represents a value of 1 and the black (foreground) represents a value of 1.175, which, in turn, speedup the learning task. The LeNet-5 architecture consists of succession of input layer, two sets of convolutional and average pooling layers, followed by a flattening convolutional layer, then two fully connected layers, and finally a softmax classifier.

      (1.1)

      (1.2)

      In the first convolutional layer, number of learning parameters is (5×5 + 1) × 6 = 156 parameters; where 6 is the number of filters, 5 × 5 is the filter size, and bias is 1, and there are 28×28×156 = 122,304 connections. The number of feature map calculation is as follows:

      (1.3)

      (1.4)

      W = 32; H = 32; Fw = Fh = 5; P = 0, and the number of feature map is 28 × 28.

      First pooling layer: W = 28; H = 28; P = 0; S = 2

      (1.5)

СКАЧАТЬ
Sl no. Layer Feature map Feature map size Kernel size Stride Activation Trainable parameters # Connections
1 Image 1 32 × 32 - - - - -
2 C1 6 28 × 28 5 × 5 1 tanh 156 122,304
3 S1 6 14 × 14 2 × 2 2 tanh 12 5,880
4 C2 16 10 × 10 5 × 5 1 tanh 1516 151,600
5 S2 16 5 × 5 2 × 2