Fig. 5 The diagram of Faster R-CNN framework
The defect image of the input model is extracted by 13 CONV layers, 13 ReLU layers and 4 pooling layers. The feature map of the surface defect image is extracted using a set of basic CONV+ReLU+pooling layers for the subsequent RPN layers and fully connected layers.