Fig. 5 Diagram of the Faster R-CNN framework
The defect image fed into the model passes through 13 CONV layers, 13
ReLU layers and 4 pooling layers. This stack of basic CONV+ReLU+pooling
layers extracts the feature map of the surface defect image, which is then
shared by the subsequent RPN layers and fully connected layers.
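
To make the layer counts concrete, the following minimal sketch traces how the spatial size of an input image shrinks as it passes through the backbone. The text does not give the exact layer ordering, kernel sizes, or channel widths, so the sketch assumes a VGG16-style layout (3x3 convolutions with stride 1 and padding 1, 2x2 pooling with stride 2, and the usual 64/128/256/512 channel widths). The names `LAYOUT`, `conv_out`, `pool_out` and `feature_map_shape` are hypothetical and used only for illustration.

```python
# Assumed VGG16-style layout (not stated in the text): numbers are the output
# channels of a 3x3 CONV layer (each followed by ReLU), "P" is a 2x2 pooling layer.
LAYOUT = [64, 64, "P", 128, 128, "P", 256, 256, 256, "P",
          512, 512, 512, "P", 512, 512, 512]


def conv_out(size, kernel=3, stride=1, padding=1):
    """Spatial size after a convolution (assumed 3x3, stride 1, padding 1)."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, kernel=2, stride=2):
    """Spatial size after pooling (assumed 2x2, stride 2, no padding)."""
    return (size - kernel) // stride + 1


def feature_map_shape(height, width, layout=LAYOUT):
    """Return (channels, height, width) of the final feature map."""
    channels = 3  # assumed RGB input
    for item in layout:
        if item == "P":
            height, width = pool_out(height), pool_out(width)
        else:
            # ReLU is element-wise, so it does not change the shape.
            height, width = conv_out(height), conv_out(width)
            channels = item
    return channels, height, width


n_conv = sum(1 for item in LAYOUT if item != "P")
n_pool = LAYOUT.count("P")
print(n_conv, n_pool)                # 13 4
print(feature_map_shape(600, 800))   # (512, 37, 50)
```

Under these assumptions the convolutions preserve the spatial size and each pooling layer halves it, so the four pooling layers reduce the input resolution by a factor of 16. The RPN and the fully connected layers then operate on this reduced feature map rather than on the original image.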