How does the AlexNet into FCNs?
FCNs is a network, only contain convolution layers and no FC layer at all. It ' s structure can be shown as the following figures:
This image from the paper: <Fully convolutional Networks for Semantic segmentation> CVPR 2015.
It could locate the location of object target perfectly as shown in above images and it doesn ' t need to resize the Resolut Ion of input images, which is the mostly different from traditional cnns. First, let's review some related network parameters about AlexNet, related structure can be shown as following:
As we can see from the above figure, the input of images must is resized into a fixed resolution, like 224*224, due to the Existance of Fc_layer. The specific pipeline could is found in this blog, Web link: http://blog.csdn.net/sunbaigui/article/details/39938097< /c1>
The output of Conv 5 is:6*6*256, we want to obtain the final results:1*1*1000 (take the 1k classes for an example). How could is the middle Conv 6, Conv 7, Conv 8 layers to bridge the other results? Do we need the pool layers added? How to set the middle parameters in each layers? Does It really work?
Let's do it now.
How does the AlexNet into FCNs?