Bilinear pooling 10 is proposed to obtain rich and orderless global representation for the last convolutional feature, which achieved the state-of-the