Skip to content

Commit 2f8fbf6

Browse files
committed
Update fixres
1 parent 1ac4a71 commit 2f8fbf6

File tree

1 file changed

+4
-5
lines changed

1 file changed

+4
-5
lines changed

paper_notes/fixres.md

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -5,19 +5,18 @@ _February 2020_
55
tl;dr: Conventional imageNet classification has a train/test resolution discrepancy (domain shift).
66

77
#### Overall impression
8-
Scale invariance/equivariance is not guaranteed in CNN (only shift invariance). The same model with different test time input will yield very different statistics.
8+
Scale invariance/equivariance is not guaranteed in CNN (only shift invariance). The same model with different test time input will yield very different statistics. The distribution of activation changes at test time, the values are not in the range that the final cls layers were trained for.
99

1010
In ImageNet training, conventional way is to use 10-time crop (center, four corners, and their mirrors) and test time is always central crop. This leads to a discrepancy of the statistics in training/test.
1111

12-
A similar work is MultiGrain, where the p-pooling is adjusted to match the train/test-time stats.
12+
Simple solution: **finetune last layer** with test time scale and resolution, as the final stage of training.
1313

1414

1515
#### Key ideas
16-
- The distribution of activation changes at test time, the values are not in the range that the final cls layers were trained for.
17-
- Simple solution: **finetune last layer** with test time scale and resolution.
18-
- Larger test crops yields better results.
1916

2017
#### Technical details
18+
- Larger test crops yields better results.
19+
- A similar work is MultiGrain, where the p-pooling is adjusted to match the train/test-time stats.
2120
- GeM (generalized mean pooling) p-pooling: a generalization of average pooling and max pooling
2221
- cf LSE pooling in [From Image-level to Pixel-level Labeling with Convolutional Networks](https://arxiv.org/abs/1411.6228) CVPR 2015
2322
- Image/instance retrieval requires adjusting p-pooling for better accuracy

0 commit comments

Comments
 (0)