You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time](https://arxiv.org/abs/2006.16007)[[Notes](paper_notes/monet3d.md)] <kbd>ICML 2020</kbd> [mono3D]
99
+
-[MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time](https://arxiv.org/abs/2006.16007)[[Notes](paper_notes/monet3d.md)] <kbd>ICML 2020</kbd> [Mono3D, pairwise relationship]
100
100
-[Object-Aware Centroid Voting for Monocular 3D Object Detection](https://arxiv.org/abs/2007.09836) <kbd>IROS 2020</kbd> [mono3D]
101
101
-[Center3D: Center-based Monocular 3D Object Detection with Joint Depth Understanding](https://arxiv.org/abs/2005.13423)[mono3D]
102
102
-[SAFENet: Self-Supervised Monocular Depth Estimation with Semantic-Aware
Copy file name to clipboardExpand all lines: paper_notes/monet3d.md
+5-3
Original file line number
Diff line number
Diff line change
@@ -5,12 +5,14 @@ _November 2020_
5
5
tl;dr: Encodes the local geometric consistency (spatial correlation of neighboring objects) into learning.
6
6
7
7
#### Overall impression
8
-
The idea is similar to enforcing certain order in prediction. It learns the second degree of information hidden in the GT labels. It incorporates prior knowledge of geometric locality as regularization in the training module.
8
+
The idea is similar to enforcing certain order in prediction. It learns the second degree of information hidden in the GT labels. It incorporates prior knowledge of geometric locality as regularization in the training module. The mining of pair-wise relationship if similar to [MonoPair](monopair.md).
9
9
10
-
The writing is actually quite bad with heavy use of non-standard terminology.
10
+
The writing is actually quite bad with heavy use of non-standard terminology. No ablation study on the effect of this newly introduced regularization.
11
11
12
12
#### Key ideas
13
-
- Local similarity constraints as additional regularization. If two objects are similar (close-by) in GT, then they should be similar in prediction as well.
13
+
- Local similarity constraints as additional regularization. If two objects are similar (close-by) in GT, then they should be similar in prediction as well.
14
+
- The similarity is defined as $s_{ij} = \exp (-\Delta u_{ij}^2 - \Delta z_{ij}^2/\lambda)$
15
+
- The difference between the output for different vehicles are penalized according to this metric.
Copy file name to clipboardExpand all lines: paper_notes/monopair.md
+3-3
Original file line number
Diff line number
Diff line change
@@ -7,19 +7,19 @@ tl;dr: mono3D with pair wise relation and non-linear optimization.
7
7
#### Overall impression
8
8
This work is inspired by [CenterNet](centernet.md). it not only predicts the 3d bbox from the center of the bbox (similar to [RTM3D](RTM3D) but without predicting the eight points directly). It is similar to the popular solutions to the [Kaggle mono3D competition](https://www.kaggle.com/c/pku-autonomous-driving).
9
9
10
-
The main idea is to predict distance of each instance and relative distance between neighboring pairs, and their corresponding uncertainties, then use nonlinear optimization (with g2o) for joint optimization. It refines the detection results based on spatial relationships.
10
+
The main idea is to predict distance of each instance and relative distance between neighboring pairs, and their corresponding uncertainties, then use nonlinear optimization (with g2o) for joint optimization. It refines the detection results based on spatial relationships. The mining of pair-wise relationship if similar to [MoNet-3D](monet3d.md).
11
11
12
12
MonoPair improved accuracy dramatically, especially for heavily occluded scenario.
13
13
14
14
#### Key ideas
15
-
- Range circle: diameter is set up
15
+
- Range circle: diameter is set up to connect the center of the two instances
16
16
- Predicting relative distance is in local coordinate. This is a brilliant idea as this makes the regression target to be invariant to global azimuth. Regression target is multiplied by the rotational matrix of the azimuth angle.
17
17
- Predict uncertainty helps depth estimation greatly, as shown in Table 5.
18
18
- The joint optimization does not lead to too much improvement as shown in Table 6.
- Weight matrix is diagonal of predicted uncertainties of diff bits. $W = \text{diag}(1/\sigma_i) $. The authors tried various weighting strategies but no improvement.
24
24
- For images with more pair constraints, the performnace is better, even before
25
25
- The addition of uncertainty to depth leads to the biggest improvement.
0 commit comments