Commit 9d6cb36

Merge pull request #1 from jangsoopark/v2.0.0
V2.0.0
2 parents b8ab4c0 + 654c89a commit 9d6cb36

60 files changed (+864, -1633 lines)

.gitignore (+8, -5)
@@ -280,11 +280,14 @@ $RECYCLE.BIN/
 *.lnk

 # asdf
-dataset/raw/train/*
-dataset/raw/test/*
+dataset/soc/raw/train/
+dataset/soc/raw/test/

-dataset/train/*
-dataset/test/*
+dataset/soc/train/
+dataset/soc/test/
+experiments/history

 *.pth
-# End of https://www.toptal.com/developers/gitignore/api/windows,linux,pycharm+all,python
+# End of https://www.toptal.com/developers/gitignore/api/windows,linux,pycharm+all,python
+
+

README.md (+152, -153)
@@ -1,186 +1,158 @@
-# A-ConvNets
-
-**Note:**
->
-> Since this is my first attempt to implement a model for SAR images and I don't have any domain knowledge,
-> this repository may contain incorrect implementations.
->
-> If you find such mistakes, please let me know via issues so that I can correct them.
->
-> It will be very helpful for me to understand SAR image processing and could help lower the technical barriers.
->
-> Thank you.
-
-## Target Classification Using the Deep Convolutional Networks for SAR Images
-
-This repository contains an implementation of the paper
-`S. Chen, H. Wang, F. Xu and Y. Jin, "Target Classification Using the Deep Convolutional Networks for SAR Images,"
-in IEEE Transactions on Geoscience and Remote Sensing, vol. 54, no. 8, pp. 4806-4817, Aug. 2016,
-doi: 10.1109/TGRS.2016.2551720.`
-
-
-## MSTAR Dataset
-
-The implementation uses the MSTAR (Moving and Stationary Target Acquisition and Recognition) database. Each image consists of
-a header (ASCII) and data (*data type: float32, shape: W x H x 2*). The header contains the metadata needed to read and use the
-database, including width, height, serial number, azimuth angle, etc. The data consists of magnitude and phase. The figure
-below shows an example of the magnitude (left) and phase (right).
-
-![Example of MSTAR image](assets/figure/001.png)
-
-In order to comprehensively assess the performance, the model is trained and evaluated under SOC
-(Standard Operating Condition), which uses images captured at a 17-degree depression angle for training and images
-captured at a 15-degree depression angle for evaluation.
-
-|Class|Serial No.|Training<br/>Depression<br>angle|Training<br/>#images|Test<br/>Depression<br>angle|Test<br/>#images|
-|:---:|:---:|:---:|:---:|:---:|:---:|
-| 2S1 | b01 | 17 | 299 | 15 | 274 |
-| BMP-2 | 9563 | 17 | 233 | 15 | 195 |
-| BRDM-2 | E-71 | 17 | 298 | 15 | 274 |
-| BTR-60 | k10yt7532 | 17 | 256 | 15 | 195 |
-| BTR-70 | c71 | 17 | 233 | 15 | 196 |
-| D7 | 92v13015 | 17 | 299 | 15 | 274 |
-| T-62 | A51 | 17 | 299 | 15 | 273 |
-| T-72 | 132 | 17 | 232 | 15 | 196 |
-| ZIL-131 | E12 | 17 | 299 | 15 | 274 |
-| ZSU-234 | d08 | 17 | 299 | 15 | 274 |
+# AConvNet

-## Model
-
-#### `src/model/network.py`
-
-Because of the lack of available data in the MSTAR database, deep neural networks can suffer from
-over-fitting. Hence, the author proposes a new architecture which eases the over-fitting problem by replacing the fully
-connected layers with convolution layers.
-
-|layer|Input|Conv 1|Conv 2|Conv 3|Conv 4|Conv 5|
-|:---:|---|:---:|:---:|:---:|:---:|:---:|
-|channels|2|16|32|64|128|10|
-|weight size| - |5 x 5|5 x 5|6 x 6|5 x 5| 3 x 3|
-|pooling| - | 2 x 2 - s2 | 2 x 2 - s2 | 2 x 2 - s2| - | - |
-|dropout| - | - | - | - | 0.5 | - |
-|activation| linear | ReLU | ReLU | ReLU | ReLU | Softmax |
-
-## Experiments
-
-### Data Augmentation
-
-#### `src/data/generate.py`
-#### `src/data/mstar.py`
-
-This repository follows the shifting method used by the author. However, in order to preserve reproducibility, this
-repository does not use random sampling to extract patches. The patches are extracted in raster-scan order using a fixed
-patch size and stride.
-
-### Training
-
-#### `src/model/_base.py`
-
-#### `experiments/AConvNet.json`
-
-To replicate the official result, the model is trained with the same optimization method, which uses SGD with momentum,
-learning rate decay, and weight decay, and the weights are initialized with He's method with a uniform distribution.
-
-The difference from the paper is that the author decreases the learning rate at epoch 50, whereas this repository decreases the
-learning rate at epochs 10 and 50 for stable convergence in my environment.
+### Target Classification Using the Deep Convolutional Networks for SAR Images
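The architecture table in the removed section above (repeated unchanged in the new `## Model` section further down) fully specifies the network. Below is a minimal PyTorch sketch of that all-convolutional stack, assuming 88 x 88 two-channel input patches and the PyTorch stack implied by the `*.pth` entries in `.gitignore`; the class name and the `nn.Sequential` layout are illustrative, not taken from `src/model/network.py`.

```python
import torch
import torch.nn as nn


class AConvNetSketch(nn.Module):
    """Sketch of the all-convolutional architecture in the table above."""

    def __init__(self, num_classes=10, dropout=0.5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(2, 16, kernel_size=5),    # 2 x 88 x 88 -> 16 x 84 x 84
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),          # -> 16 x 42 x 42
            nn.Conv2d(16, 32, kernel_size=5),   # -> 32 x 38 x 38
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),          # -> 32 x 19 x 19
            nn.Conv2d(32, 64, kernel_size=6),   # -> 64 x 14 x 14
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),          # -> 64 x 7 x 7
            nn.Conv2d(64, 128, kernel_size=5),  # -> 128 x 3 x 3
            nn.ReLU(),
            nn.Dropout(dropout),
            nn.Conv2d(128, num_classes, kernel_size=3),  # -> 10 x 1 x 1
        )

    def forward(self, x):
        # The final softmax is left to the loss (e.g. nn.CrossEntropyLoss)
        # or to the inference code.
        return self.features(x).flatten(1)


if __name__ == "__main__":
    logits = AConvNetSketch()(torch.randn(4, 2, 88, 88))
    print(logits.shape)  # torch.Size([4, 10])
```

Replacing the usual fully connected classifier with a 3 x 3 convolution over the final 3 x 3 feature map is what keeps the number of free parameters small.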

-![](assets/figure/003.png)
+This repository is a reproduced implementation of AConvNet, which recognizes targets in the MSTAR dataset.
+You can find the author's official implementation at [MSTAR-AConvNet](https://github.com/fudanxu/MSTAR-AConvNet).

-### Early Stopping
+## Dataset

-The early stopping method is a form of regularization that finds the balance point between under-fitting and
-over-fitting.
+### MSTAR (Moving and Stationary Target Acquisition and Recognition) Database

-In this experiment, the trained model reaches an accuracy of 99.01% at epoch 42.
+#### Format

-### Results
+- Header
+  - Type: ASCII
+  - Includes the data shape (width, height), serial number, azimuth angle, etc.
+- Data
+  - Type: Two-bytes
+  - Shape: W x H x 2
+  - Magnitude block
+  - Phase block
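A rough sketch of how one of these files could be read, based only on the layout described above (an ASCII header followed by a magnitude block and a phase block). The header keys `NumberOfRows` / `NumberOfColumns`, the `[EndofPhoenixHeader]` marker, and the big-endian element type are assumptions about the raw MSTAR files, not code taken from `src/data/mstar.py`; the removed README states `float32` samples while the list above says two bytes, so the dtype is left as a parameter.

```python
import numpy as np


def read_mstar_chip(path, dtype=">f4"):
    """Sketch: parse the ASCII header, then read the magnitude and phase blocks.

    The header keys and end marker are assumptions; pass dtype=">u2" if the
    samples really are two bytes wide rather than big-endian float32.
    """
    header = {}
    with open(path, "rb") as f:
        while True:
            raw = f.readline()
            if not raw:
                raise ValueError("no [EndofPhoenixHeader] marker found")
            line = raw.decode("ascii", errors="ignore").strip()
            if line == "[EndofPhoenixHeader]":
                break
            if "=" in line:
                key, value = line.split("=", 1)
                header[key.strip()] = value.strip()
        rows = int(header["NumberOfRows"])
        cols = int(header["NumberOfColumns"])
        data = np.frombuffer(f.read(), dtype=dtype)

    magnitude = data[: rows * cols].reshape(rows, cols)
    phase = data[rows * cols : 2 * rows * cols].reshape(rows, cols)
    return header, np.stack([magnitude, phase], axis=-1)  # H x W x 2
```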

-#### Confusion Matrix
+The figure below shows an example of the magnitude block (left) and the phase block (right).

-| classes | 2S1 | BMP-2 | BRDM-2 | BTR-60 | BTR-70 | D7 | T-62 | T-72 | ZIL-131 | ZSU-234 | Accuracy (%) |
-|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
-| 2S1 | 274 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
-| BMP-2 | 1 | 190 | 0 | 2 | 0 | 0 | 0 | 2 | 0 | 0 | 97.44 |
-| BRDM-2 | 0 | 0 | 272 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 99.27 |
-| BTR-60 | 1 | 0 | 0 | 186 | 4 | 0 | 0 | 1 | 1 | 2 | 95.38 |
-| BTR-70 | 0 | 1 | 0 | 0 | 195 | 0 | 0 | 0 | 0 | 0 | 99.49 |
-| D7 | 0 | 0 | 0 | 0 | 0 | 272 | 0 | 0 | 2 | 0 | 99.27 |
-| T-62 | 0 | 0 | 0 | 0 | 0 | 0 | 271 | 0 | 0 | 2 | 99.27 |
-| T-72 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 194 | 0 | 0 | 98.98 |
-| ZIL-131 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 273 | 1 | 99.64 |
-| ZSU-234 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 274 | 100 |
-| Total | - | - | - | - | - | - | - | - | - | - | 99.01 |
+![Example of data block: 2S1](./assets/figure/001.png)

-> The total accuracy is calculated as (TP + TN) / (TP + TN + FP + FN).
->
-> It is equivalent to sum(C * I) / sum(C),
->
-> where C is the confusion matrix and I is the identity matrix.
-
-<!-- TODO : reinforce more analytical explanation
-#### Error Analysis
-
-False Positive of BMP-2
+## Model
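The note above defines the total accuracy as sum(C * I) / sum(C), i.e. the trace of the confusion matrix divided by the total number of test samples. A small numpy sketch of the overall and per-class figures (function names are illustrative):

```python
import numpy as np


def overall_accuracy(confusion):
    """sum(C * I) / sum(C): trace of the confusion matrix over its total count."""
    confusion = np.asarray(confusion, dtype=float)
    return np.trace(confusion) / confusion.sum()


def per_class_accuracy(confusion):
    """Diagonal divided by each row sum (rows are the true classes)."""
    confusion = np.asarray(confusion, dtype=float)
    return np.diag(confusion) / confusion.sum(axis=1)
```

Applied to the matrix above this gives 2401 / 2425 ≈ 99.01%, which matches the reported total.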

-![FP-BMP-2](assets/figure/004.png)
+The proposed model consists only of **sparsely connected layers**, without any fully connected layers.

-False Positive of BRDM-2
+- This eases the over-fitting problem by reducing the number of free parameters (the model capacity).

-![FP-BRDM-2](assets/figure/005.png)
+| layer       | Input  | Conv 1     | Conv 2     | Conv 3     | Conv 4 | Conv 5  |
+| :---------: | ------ | :--------: | :--------: | :--------: | :----: | :-----: |
+| channels    | 2      | 16         | 32         | 64         | 128    | 10      |
+| weight size | -      | 5 x 5      | 5 x 5      | 6 x 6      | 5 x 5  | 3 x 3   |
+| pooling     | -      | 2 x 2 - s2 | 2 x 2 - s2 | 2 x 2 - s2 | -      | -       |
+| dropout     | -      | -          | -          | -          | 0.5    | -       |
+| activation  | linear | ReLU       | ReLU       | ReLU       | ReLU   | Softmax |

-False Positive of BTR-60
+## Training
+For training, this implementation fixes the random seed to `12321` for reproducibility.

-![FP-BTR-60](assets/figure/006.png)
+The experimental conditions are the same as in the paper, except for the `data augmentation` and the `learning rate`.
+The `learning rate` is initialized to `1e-3` and decreased by a factor of 0.1 **after 26 epochs**.
+You can see the details in `src/model/_base.py` and `experiments/config/AConvNet-SOC.json`.
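A minimal sketch of the training setup described above: the random seed fixed to `12321`, SGD-based optimization with the learning rate starting at `1e-3` and multiplied by 0.1 after 26 epochs, and He (Kaiming) uniform initialization as mentioned in the removed training notes earlier in this diff. The momentum and weight-decay values are placeholders rather than the settings from `experiments/config/AConvNet-SOC.json`, and the model is a stand-in module.

```python
import random

import numpy as np
import torch


def fix_seed(seed=12321):
    """Fix every RNG the training loop touches, for reproducibility."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)


fix_seed()

model = torch.nn.Conv2d(2, 10, kernel_size=3)  # stand-in for the actual network

# He initialization with a uniform distribution, as in the removed training notes.
for m in model.modules():
    if isinstance(m, torch.nn.Conv2d):
        torch.nn.init.kaiming_uniform_(m.weight)

optimizer = torch.optim.SGD(
    model.parameters(),
    lr=1e-3,            # initial learning rate stated above
    momentum=0.9,       # placeholder: the real value lives in the JSON config
    weight_decay=4e-3,  # placeholder
)
# Multiply the learning rate by 0.1 once, after epoch 26.
scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[26], gamma=0.1)

for epoch in range(100):
    # ... one pass over the training loader, with optimizer.step() per batch ...
    scheduler.step()
```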

-False Positive of BTR-70
+### Data Augmentation
+
+- The author uses random shifting to extract 88 x 88 patches from the 128 x 128 SAR image chips.
+  - The number of training images per SAR image chip can be increased to at most (128 - 88 + 1) x (128 - 88 + 1) = 1681.
+
+- However, for SOC, this repository does not use random shifting, due to an accuracy issue.
+  - You can see the details in `src/data/generate_dataset.py` and `src/data/mstar.py`.
+  - This implementation failed to achieve higher than 98% accuracy when using random sampling.
+  - The data augmentation is implemented as follows (see the sketch below):
+    - Crop the central 94 x 94 region of the 128 x 128 SAR image chip.
+    - Extract 88 x 88 patches with stride 1 from the 94 x 94 image (49 patches per image chip).
+
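A sketch of the deterministic patch extraction listed above: crop the central 94 x 94 region of a 128 x 128 chip, then slide an 88 x 88 window with stride 1, which yields (94 - 88 + 1) x (94 - 88 + 1) = 49 patches per chip. Function names are illustrative and not taken from `src/data/generate_dataset.py`.

```python
import numpy as np


def center_crop(chip, size=94):
    h, w = chip.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    return chip[top:top + size, left:left + size]


def extract_patches(chip, patch=88, stride=1):
    """Raster-scan 88 x 88 patches from the central 94 x 94 crop of a chip."""
    cropped = center_crop(chip)
    offsets = cropped.shape[0] - patch + 1  # 94 - 88 + 1 = 7 offsets per axis
    return [
        cropped[i:i + patch, j:j + patch]
        for i in range(0, offsets, stride)
        for j in range(0, offsets, stride)
    ]


patches = extract_patches(np.zeros((128, 128, 2)))  # magnitude + phase channels
print(len(patches))  # 49
```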

-![FP-BTR-70](assets/figure/007.png)
+## Experiments

-False Positive of D7
+You can download the MSTAR dataset from [MSTAR Overview](https://www.sdms.afrl.af.mil/index.php?collection=mstar).
+
+### Standard Operating Condition (SOC)
+
+- MSTAR Target Chips (T72 BMP2 BTR70 SLICY), which is **MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY.zip**
+- MSTAR / IU Mixed Targets, which consists of **MSTAR-PublicMixedTargets-CD1.zip** and **MSTAR-PublicMixedTargets-CD2.zip**
+- **The SLICY target is ignored**
+
+|         |            | Train      |            | Test       |            |
+| ------- | ---------- | ---------- | ---------- | ---------- | ---------- |
+| Class   | Serial No. | Depression | No. Images | Depression | No. Images |
+| BMP-2   | 9563       | 17         | 233        | 15         | 196        |
+| BTR-70  | c71        | 17         | 233        | 15         | 196        |
+| T-72    | 132        | 17         | 232        | 15         | 196        |
+| BTR-60  | k10yt7532  | 17         | 256        | 15         | 195        |
+| 2S1     | b01        | 17         | 299        | 15         | 274        |
+| BRDM-2  | E-71       | 17         | 298        | 15         | 274        |
+| D7      | 92v13015   | 17         | 299        | 15         | 274        |
+| T-62    | A51        | 17         | 299        | 15         | 273        |
+| ZIL-131 | E12        | 17         | 299        | 15         | 274        |
+| ZSU-234 | d08        | 17         | 299        | 15         | 274        |
+
+#### Training Set (Depression: 17$\degree$)
+
+```shell
+MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY
+├ TRAIN/17_DEG
+│    ├ BMP2/SN_9563/*.000 (233 images)
+│    ├ BTR70/SN_C71/*.004 (233 images)
+│    └ T72/SN_132/*.015 (232 images)
+└ ...
+
+MSTAR-PublicMixedTargets-CD2/MSTAR_PUBLIC_MIXED_TARGETS_CD2
+├ 17_DEG
+│    ├ COL1/SCENE1/BTR_60/*.003 (256 images)
+│    └ COL2/SCENE1
+│        ├ 2S1/*.000 (299 images)
+│        ├ BRDM_2/*.001 (298 images)
+│        ├ D7/*.005 (299 images)
+│        ├ SLICY
+│        ├ T62/*.016 (299 images)
+│        ├ ZIL131/*.025 (299 images)
+│        └ ZSU_23_4/*.026 (299 images)
+└ ...

-![FP-D7](assets/figure/008.png)
+```

-False Positive of T-62
+#### Test Set (Depression: 15$\degree$)
+
+```shell
+MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY
+├ TEST/15_DEG
+│    ├ BMP2/SN_9563/*.000 (195 images)
+│    ├ BTR70/SN_C71/*.004 (196 images)
+│    └ T72/SN_132/*.015 (196 images)
+└ ...
+
+MSTAR-PublicMixedTargets-CD1/MSTAR_PUBLIC_MIXED_TARGETS_CD1
+├ 15_DEG
+│    ├ COL1/SCENE1/BTR_60/*.003 (195 images)
+│    └ COL2/SCENE1
+│        ├ 2S1/*.000 (274 images)
+│        ├ BRDM_2/*.001 (274 images)
+│        ├ D7/*.005 (274 images)
+│        ├ SLICY
+│        ├ T62/*.016 (273 images)
+│        ├ ZIL131/*.025 (274 images)
+│        └ ZSU_23_4/*.026 (274 images)
+└ ...

-![FP-T-62](assets/figure/009.png)
+```
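Given the directory layouts above, the raw files of one split can be collected with a plain glob. The class-to-directory mapping restates a few of the paths from the trees; the helper itself is only an assumption about how such a listing could be built, not the logic of `src/data/generate_dataset.py`.

```python
from pathlib import Path

# Class name -> directory, restated from the 17-degree training tree above.
TRAIN_17_DEG = {
    "BMP-2": "MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY/TRAIN/17_DEG/BMP2/SN_9563",
    "BTR-70": "MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY/TRAIN/17_DEG/BTR70/SN_C71",
    "T-72": "MSTAR-PublicTargetChips-T72-BMP2-BTR70-SLICY/TRAIN/17_DEG/T72/SN_132",
    "BTR-60": "MSTAR-PublicMixedTargets-CD2/MSTAR_PUBLIC_MIXED_TARGETS_CD2/17_DEG/COL1/SCENE1/BTR_60",
    "2S1": "MSTAR-PublicMixedTargets-CD2/MSTAR_PUBLIC_MIXED_TARGETS_CD2/17_DEG/COL2/SCENE1/2S1",
    # ... the remaining classes (BRDM_2, D7, T62, ZIL131, ZSU_23_4) follow the same pattern;
    # SLICY is simply not listed, which is how it gets ignored.
}


def list_split(root, class_dirs):
    """Return {class: sorted list of raw chip files} for one split."""
    files = {}
    for name, rel in class_dirs.items():
        # Raw chips carry numeric extensions such as .000, .003 or .015 (see the trees above).
        files[name] = sorted(p for p in (Path(root) / rel).glob("*.0*") if p.is_file())
    return files
```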

-False Positive of T-72
+#### Results of SOC
+- You can see the details in `notebook/experiments-SOC.ipynb`.

-![FP-T-72](assets/figure/010.png)
+- Visualization of the training loss and the test accuracy:

-False Positive of ZIL-131
+![soc-training-plot](./assets/figure/soc-training-plot.png)

-![FP-ZIL-131](assets/figure/011.png)
--->
+- Confusion matrix of the best model, at **epoch 28**:

-#### Confuser Rejection
-Training Target: BMP-2, BTR-70, T-72
+![soc-confusion-matrix](./assets/figure/soc-confusion-matrix.png)

-Confuser Target: 2S1, ZIL-131

-- Confuser Rejection Rules
-  - If all the posterior probabilities are lower than the threshold (\tau_th), the target image is declared a confuser.
-  - Detection Ratio (P_d) = (# known targets detected) / (# known targets in the test set)
-  - False Alarm Ratio (P_fa) = (# confusers declared as known) / (# confusers in the test set)
-
-![](assets/figure/confuser-rejection.png)
-
-> TODO
+### Extended Operating Conditions (EOC)
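The confuser-rejection rule in the removed section above is compact enough to state in code: a test image is declared a confuser when every class posterior falls below the threshold τ_th, and P_d / P_fa are then the fractions of known targets and of confusers that are still declared known. A numpy sketch, assuming `posteriors` is an (N, num_classes) array of softmax outputs and `is_known` marks the samples drawn from the known classes:

```python
import numpy as np


def confuser_rejection(posteriors, is_known, tau_th):
    """Declare a sample a confuser when all class posteriors are below tau_th.

    Returns (P_d, P_fa):
      P_d  = (# known targets still declared known) / (# known targets)
      P_fa = (# confusers declared known)           / (# confusers)
    """
    posteriors = np.asarray(posteriors, dtype=float)
    is_known = np.asarray(is_known, dtype=bool)
    declared_known = posteriors.max(axis=1) >= tau_th

    p_d = declared_known[is_known].mean()
    p_fa = declared_known[~is_known].mean()
    return p_d, p_fa
```

Sweeping τ_th from 0 to 1 traces a curve of P_d against P_fa, presumably what `assets/figure/confuser-rejection.png` plots.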

-### Details about the specific environment of this repository
+### Outlier Rejection

-| | |
-|:---:|:---:|
-| OS | Windows 10 |
-| CPU | Intel i9 |
-| GPU | RTX 2080 8GB |
-| Memory | 16 GB |
-| SSD | 500 GB |
-| HDD | 2 TB |
+### End-to-End SAR-ATR Cases

-### Citation
+## Citation

-```
+```bibtex
 @ARTICLE{7460942,
 author={S. {Chen} and H. {Wang} and F. {Xu} and Y. {Jin}},
 journal={IEEE Transactions on Geoscience and Remote Sensing},
@@ -191,4 +163,31 @@ Confuser Target: 2S1, ZIL-131
 pages={4806-4817},
 doi={10.1109/TGRS.2016.2551720}
 }
-```
+```
+
+---
+
+## TODO
+
+- [ ] Implementation
+  - [ ] Data generation
+    - [X] SOC
+    - [ ] EOC
+    - [ ] Outlier Rejection
+    - [ ] End-to-End SAR-ATR
+  - [ ] Data Loader
+    - [X] SOC
+    - [ ] EOC
+    - [ ] Outlier Rejection
+    - [ ] End-to-End SAR-ATR
+  - [ ] Model
+    - [X] Network
+    - [X] Training
+    - [X] Early Stopping
+    - [X] Hyper-parameter Optimization
+- [ ] Experiments
+  - [X] Reproduce the SOC Results
+  - [ ] Reproduce the EOC Results
+  - [ ] Reproduce the outlier rejection
+  - [ ] Reproduce the end-to-end SAR-ATR
+

assets/figure/soc-training-plot.png (12.5 KB)
