CS6640 - Final Project

Folder structure

|-- root
    |-- Model_results_visualization.ipynb                             ------------> Comparative results summarization and visualization of all models
    |-- utils.py                                                      ------------> Contain common class and methods for data transformation/preparation and evaluation metrics calculation
    |-- Basic_CNN_Model.ipynb                                         ------------> Basic CNN Architecture for testing(Not included in the report)
    |-- Basic_EDA.ipynb                                               ------------> Basic Data summarization
    |-- TransferLearning_yolo.ipynb                                   ------------> Yolo architech for testing(Not included in the report)
    |-- TrasnferLearning_AlexNet.ipynb                                ------------> Pretrained AlexNet Model
    |-- TrasnferLearning_GoogleNet_Inception_V1.ipynb                 ------------> Pretrained GoogleNet(Inception v1) Model
    |-- TrasnferLearning_GoogleNet_Inception_v3.ipynb                 ------------> Pretrained GoogleNet(Inception v3) Model
    |-- TrasnferLearning_ResNet50.ipynb                               ------------> Pretrained ResNet50 Model
    |-- data                                                          ------------> Contain original data
    |   |-- test.csv                                                  ------------> Contains image name, ground thruth for Test data
    |   |-- train.csv                                                 ------------> Contains image name, ground thruth for Training data
    |   |-- images_train                                              ------------> Training ISW image set
    |   |   |-- 1.png
    |   |   |-- 100.png
    |   |   |-- ......
    |   |-- images_test                                               ------------> Test ISW image set
    |   |   |-- 1.png
    |   |   |-- 100.png
    |   |   |-- ......
    |-- model_outputs_data                                            ------------> Directory for model output data(For saving the best performed model after training, which can be later used if needed.)
    |   |-- best_alexnet_model.pth
    |   |   |-- ......
    |   |-- model_evaluation_logs                                     ------------> Directory for saving model training data for later evaluations
    |   |   |-- training_logs_alexnet.csv
    |   |   |-- .....
    |   |-- model_prediction_logs                                     ------------> Directory for saving model predictions for later analysis
    |   |   |-- alexnet_labels_predictions.csv
    |   |   |-- .....
    |   |-- yolo_output                                               ------------> Output directory for data preparation of YOLO model
    |-- pre_trained_models                                            ------------> Locally saved pre-trained models ( Not included for the experiment. Just for testing)
    |   |-- resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5
    |   |-- yolov8x-cls.pt
    |-- runs                                                          ------------> Output directory for saving model training of  YOLO model (Automatically created)

Read this

Data transformation/processing for each model is done in utils.py to maintain the consistency
Each model has a separate notebook with annotations and comments. This is for clarity and to tweak the model architecture if needed.
In model training loop, the best version of the model is saved based on validation loss if I need to train it more later(It wasn't really important and did not push to the repo).
Logging:
- Training Logs(Epoch, Train_Loss, Validation_Loss, Validation_Accuracy, FLOPs) during training were recorded in model_outputs_data/model_evaluation_logs
- Model evaluation logs(true_labels, predicted_labels, positive_probabilities) on test data were logged in model_outputs_data/model_prediction_logs
- These logs are used in Model_results_visualization for comparison, results discussion and visualization

Steps to execute

Install necessary libraries before executing the notebook files
- pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124 (I used CUDA for GPU performance utilization, but the code works without it, just taking a longer time)
- pip install tensorflow[and-cuda] (tf used for some testing stuff, not for main 6 models)
- pip install -U scikit-learn
- pip install pandas
- pip install matplotlib
- pip install ptflops
- That's it! You're good to go. If you don't want to execute models you can check out training and testing logs of model outputs.

Ignore these :

Ignore these files(some additional work I did, just out of curiosity. But NOT relevant to the project/report. I didn't remove these just in case if I wanted to work on this in future) :
TransferLearning_yolo.ipynb, TrasnferLearning_VGG16.ipynb, TrasnferLearning_SqueezeNet.ipynb
- YOLO was also included in the code as a test and it performed well(F1 = 0.94), but it was not included in the report because it's often considered as a segmentation model rather than classification.
- Another two models (VGG16, SqueezeNet) was tested and did not include them because; Training time was very long(VGG16 specifically) making it difficult to test the model and do necessary adjustment.
- It's difficult to conclude their performance without further investigations, therefore they were simply ignored.

Source of the Images & Access Conditions

Dataset used: https://www.kaggle.com/competitions/internal-waves/data
Dataset sourced from: https://xwaves.ifremer.fr/#/quicklook
Use of the images is regulated by: http://en.data.ifremer.fr/All-about-data/Data-access-conditions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CS6640 - Final Project

Folder structure

Read this

Steps to execute

Ignore these :

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
model_outputs_data		model_outputs_data
.gitattributes		.gitattributes
.gitignore		.gitignore
Basic_CNN_Model.ipynb		Basic_CNN_Model.ipynb
Basic_EDA.ipynb		Basic_EDA.ipynb
Model_results_visualization.ipynb		Model_results_visualization.ipynb
README.md		README.md
TransferLearning_yolo.ipynb		TransferLearning_yolo.ipynb
TrasnferLearning_AlexNet.ipynb		TrasnferLearning_AlexNet.ipynb
TrasnferLearning_DenseNet121.ipynb		TrasnferLearning_DenseNet121.ipynb
TrasnferLearning_EfficientNet.ipynb		TrasnferLearning_EfficientNet.ipynb
TrasnferLearning_GoogleNet_Inception_V1.ipynb		TrasnferLearning_GoogleNet_Inception_V1.ipynb
TrasnferLearning_GoogleNet_Inception_v3.ipynb		TrasnferLearning_GoogleNet_Inception_v3.ipynb
TrasnferLearning_ResNet50.ipynb		TrasnferLearning_ResNet50.ipynb
TrasnferLearning_SqueezeNet.ipynb		TrasnferLearning_SqueezeNet.ipynb
TrasnferLearning_VGG16.ipynb		TrasnferLearning_VGG16.ipynb
utils.py		utils.py

ishara084/CS6640_Project

Folders and files

Latest commit

History

Repository files navigation

CS6640 - Final Project

Folder structure

Read this

Steps to execute

Ignore these :

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages