How to Run the Project

The project entry point is main_cer.py. If you run this file using python main_cer.py it should train the model and and run the validation and test sets on the trained model. Right now it only caculates accuracy.

To successfully run the code with neptune, you need to set the NEPTUNE_API_TOKEN as an environment variable. Here you can see how to do that.

If you have access to the unity servers and want to run this project as a batch there, the script is given in sbatch.run.sh. You have to create your own conda environment though using the requirements.txt to source the conda environment inside the sbatch.run.sh.

Dataset

The dataset used in the project comes from CSEDM. The dataset is preprocessed and put in data/dataset.pkl. The columns in the dataset are:

problemID: The id of the problem in CSEDM dataset.
problemDescription: The textual descrtiption of the problem.
studentID_1: The student ID in CSEDM dataset of the first student.
test_case_verdict_i_1 The test case verdicts of the code submission indexed using i_1. This is a string containing multiple 0-3. 0 means correct, 1 means wrong answer, 2 means run-time error, 3 means time-limit exceeded in test_case_verdict_x_y where x can be ['i','j'] and y can be [1,2].
codeID_i_1: codeID of i_1 in CSEDM dataset. This is the chronologically earlier submission made by studentID_1 in problemID.
code_i_1: The actual code in texts of codeID_i_1.
score_i_1: The assigned score in CSEDM dataset for codeID_i_1.
score_calc_i_1: Calculated score by our score against the test cases for codeID_i_1.
test_case_verdict_j_1: The test case verdicts of the later code submission indexed using j_1 by the student studentID_1.
codeID_j_1: Same as earlier.
code_j_1: Same as earlier.
score_j_1: Same as earlier.
score_calc_j_1: Same as earlier.
studentID_2: Same as student 1.
test_case_verdict_i_2: Same as student 1.
codeID_i_2: Same as student 1.
code_i_2: Same as student 1.
score_i_2: Same as student 1.
score_calc_i_2: Same as student 1.
test_case_verdict_j_2: Same as student 1.
codeID_j_2: Same as student 1.
code_j_2: Same as student 1.
score_j_2: Same as student 1.
score_calc_j_2: Same as student 1.
is_similar: Binary label (True/False) denoting whether the change between (code_i_1, code_j_1) and (code_i_2, code_j_2) is similar or not.

High-level TO DO

Prepare the data to include boundary cases for negative examples
Prepare the model to handle batched data so that we can share weights among the encoders instead of copying them to for 4 inputs.
Do the gradient accumulation.
Explore diffrent loss functions.

Code Structure

`preprocess_data.ipynb`

The puropose of this notebook is to preprocess the CSEDM dataset and prepare it for the CER project. We have the test cases for 17 of the problems. We also have the execution verdict of the runs for this test cases in a separate file. We put these two together and created data/dataset.pkl.

Requirements

Each dataset row contains a two pair of codes.
The student id for both the codes should be the same.
The code id must be different.
We also need the scores for both codes
There should also be the test case masks for both codes these are
The testCaseMask of the two codes must be different
The dataset should contain only the codes with test cases (17 problems so far)
The pair of codes in each row can be either consecutive chronologically or not. Based on this we can create different datasets.

`main_cer.py`

Main entry point for the project. This file prepares the training and testing data, and runs the training and testing loop.

`configs_cer.yaml`

Holds all the config values for various model, data, training and testing options.

`data_loader.py`

Data processing functions.

`model.py`

Defines all the models.

`trainer.py`

Defines all the training related helper functions

`eval.py`

Defines all the functions do the evaluation of the test outputs.

`utils.py`

Defines all the helper utility functions.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
baselines		baselines
bleu_scores		bleu_scores
data		data
.gitignore		.gitignore
Hist-optimal.png		Hist-optimal.png
Hist-random.png		Hist-random.png
Hist_compare_test.png		Hist_compare_test.png
Hist_compare_train0.png		Hist_compare_train0.png
Hist_compare_train2.png		Hist_compare_train2.png
Hist_compare_train8.png		Hist_compare_train8.png
Hist_compare_valid.png		Hist_compare_valid.png
README.md		README.md
analysis.py		analysis.py
baseline_codet5.txt		baseline_codet5.txt
bleu.sbatch.sh		bleu.sbatch.sh
cer_out_if_else.txt		cer_out_if_else.txt
cer_out_if_else_db.txt		cer_out_if_else_db.txt
cer_out_if_else_db_diffProblem.txt		cer_out_if_else_db_diffProblem.txt
cerd_out_all_student.txt		cerd_out_all_student.txt
cerd_out_all_student_codet5_.5_2_.5.txt		cerd_out_all_student_codet5_.5_2_.5.txt
cerd_out_if_else.txt		cerd_out_if_else.txt
cerd_out_if_else_exclusive.txt		cerd_out_if_else_exclusive.txt
cluster_summaries.txt		cluster_summaries.txt
cluster_summaries_2.txt		cluster_summaries_2.txt
cluster_summaries_cerd.5_4o.txt		cluster_summaries_cerd.5_4o.txt
cluster_summaries_cerd.5_4omini.txt		cluster_summaries_cerd.5_4omini.txt
cluster_summaries_cerdd.5_4o.txt		cluster_summaries_cerdd.5_4o.txt
cluster_summaries_cerdd.5_4omini.txt		cluster_summaries_cerdd.5_4omini.txt
cluster_summary.png		cluster_summary.png
cluster_summary_test_cerd_direct.png		cluster_summary_test_cerd_direct.png
cluster_summary_test_cerd_edit.png		cluster_summary_test_cerd_edit.png
cluster_summary_test_cerd_no_contrastive.png		cluster_summary_test_cerd_no_contrastive.png
cluster_summary_test_cerd_no_reconstruction.png		cluster_summary_test_cerd_no_reconstruction.png
cluster_summary_test_cerd_no_regularization.png		cluster_summary_test_cerd_no_regularization.png
cluster_summary_train_cerd_direct.png		cluster_summary_train_cerd_direct.png
configs_cer.yaml		configs_cer.yaml
data_loader.py		data_loader.py
datatypes.py		datatypes.py
eval.py		eval.py
exp.sbatch.run.sh		exp.sbatch.run.sh
gpt4o_cluster_summaries.csv		gpt4o_cluster_summaries.csv
l40.sbatch.run.sh		l40.sbatch.run.sh
main_all_exp_edit_cross_bleu.py		main_all_exp_edit_cross_bleu.py
main_all_exp_history_bleu.py		main_all_exp_history_bleu.py
main_cer.py		main_cer.py
main_exp_clustering_summary.py		main_exp_clustering_summary.py
main_exp_gen_code_decoder.py		main_exp_gen_code_decoder.py
main_exp_multi_step_cerd_gen_code_decoder.py		main_exp_multi_step_cerd_gen_code_decoder.py
main_experiment.py		main_experiment.py
main_finetune_decoder.py		main_finetune_decoder.py
main_test_gen copy.py		main_test_gen copy.py
main_test_gen.py		main_test_gen.py
main_test_gen2.py		main_test_gen2.py
mask_distance_comparison_boxplot.png		mask_distance_comparison_boxplot.png
mask_distance_comparison_hist.png		mask_distance_comparison_hist.png
mask_distance_comparison_hist_random.png		mask_distance_comparison_hist_random.png
model.py		model.py
multi-step-bleu.out		multi-step-bleu.out
multiBleu.png		multiBleu.png
multiBleu.sbatch.sh		multiBleu.sbatch.sh
multiBleuPlot.py		multiBleuPlot.py
multiBleul40.sbatch.run.sh		multiBleul40.sbatch.run.sh
mycermodel.py		mycermodel.py
mycermodel2.py		mycermodel2.py
mycodebleu.py		mycodebleu.py
out1.txt		out1.txt
out2.txt		out2.txt
out3.txt		out3.txt
out_all_closest.txt		out_all_closest.txt
out_all_clusters.txt		out_all_clusters.txt
out_all_exp_edit_cross.txt		out_all_exp_edit_cross.txt
out_all_exp_history_personal_all_prob.txt		out_all_exp_history_personal_all_prob.txt
out_all_exp_history_personal_array.txt		out_all_exp_history_personal_array.txt
out_all_exp_history_personal_ifelse.txt		out_all_exp_history_personal_ifelse.txt
out_all_exp_history_personal_string.txt		out_all_exp_history_personal_string.txt
out_array_closest.txt		out_array_closest.txt
out_history_model.txt		out_history_model.txt
out_history_model_large.txt		out_history_model_large.txt
out_history_model_personal_baseline_cerd.txt		out_history_model_personal_baseline_cerd.txt
out_history_model_personal_large_only_reconstruction.txt		out_history_model_personal_large_only_reconstruction.txt
out_history_model_personal_no_contrastive.txt		out_history_model_personal_no_contrastive.txt
out_history_model_personal_no_reconstruction.txt		out_history_model_personal_no_reconstruction.txt
out_history_model_personal_no_regularizer.txt		out_history_model_personal_no_regularizer.txt
out_history_model_personal_only_reconstruction.txt		out_history_model_personal_only_reconstruction.txt
out_history_model_personal_reconstruction_transform.txt		out_history_model_personal_reconstruction_transform.txt
out_ifelse_closest.txt		out_ifelse_closest.txt
out_run_java_gpt.txt		out_run_java_gpt.txt
out_run_java_model.txt		out_run_java_model.txt
out_string.txt		out_string.txt
out_string_closest.txt		out_string_closest.txt
preprocess_data.ipynb		preprocess_data.ipynb
requirements.txt		requirements.txt
sbatch.run.sh		sbatch.run.sh
sbatchl40.sbatch.run.sh		sbatchl40.sbatch.run.sh
test_cluster.png		test_cluster.png
test_model.ipynb		test_model.ipynb
testmodel.py		testmodel.py
testmodel3.py		testmodel3.py
train_cluster 80.png		train_cluster 80.png
train_cluster 90.png		train_cluster 90.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

How to Run the Project

Dataset

High-level TO DO

Code Structure

`preprocess_data.ipynb`

Requirements

`main_cer.py`

`configs_cer.yaml`

`data_loader.py`

`model.py`

`trainer.py`

`eval.py`

`utils.py`

About

Uh oh!

Releases

Packages

Languages

umass-ml4ed/code-edit-representation

Folders and files

Latest commit

History

Repository files navigation

How to Run the Project

Dataset

High-level TO DO

Code Structure

preprocess_data.ipynb

Requirements

main_cer.py

configs_cer.yaml

data_loader.py

model.py

trainer.py

eval.py

utils.py

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`preprocess_data.ipynb`

`main_cer.py`

`configs_cer.yaml`

`data_loader.py`

`model.py`

`trainer.py`

`eval.py`

`utils.py`

Packages