You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We provide a trained checkpoint for the baseline model at [this link](https://drive.google.com/file/d/1VzYy1a_PbOUnqZZLNW58jpWTFL8PRc3B/view?usp=sharing)
37
37
38
38
## Testing
39
-
39
+
To evaluate your own splitted validation data, run
Specify the path to test data via --test_data_dir, e.g. to predict the result for submission, set --test_data_dir tag_op/data/A. The predicted result will be stored in `tag_op/model_L2I/answer_dev.json`.
The predicted result will be stored in `tag_op/model_L2I/answer_dev.json`.
45
48
46
49
## Result Evaluation
47
50
48
-
Run `evaluate.py` by specifying the data file with gold answers and the predicted results. This will return the Exact Match and F1 score.
51
+
Run `evaluate.py`for your own validation data by specifying the data file with gold answers and the predicted results. This will return the Exact Match and F1 score.
0 commit comments