Training run with certain non-default values causes evaluation run to fail.

I'm still trying to track down exactly what's causing this. But in the meantime, here's some data.

Training command:

```
python -m pdb train.py \
    --embed_dim=768 \
    --layers=4 \
    --heads=12 \
    --training_steps=12 \
    --log_eval_freq=4 \
    --warmup_steps=1 \
    --batch_size=4 \
    --sequence_length=512 \
    --eval_episodes=1 \
    --activation_fn=gelu \
    --save_model \
    --save_mode=checkpoint \
    --text_prop=1.0 \
    --eval_text_log_examples \
    --text_datasets=wikitext-2-v1 \
    --text_datasets_paths=wikitext \
    --pretrained_lm=gpt2 \
    --disable_cosine_decay
```

Evaluation command:

```
python -m pdb eval.py \
    --model_path=./models/neko-gato-<your-id-here>/checkpoint_12.pt \
    --eval_episodes=1
```

![image](https://github.com/ManifoldRG/NEKO/assets/1719584/827ab5dc-d3c7-413d-82d0-e4c02108f242)

![image](https://github.com/ManifoldRG/NEKO/assets/1719584/70a94844-4a35-4c73-b8e3-17aa18f1d740)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training run with certain non-default values causes evaluation run to fail. #66

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Training run with certain non-default values causes evaluation run to fail. #66

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions