Atari100k Pong settings

I am running the command `python dreamerv3/main.py --script train_eval --configs atari100k --run.eval_eps 100 --task atari100k_pong`(#173) for --seed from 0 to 4, on RTX 3090 GPUs. However, for every seed, I consistently get a final score of -21 (the minimum score), and it seems that the agent does not move in sample trajectories. You can see my full reproduced results [here](https://docs.google.com/spreadsheets/d/1AuCd1b-numwhQ8bZ0kNoC9UKVJ-dDnO4Wg93TENvVmc/edit?usp=sharing). 

To debug, I tried running these different configurations:
* I tried running the old version of DreamerV3 ([2023 version](https://github.com/danijar/dreamerv3/tree/8fa35f83eee1ce7e10f3dee0b766587d0a713a60)), which successfully reproduces an average score of 18, as reported in the old version of the paper.
* I tried running with a train_ratio of 128 as described in the paper, instead of the default of 256 as set by configs.yaml for atari100k. However, this still results in scores of -21.
* I tried running the small model of new DreamerV3 (12M parameters), but this also still results in -21.
* I tried reproducing results for other Atari 100K games. Some of my [results](https://docs.google.com/spreadsheets/d/1AuCd1b-numwhQ8bZ0kNoC9UKVJ-dDnO4Wg93TENvVmc/edit?usp=sharing) match the reported results (Alien, Amidar, Assault, Boxing) while some do not (Asterix, Battle Zone, Up N Down).

This is similar to #138, but I wasn't sure if they were discussing the old version of DreamerV3 (2023) or the new version, as the train_ratio of 1024 was recommended (which was used in the 2023 paper but not the 2024 paper).

Could you please let me know anything I missed in the configuration or setup for running the new version of DreamerV3 for Atari100K? Or are there any recommended debugging steps?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Atari100k Pong settings #175

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Atari100k Pong settings #175

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions