Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about reproducing the result #26

Open
chihchiehchen opened this issue Apr 12, 2017 · 3 comments
Open

Question about reproducing the result #26

chihchiehchen opened this issue Apr 12, 2017 · 3 comments

Comments

@chihchiehchen
Copy link

Hello,

I tried to reproduce the result (with n_action_repeat 1) on the computer with GTX 1080, however the performance is not as good as shown in the figure. After 2.88 M steps the average reward is 0.0174,
the average ep_reward is 3.1071, and the max ep_reward is 7.

Maybe I did something wrong in the setting or misread some information. Could you give me some suggestions? Thanks a lot!

Chih-Chieh

@hiwonjoon
Copy link

Not answering your questions, but, What kinds of an environment are you testing? Breakout-v0?
How long does it take for about 3M steps in your setting?

@chihchiehchen
Copy link
Author

chihchiehchen commented Jun 27, 2017 via email

@FushanLi
Copy link

I am running the model, and get very similar performance as you.
My guess is the results shown in the figure are run by DQN published on nature. And I am using the model published in nips.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants