-
Notifications
You must be signed in to change notification settings - Fork 21
Open
Description
It seems that you use activity index as the obervation code, I think there are other imformation we need to feed to the agent. That will get a better result.
The amount of information is too little, simple Q learning feels like enough.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels