Enables batch DDPG agents to be trained. #416
This PR changes two Q-functions; the DDPG example is the only place they are used. Furthermore, the code at chainerrl/chainerrl/links/mlp.py (line 16 in 15d7cbb) and chainerrl/chainerrl/links/mlp_bn.py (line 31 in 15d7cbb) is not recurrent.
👍