Use LR schedule for beta and epsilon #3940

andrewcoh · 2020-05-08T18:36:39Z

Proposed change(s)

This PR extends the schedule used to decay the learning rate so that it also applies to beta and epsilon. Currently, beta and epsilon are always decayed, and this behavior is not configurable. That hurts training scenarios such as self-play, which need some degree of exploration throughout an experiment because the opponent keeps changing.

Additionally, this PR logs beta and epsilon to TensorBoard so that these values can be tracked during training.
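
To illustrate the idea, here is a minimal pure-Python sketch of how one schedule setting could drive a hyperparameter such as beta or epsilon. It is not the code from this PR: `decayed_value` and `min_value` are hypothetical names, the `LINEAR` member and the enum values are assumptions (only `ScheduleType.CONSTANT` appears in the diff), and the linear branch simply mirrors the formula of `tf.train.polynomial_decay` with `power=1.0`.

```python
from enum import Enum


class ScheduleType(Enum):
    # CONSTANT appears in the diff; LINEAR and the string values are assumptions.
    CONSTANT = "constant"
    LINEAR = "linear"


def decayed_value(initial, step, max_step, schedule, min_value=1e-10):
    """Return the value of a scheduled hyperparameter at a training step.

    Hypothetical helper for illustration only.
    """
    if schedule == ScheduleType.CONSTANT:
        # No decay: the value stays fixed for the whole run.
        return initial
    # Linear decay from `initial` to `min_value`, clamped once step >= max_step.
    progress = min(step, max_step) / max_step
    return (initial - min_value) * (1.0 - progress) + min_value


if __name__ == "__main__":
    for step in (0, 250_000, 500_000):
        beta = decayed_value(5e-3, step, 500_000, ScheduleType.LINEAR)
        epsilon = decayed_value(0.2, step, 500_000, ScheduleType.CONSTANT)
        print(step, beta, epsilon)
```

With a constant schedule the value never shrinks, which is the behavior a self-play run would likely want for its exploration-related parameters.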

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

andrewcoh · 2020-05-08T18:45:47Z

ml-agents/mlagents/trainers/models.py

-            learning_rate = tf.train.polynomial_decay(
-                lr, global_step, max_step, 1e-10, power=1.0
+        if schedule == ScheduleType.CONSTANT:
+            parameter_rate = tf.Variable(parameter, trainable=False)


@ervteng added trainable=False

ervteng

🚢 🇮🇹

Add constant decay to beta and epsilon

48be62d

andrewcoh requested a review from ervteng May 8, 2020 18:36

andrewcoh added 2 commits May 8, 2020 11:37

update change log

5db8528

removed stop gradient/add trainable=false

731d1ea

andrewcoh commented May 8, 2020

andrewcoh added 2 commits May 8, 2020 12:20

increase gail visual test steps

db7bbd2

increase bc steps gail visual ppo

bfa7b65

ervteng approved these changes May 8, 2020

andrewcoh merged commit 34dd929 into master May 8, 2020

delete-merged-branch bot deleted the develop-constant-decay branch May 8, 2020 20:48

github-actions bot locked as resolved and limited conversation to collaborators May 14, 2021

3 participants
