Skip to content

Conversation

Niccolo-Ajroldi
Copy link
Member

The new dropout design in AlgoPerf (see #753 (comment) and #873) requires passing dropout_rate to the workload’s forward pass function rather than to init_model_fn. Some minor issues from this change were left unresolved, and this PR fixes them.

  • Reference submissions: we pipe dropout_rate through the model fwd pass in both JAX and PyTorch submissions.
  • PyTorch dev workloads: we adapt MNIST and CIFAR workloads to new dropout design.
  • modeldiff: we remove dropout_rate and aux_dropout_rate from tests/modeldiffs/diff.py and adapt all tests by pipping dropout_rate=0.0 through the test call (one-line change).
  • spec.py: updated spec to reflect new design.
  • DOCS: updated docs to reflect new design.

@Niccolo-Ajroldi Niccolo-Ajroldi requested a review from a team as a code owner August 25, 2025 13:44
Copy link

github-actions bot commented Aug 25, 2025

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Copy link
Contributor

@priyakasimbeg priyakasimbeg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thank you for tying up the loose ends Niccolo!

@priyakasimbeg priyakasimbeg merged commit 0643268 into mlcommons:dev Aug 26, 2025
15 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Aug 26, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants