-
Notifications
You must be signed in to change notification settings - Fork 4.6k
bf16+pipeline parallelism #1801
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 27 commits
Commits
Show all changes
52 commits
Select commit
Hold shift + click to select a range
fb0dc00
bf16 updates
jeffra 6eb4f1f
Got bf16 working
tjruwase a3d3576
fp32 reduction; flattened tensors
tjruwase 6f5ebc3
bf16+zero_stage_1 first cut
tjruwase 819abe2
finish zero_stage 1 sharding
tjruwase e48035b
Matching fp16 with debugging codes
tjruwase 8245053
Matching loss with fp16
tjruwase 1529313
Fix gradient clipping
tjruwase 27e5b95
bf16 gradient clipping fix
tjruwase f497702
Unscale grad norm
tjruwase 0ad7c7d
Fix grad norm scaling
tjruwase b81d862
Enable loading fp16_zero_1 into bf16_zero_1 engine and vice versa
tjruwase 35ea380
Fix clip_grad key error
tjruwase 37011a9
Reduce tied weight gradients
tjruwase 8fbd4bf
Rebase with master
tjruwase 61d51fd
Fix grad norm for moe
tjruwase 3ee61cd
Merge branch 'master' into olruwase/bf16-updates
jeffra 46cc2ce
Merge branch 'master' into olruwase/bf16-updates
jeffra de3616c
Reduce specified gradients
tjruwase 89e054d
Merge branch 'olruwase/reduce_specified_gradients' of github.com:micr…
tjruwase ab61edb
Use O(n) instead of O(n^2)
tjruwase b7d64fd
Remove optimizer restriction for bf16
tjruwase 1919868
Link bf16 & fp32 params
tjruwase 77b649d
Clip gradients of last stage tied weights
tjruwase 4a505ec
Merge branch 'master' into olruwase/bf16-updates
tjruwase ff99cb2
Merge branch 'master' into olruwase/bf16-updates
tjruwase 20fdba3
Merge branch 'master' into olruwase/bf16-updates
jeffra 86fa437
Merge branch 'master' into olruwase/bf16-updates
tjruwase 71499a8
Merge branch 'master' into olruwase/bf16-updates
jeffra 7e7fa60
Merge branch 'master' into olruwase/bf16-updates
jeffra 2aa612a
Simplify tied weights reduction logic
tjruwase 2cd21f1
Merge branch 'master' into olruwase/bf16-updates
tjruwase a4cbf0c
Merge branch 'olruwase/bf16-updates' of github.com:microsoft/DeepSpee…
tjruwase 67ea260
Merge branch 'master' into olruwase/bf16-updates
tjruwase 6a4d6e6
Merge branch 'master' into olruwase/bf16-updates
tjruwase 4e1dcfd
Merge branch 'master' into olruwase/bf16-updates
tjruwase e24814a
Also clip all tp rank parameters
tjruwase 88cdf61
Merge branch 'olruwase/bf16-updates' of github.com:microsoft/DeepSpee…
tjruwase 20697bc
lp to hp mapping
tjruwase 4e8f7ff
Link lp/hp/optim state; Refresh links after checkpoint load
tjruwase 52a2f10
Merge branch 'master' into olruwase/bf16-updates
tjruwase 3ed5703
Merge branch 'olruwase/bf16-updates' of github.com:microsoft/DeepSpee…
tjruwase 5481b86
Remove debug print
tjruwase d911e67
Remove debug print
tjruwase 144f652
Simplify zero_grad logic
tjruwase bb70816
fp32 accessors
tjruwase 89b4b3f
Merge branch 'master' into olruwase/bf16-updates
tjruwase a9bfaee
Merge branch 'master' into olruwase/bf16-updates
tjruwase fa4ff11
Fix update bug
tjruwase cfd5638
Merge branch 'olruwase/bf16-updates' of github.com:microsoft/DeepSpee…
tjruwase 5ea1c60
Merge branch 'master' into olruwase/bf16-updates
tjruwase 0e2a1c5
Merge branch 'master' into olruwase/bf16-updates
tjruwase File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -4,6 +4,7 @@ | |
| *.swp | ||
| *.log | ||
| deepspeed/git_version_info_installed.py | ||
| __pycache__ | ||
|
|
||
| # Build + installation data | ||
| build/ | ||
|
|
||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.