I am trying to collect gradients for LLaMA2-13B with the released code, but I run into an out-of-memory error.
Currently, I am using a single NVIDIA H100 GPU.
How many GPUs do we need for models with more than 13B parameters?