Summary:
Note that we can't do the following right now:
* initialize and quantize the model with int4_weight_only quantization on CPU
* move the model to CUDA
We'll enable this in a separate PR.
Test Plan:
CI
Reviewers:
Subscribers:
Tasks:
Tags: