You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ENV:
cuda: 12.1
torch: 2.5.0+cu121
python benchmark_fp6.py
Hello, have you tested the performance of the FP6 kernel on the A100? I found that the speed is much slower compared to FP16."