Skip to content

b6759

Choose a tag to compare

@github-actions github-actions released this 14 Oct 11:35
1ee9d0b
CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)

* CUDA: use fastdiv + ggml_cuda_mad for mmvf

* use bf16 directly + fix formatting

* Add exception for HIP code