Skip to content

Releases: ngxson/llama.cpp

b5177

24 Apr 11:00
80982e8

Choose a tag to compare

arg : clean up handling --mmproj with -hf (#13082)

* arg : clean up handling --mmproj with -hf

* rm change about no_mmproj

* Revert "rm change about no_mmproj"

This reverts commit 2cac8e0efb629d66c612f137e75d562f94bb9e6c.

* handle no_mmproj explicitly

* skip download mmproj on examples not using it

b5176

24 Apr 08:27
7604a7d

Choose a tag to compare

metal : fix floating-point range of attention scores in FA kernels (#…

b5175

24 Apr 08:16
b3b6d86

Choose a tag to compare

vulkan: matmul gcn tuning (#13016)

* tune matmul for gcn

* this one is more power efficient

* Update ggml/src/ggml-vulkan/ggml-vulkan.cpp

Co-authored-by: 0cc4m <[email protected]>

* disable this tune for the proprietary driver

---------

Co-authored-by: 0cc4m <[email protected]>

b5174

23 Apr 22:32
5630406

Choose a tag to compare

llama-mtmd-cli: Sigint rework in mtmd vision example (#13080)

* Sigint rework in mtmd vision example

* Applied suggestions on mtmd-cli PR

* Forgot to invert one of the conditions

* Update examples/llava/mtmd-cli.cpp

* Removed redundant exit check

---------

Co-authored-by: pl752 <[email protected]>
Co-authored-by: Xuan-Son Nguyen <[email protected]>

b5170

22 Apr 20:19
658987c

Choose a tag to compare

CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID (#13014)

* CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID

* fix logic for RoPE support, CUDA graphs

b5169

22 Apr 15:22
dc39a5e

Choose a tag to compare

mtmd : support SmolVLM (version 1 and 2) (#13050)

* mtmd : support SmolVLM (version 1 and 2)

* correct chat template

* fix n_patches

* scale_factor is an int

* add more models to test

b5168

22 Apr 14:02
ab47dec

Choose a tag to compare

security : add note about RPC and server functionality (#13061)

* security : add note about RPC functionality

* security : add note about llama-server

b5166

22 Apr 09:27
2434535

Choose a tag to compare

llava : update documentations (#13055)

* llava : update documentations

* fix typo

b5165

21 Apr 17:08
1d735c0

Choose a tag to compare

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (#12871)

* ggml : add SSE 4.2 variant for CPUs without AVX

* ggml : add x64 base ABI variant

b5164

21 Apr 14:41
5368ddd

Choose a tag to compare

SYCL: Add non-contiguous support in ROPE (#12993)

ggml-ci