
Releases: ggml-org/llama.cpp

b5299

07 May 09:16
6c7fd67
llama : support tie embedding for chatglm models (#13328)
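For context, "tied" embeddings reuse the input embedding matrix as the output projection, so the checkpoint stores one vocab-by-dim matrix instead of two. A minimal, hypothetical Python sketch of the idea (not chatglm- or llama.cpp-specific):

```python
# Hypothetical sketch of tied embeddings: the output head reuses the
# input embedding matrix instead of carrying a separate weight.
def embed(token_ids, embedding):
    # embedding: list of vocab rows, each a dim-length vector
    return [embedding[t] for t in token_ids]

def output_logits(hidden, embedding):
    # logits[v] = dot(hidden, embedding[v]) -- the same matrix, used "tied"
    return [sum(h * w for h, w in zip(hidden, row)) for row in embedding]
```

With tying, loading a model only needs the embedding tensor; the output head is derived from it rather than read from the file.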

b5298

06 May 23:10
141a908
CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (#13135)
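For context, a "real" CUDA architecture (sm_XX) embeds GPU-specific SASS, while a "virtual" one (compute_XX) embeds PTX that newer GPUs can JIT-compile. Using CMake's standard `-real`/`-virtual` suffix syntax, a mixed build might look like the following; the architecture list here is purely illustrative, not llama.cpp's default:

```shell
# Illustrative only: SASS for sm_80/sm_86, plus PTX for compute_89 so
# GPUs newer than the listed archs can still JIT the kernels.
cmake -B build -DGGML_NATIVE=OFF \
      -DCMAKE_CUDA_ARCHITECTURES="80-real;86-real;89-virtual"
cmake --build build
```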

b5297

06 May 22:26
32916a4
clip : refactor graph builder (#13321)

* mtmd : refactor graph builder

* fix qwen2vl

* clean up siglip cgraph

* pixtral migrated

* move minicpmv to a dedicated build function

* move max_feature_layer to build_llava

* use build_attn for minicpm resampler

* fix windows build

* add comment for batch_size

* also support tinygemma3 test model

* qwen2vl does not use RMS norm

* fix qwen2vl norm (2)

b5296

06 May 22:16
ffc7272
sampling : make top_n_sigma no-op at <=0 or a single candidate (#13345)

b5295

06 May 19:17
91a86a6
sampling : don't consider -infinity values in top_n_sigma (#13344)
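Together with the b5296 change above, the intended top_n_sigma behavior can be sketched as follows. This is a hypothetical standalone Python illustration of the filter (keep tokens whose logit lies within n standard deviations of the maximum), not llama.cpp's actual C++ implementation:

```python
import math

NEG_INF = float("-inf")

def top_n_sigma(logits, n):
    """Keep tokens whose logit is within n std deviations of the max;
    mask the rest to -inf. No-op for n <= 0 or a single candidate,
    and -inf entries are excluded from the statistics."""
    finite = [x for x in logits if x != NEG_INF]
    # No-op when disabled or when at most one real candidate remains.
    if n <= 0 or len(finite) <= 1:
        return list(logits)
    mean = sum(finite) / len(finite)
    std = math.sqrt(sum((x - mean) ** 2 for x in finite) / len(finite))
    cutoff = max(finite) - n * std
    return [x if x >= cutoff else NEG_INF for x in logits]
```

Excluding -inf values matters because a single -inf would otherwise poison the mean and standard deviation, making the cutoff meaningless.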

b5293

06 May 16:31
1e333d5
SYCL: Disable reorder optimize by default and stop setting tensor ext…

b5292

06 May 14:29
2f54e34
llama : fix build_ffn without gate (#13336)

* llama : fix build_ffn without gate

* fix build on windows

* Revert "fix build on windows"

This reverts commit fc420d3c7eef3481d3d2f313fef2757cb33a7c56.

b5289

06 May 07:28
15a28ec
CUDA: fix --split-mode row for MMQ (#13323)

b5287

05 May 21:28
9070365
CUDA: fix logic for clearing padding with -ngl 0 (#13320)

b5286

05 May 20:56
233461f
sampling : Integrate Top-nσ into main sampling chain (and add it to t…