-
Notifications
You must be signed in to change notification settings - Fork 13.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add AfmoeForCausalLM support
python
python script changes
#16477
opened Oct 8, 2025 by
bartowski1182
•
Draft
CUDA Copy Kernel for Contiguous Tensors for GGML CPY OP
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16471
opened Oct 8, 2025 by
anavp-nvidia
Loading…
fix: convert_hf_to_gguf - change Jamba non-sentencepiece mode (tokeni…
python
python script changes
#16470
opened Oct 8, 2025 by
amirai21
Loading…
vulkan: Add State Space Model (SSM) Operations Support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16463
opened Oct 7, 2025 by
giuseppe
Loading…
Add hipblasLt implementation for batched gemm to improve performance for CDNA3 only
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16457
opened Oct 7, 2025 by
peizhang56
Loading…
vulkan: Handle FA with all -inf mask values
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16447
opened Oct 6, 2025 by
jeffbolznv
Loading…
Metal Pool 1D Kernel
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16429
opened Oct 5, 2025 by
ThoreKoritzius
Loading…
fix: add generic fallback to detect trailing <think> tags in Jinja templates and handle forced-open reasoning blocks
testing
Everything test related
#16426
opened Oct 4, 2025 by
ServeurpersoCom
•
Draft
server / ranking : add sorting and management of top_n
examples
server
#16403
opened Oct 3, 2025 by
YannFollet
Loading…
model-conversion : add support for SentenceTransformers
examples
python
python script changes
#16387
opened Oct 2, 2025 by
danbev
Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
Loading…
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Loading…
Update build.md
documentation
Improvements or additions to documentation
#16346
opened Sep 30, 2025 by
refine360-debug
Loading…
ggml-cpu : inspect -march and -mcpu to found the CPU
ggml
changes relating to the ggml tensor library for machine learning
#16333
opened Sep 29, 2025 by
angt
Loading…
Add a deepwiki badge to auto-refresh the wiki-in-deepwiki weekly.
#16296
opened Sep 28, 2025 by
0400H
Loading…
hip : substituted bpermute ops with swizzle ops (gfx906, maybe all AMD)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16291
opened Sep 27, 2025 by
iacopPBK
Loading…
Update convert_hf_to_gguf_update.py
python
python script changes
#16280
opened Sep 26, 2025 by
cpumaxx
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-09.