Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add AfmoeForCausalLM support python python script changes
#16477 opened Oct 8, 2025 by bartowski1182 Draft
CUDA Copy Kernel for Contiguous Tensors for GGML CPY OP ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16471 opened Oct 8, 2025 by anavp-nvidia Loading…
fix: convert_hf_to_gguf - change Jamba non-sentencepiece mode (tokeni… python python script changes
#16470 opened Oct 8, 2025 by amirai21 Loading…
opencl: add q8_0 mm support ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#16469 opened Oct 8, 2025 by lhez Draft
vulkan: Add State Space Model (SSM) Operations Support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16463 opened Oct 7, 2025 by giuseppe Loading…
Add hipblasLt implementation for batched gemm to improve performance for CDNA3 only ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16457 opened Oct 7, 2025 by peizhang56 Loading…
vulkan: Handle FA with all -inf mask values ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16447 opened Oct 6, 2025 by jeffbolznv Loading…
Metal Pool 1D Kernel Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#16429 opened Oct 5, 2025 by ThoreKoritzius Loading…
Implement llama-pull tool examples
#16423 opened Oct 4, 2025 by ericcurtin Loading…
contrib : add fish completions via --completion-fish
#16404 opened Oct 3, 2025 by g0t4 Loading…
server : host-memory prompt caching examples python python script changes server
#16391 opened Oct 2, 2025 by ggerganov Loading…
4 of 5 tasks
model-conversion : add support for SentenceTransformers examples python python script changes
#16387 opened Oct 2, 2025 by danbev Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes) ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362 opened Sep 30, 2025 by GittyBurstein Loading…
SYCL SET operator optimized for F32 tensors ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350 opened Sep 30, 2025 by GittyBurstein Loading…
Update build.md documentation Improvements or additions to documentation
#16346 opened Sep 30, 2025 by refine360-debug Loading…
ggml-cpu : inspect -march and -mcpu to found the CPU ggml changes relating to the ggml tensor library for machine learning
#16333 opened Sep 29, 2025 by angt Loading…
ggml : remove KQ mask padding ggml changes relating to the ggml tensor library for machine learning
#16309 opened Sep 28, 2025 by ggerganov Draft
2 of 3 tasks
hip : substituted bpermute ops with swizzle ops (gfx906, maybe all AMD) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16291 opened Sep 27, 2025 by iacopPBK Loading…
Update convert_hf_to_gguf_update.py python python script changes
#16280 opened Sep 26, 2025 by cpumaxx Loading…
ProTip! What’s not been updated in a month: updated:<2025-09-09.