Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vulkan: Implement GGML_OP_CUMSUM ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17479 opened Nov 24, 2025 by jeffbolznv Loading…
opencl: add sqr, sqrt, mean and ssm_conv ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#17476 opened Nov 24, 2025 by lhez Draft
vulkan: allow graph_optimize for prompt processing workloads ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17475 opened Nov 24, 2025 by jeffbolznv Loading…
tests: Avoid floating point precision false positives in SUM testing Everything test related
#17471 opened Nov 24, 2025 by jeffbolznv Loading…
server: introduce API for serving / loading / unloading multiple models examples python python script changes script Script related server testing Everything test related
#17470 opened Nov 24, 2025 by ngxson Loading…
common : support custom HTTP headers for model downloads
#17468 opened Nov 24, 2025 by angt Loading…
ci : skip winget update when not in ggml-org devops improvements to build systems and github actions
#17465 opened Nov 24, 2025 by angt Loading…
Typo in json.gbnf
#17460 opened Nov 24, 2025 by hi-cannon Loading…
SOLVE_TRI CUDA kernel for small matrices ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17457 opened Nov 23, 2025 by pwilkin Loading…
vulkan: Use fewer rows for scalar FA when HS is not a multiple of 16 ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#17455 opened Nov 23, 2025 by jeffbolznv Loading…
model : add LLADA 2.0 diffusion support examples model Model specific python python script changes
#17454 opened Nov 23, 2025 by wsbagnsv1 Draft
ggml-cpu : add RISC-V Zvfh impl for ggml_vec_mad_f16 ggml changes relating to the ggml tensor library for machine learning
#17448 opened Nov 23, 2025 by xctan Loading…
HIP: enable mul_mat_f for RDNA4 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17437 opened Nov 22, 2025 by zhang-hui-yulo Loading…
docs: vulkan add GGML_VK_ALLOW_SYSMEM_FALLBACK=1 docs documentation Improvements or additions to documentation
#17436 opened Nov 22, 2025 by taronaeo Loading…
Fix convert_hf_to_gguf.py script on s390x python python script changes
#17431 opened Nov 21, 2025 by AlekseiNikiforovIBM Loading…
common : throttle download progress output to reduce IO flush
#17427 opened Nov 21, 2025 by angt Loading…
server : add Anthropic Messages API support examples python python script changes server
#17425 opened Nov 21, 2025 by noname22 Loading…
cmake : simplify build info detection using standard variables build Compilation issues
#17423 opened Nov 21, 2025 by angt Loading…
vulkan: Implement top-k Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#17418 opened Nov 21, 2025 by jeffbolznv Draft
Vulkan: Add GGML_OP_GET_REL_POS ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17417 opened Nov 20, 2025 by AgainstEntropy Loading…
ProTip! Adding no:label will show everything without a label.