Releases: ngxson/llama.cpp
Releases · ngxson/llama.cpp
b5163
mtmd : merge llava, gemma3 and minicpmv CLI into single `llama-mtmd-c…
b5162
convert : experimental support for `--mmproj` flag (#13023) * convert : experimental support for `--mmproj` flag * fix bad ctrl+f replace * fix style * split into subclasses TextModel and VisionModel * rename Mode --> ModelBase * small fix * correct CLIP_VISION arch name (because existing GGUF already use it) * Apply suggestions from code review Co-authored-by: compilade <[email protected]> * fix Mistral3Model * fix typo Co-authored-by: compilade <[email protected]> --------- Co-authored-by: compilade <[email protected]>
b5161
llava: fix errors in clip.h on certain compilers (#13030)
b5160
vulkan: support noncontiguous rms_norm (#13031)
b5159
metal: add neg operator (#13029)
b5158
Disable CI cross-compile builds (#13022)
b5156
clip : refactor, add `image_manipulation` and `llava_uhd` classes (#1…
b5155
main : Fix Ctrl+D/newline handling (#12951) This restores the behavior from #491. This does not affect Ctrl+D's ability to terminate --multiline-input lines (#1040). This also actually implements #587: "If the user wants the text to end in a newline, this should be accomplished by explicitly adding a newline by using \ followed by return, then returning control by pressing return again." Fixes #12949
b5153
server : use std::move whenever possible (#12936) * server : use std::move whenever possible * use r-value ref * Apply suggestions from code review Co-authored-by: Georgi Gerganov <[email protected]> * make task creation scoped * restore std::move * fix task_id not set correctly * apply changes from suggestion Co-authored-by: ggerganov <[email protected]> --------- Co-authored-by: Georgi Gerganov <[email protected]>
b5152
SYCL: Refactor and enable FP16 in binary broadcast OPs (#12975) * SYCL: refactor move to a separate file * Fix binbcast * Remove duplicates * fix include formatting * fix typo