Skip to content

Releases: ggml-org/llama.cpp

b6663

01 Oct 22:11
e95fec6
Compare
Choose a tag to compare
HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.…

b6662

01 Oct 21:30
ded67b9
Compare
Choose a tag to compare
llama : parameter conversion and loading fixes for PLaMo2 variants (#…

b6661

01 Oct 18:47
1fe4e38
Compare
Choose a tag to compare
ci: Properly install rocwmma for hip builds (#16305)

* CI: Properly install rocwmma for hip builds

on windows we now windows install rocwmma from ubuntu pacakges

* CI: update linux rocm docker build to use rocm 7.0

b6660

01 Oct 17:42
4201dea
Compare
Choose a tag to compare
common: introduce http.h for httplib-based client (#16373)

* common: introduce http.h for httplib-based client

This change moves cpp-httplib based URL parsing and client setup into
a new header `common/http.h`, and integrates it in `arg.cpp` and `run.cpp`.

It is an iteration towards removing libcurl, while intentionally
minimizing changes to existing code to guarantee the same behavior when
`LLAMA_CURL` is used.

Signed-off-by: Adrien Gallouët <[email protected]>

* tools : add missing WIN32_LEAN_AND_MEAN

Signed-off-by: Adrien Gallouët <[email protected]>

---------

Signed-off-by: Adrien Gallouët <[email protected]>
Signed-off-by: Adrien Gallouët <[email protected]>

b6653

30 Sep 22:18
e74c92e
Compare
Choose a tag to compare
model : support GLM 4.6 (make a few NextN/MTP tensors not required) (…

b6651

30 Sep 22:09
bf6f3b3
Compare
Choose a tag to compare
common : disable progress bar without a tty (#16352)

* common : disable progress bar without a tty

Signed-off-by: Adrien Gallouët <[email protected]>

* Add missing headers

Signed-off-by: Adrien Gallouët <[email protected]>

---------

Signed-off-by: Adrien Gallouët <[email protected]>

b6650

30 Sep 21:12
7c156df
Compare
Choose a tag to compare
opencl: support pad_ext (#15888)

b6648

30 Sep 19:49
8d78cd2
Compare
Choose a tag to compare
ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187)

* Work on rope

* Simplify inplace operation generation and combine mul/add generation

* Work on rope variants

* implement neox rope

* rope complete

* Add sub,div,glu operators

* implement scale op

* Update cpy shader to handle cont/more types

* formatting

* Update test vars printing for rope,rms_norm

* Avoid ROPE hardcoded constants

* Add TODO to change ROPE constants to enum

Co-authored-by: Georgi Gerganov <[email protected]>

* fix TODO comment

---------

Co-authored-by: Georgi Gerganov <[email protected]>

b6647

30 Sep 18:08
d1c84a6
Compare
Choose a tag to compare
opencl: support ne3 in get_rows (#15866)

b6646

30 Sep 15:54
364a7a6
Compare
Choose a tag to compare
common : remove common_has_curl() (#16351)

`test-arg-parser.cpp` has been updated to work consistently,
regardless of whether CURL or SSL support is available, and
now always points to `ggml.ai`.

The previous timeout test has been removed, but it can be
added back by providing a dedicated URL under `ggml.ai`.

Signed-off-by: Adrien Gallouët <[email protected]>