Add docker protocol support for llama-server model loading #15790
Conversation
@CISC @ggerganov PTAL
Pull Request Overview
This PR adds Docker registry support to llama-server, enabling users to pull and run AI models directly from Docker Hub using the docker:// protocol. The implementation handles Docker registry authentication, manifest parsing, and blob downloading to cache models locally.
- Adds Docker URL parsing and resolution functionality to download GGUF models from Docker registries
- Integrates Docker model resolution into the existing model loading pipeline
- Implements streaming download with proper authentication and caching support (a sketch of the pull flow follows this list)
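For orientation, here is a minimal sketch of that pull flow in C++ with libcurl. It assumes anonymous pulls from Docker Hub; the endpoints and manifest media type are standard OCI distribution-spec / Docker Hub details, while the helper names, the `ai/gemma3` repository, and the elided JSON parsing are illustrative, not the code this PR adds.

```cpp
#include <curl/curl.h>
#include <string>

// Collect the response body into a std::string.
static size_t write_to_string(char * ptr, size_t size, size_t nmemb, void * userdata) {
    auto * out = static_cast<std::string *>(userdata);
    out->append(ptr, size * nmemb);
    return size * nmemb;
}

// GET `url`, optionally sending a bearer token and an Accept header.
static std::string http_get(const std::string & url, const std::string & token, const std::string & accept) {
    std::string body;
    CURL * curl = curl_easy_init();
    curl_slist * headers = nullptr;
    if (!token.empty())  headers = curl_slist_append(headers, ("Authorization: Bearer " + token).c_str());
    if (!accept.empty()) headers = curl_slist_append(headers, ("Accept: " + accept).c_str());
    curl_easy_setopt(curl, CURLOPT_URL, url.c_str());
    curl_easy_setopt(curl, CURLOPT_HTTPHEADER, headers);
    curl_easy_setopt(curl, CURLOPT_FOLLOWLOCATION, 1L); // blob GETs often redirect to a CDN
    curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, write_to_string);
    curl_easy_setopt(curl, CURLOPT_WRITEDATA, &body);
    curl_easy_perform(curl);
    curl_slist_free_all(headers);
    curl_easy_cleanup(curl);
    return body;
}

int main() {
    curl_global_init(CURL_GLOBAL_DEFAULT);
    const std::string repo = "ai/gemma3"; // hypothetical repository name

    // 1. Get an anonymous pull token scoped to the repository.
    std::string token_json = http_get(
        "https://auth.docker.io/token?service=registry.docker.io"
        "&scope=repository:" + repo + ":pull", "", "");
    std::string token; // ... extract the "token" field from token_json (JSON parsing elided) ...

    // 2. Fetch the manifest for a tag; scan its layers for the GGUF blob's digest.
    std::string manifest = http_get(
        "https://registry-1.docker.io/v2/" + repo + "/manifests/latest",
        token, "application/vnd.oci.image.manifest.v1+json");
    std::string digest; // ... pick the layer whose mediaType identifies a GGUF file ...

    // 3. Download the blob by digest and write it into the local model cache.
    // http_get("https://registry-1.docker.io/v2/" + repo + "/blobs/" + digest, token, "");

    curl_global_cleanup();
    return 0;
}
```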
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.
| File | Description |
|---|---|
| common/common.cpp | Integrates Docker model resolution into the model loading pipeline and updates error messages |
| common/arg.h | Adds function declaration for Docker model resolution |
| common/arg.cpp | Implements complete Docker registry functionality including authentication, manifest parsing, and blob downloading |
@JohannesGaessler @slaren PTAL
@danbev PTAL
@ggerganov @JohannesGaessler @slaren @danbev struggling to get this reviewed; if you guys have cycles I'd appreciate it.
I'm also having a change of heart and am thinking of changing this to a -d/--docker-repo option; the string would then match the one suggested on Docker Hub and the one used in Docker Model Runner. It would also be more consistent with the huggingface argument approach.
@ggerganov @danbev ready for re-review
Added resumable downloads in a second commit; models can be large, and redownloading them from scratch on interrupted connections can be a pain and a waste of bandwidth.
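For context, a minimal sketch of what the resume amounts to with libcurl, assuming the server supports HTTP range requests; the function name and file handling below are illustrative rather than the PR's actual common_download_file_single changes.

```cpp
#include <curl/curl.h>
#include <cstdio>
#include <filesystem>

// Download `url` to `path`, continuing from any partial file already on disk.
bool download_resumable(const char * url, const char * path) {
    std::error_code ec;
    auto sz = std::filesystem::file_size(path, ec);
    curl_off_t offset = ec ? 0 : (curl_off_t) sz; // 0 if no partial file yet

    FILE * f = std::fopen(path, offset > 0 ? "ab" : "wb");
    if (!f) {
        return false;
    }

    CURL * curl = curl_easy_init();
    curl_easy_setopt(curl, CURLOPT_URL, url);
    curl_easy_setopt(curl, CURLOPT_FOLLOWLOCATION, 1L);
    curl_easy_setopt(curl, CURLOPT_WRITEDATA, f); // libcurl's default callback fwrite()s to this FILE*
    // Sends "Range: bytes=<offset>-" so the server only transmits what we're missing.
    curl_easy_setopt(curl, CURLOPT_RESUME_FROM_LARGE, offset);

    CURLcode res = curl_easy_perform(curl);
    curl_easy_cleanup(curl);
    std::fclose(f);
    return res == CURLE_OK;
}
```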
@ggerganov ready for re-review
ggerganov left a comment
The docker-related functions seem ok.
I'm not confident about the changes to common_download_file_single to support resumable downloads. Either wait for someone to review this part in detail, or move it to a separate PR.
SGTM, I do think the resumable downloads bit is important for the next PR, whether it's huggingface, docker, etc. Somebody has to pay the cloud bill for all the wasted petabytes that get retransferred because of retries. And of course having to restart a download from the beginning because of an interrupted connection is simply annoying; sometimes that can make larger models impossible to download for some people. Some servers even have server-side timeouts if you don't finish the download within a certain time. Resumable downloads solve these things client-side.
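One subtlety for whoever reviews that part (a hedged sketch, not this PR's code): a server that ignores the Range header replies 200 with the full body instead of 206 Partial Content, so appending blindly would corrupt the local file. A post-transfer check like the following lets the caller discard the partial file and restart cleanly.

```cpp
#include <curl/curl.h>

// After curl_easy_perform(), confirm the server actually honored the resume:
// 206 Partial Content means it did; a plain 200 with a nonzero offset means
// the full file was resent from byte zero, so the caller should delete the
// partial file and re-download rather than keep the (now corrupted) append.
bool server_honored_resume(CURL * curl, curl_off_t offset) {
    long status = 0;
    curl_easy_getinfo(curl, CURLINFO_RESPONSE_CODE, &status);
    return offset == 0 || status == 206;
}
```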
@ggerganov all done, it's just the docker pulling change now
Getting build problems unrelated to this PR: x86_64 macOS. Gonna try a rebuild.
@ggerganov green build!
To pull and run models via: llama-server -dr gemma3
Add some validators and sanitizers for Docker Model urls and metadata
Signed-off-by: Eric Curtin <[email protected]>
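As an illustration of the kind of checks that commit message describes (the PR's actual rules and function names may differ), repository names and tags can be screened against a simplified form of the Docker distribution reference grammar before any registry URL is built from them.

```cpp
#include <regex>
#include <string>

// A repository name is one or more lowercase alphanumeric components,
// separated by '/', with '.', '_' or '-' allowed inside a component
// (simplified from the reference grammar), e.g. "ai/gemma3".
bool valid_docker_repo(const std::string & repo) {
    static const std::regex re("^[a-z0-9]+(?:[._-][a-z0-9]+)*(?:/[a-z0-9]+(?:[._-][a-z0-9]+)*)*$");
    return std::regex_match(repo, re);
}

// A tag is at most 128 characters: a letter, digit or '_' first,
// then letters, digits, '_', '.' or '-'.
bool valid_docker_tag(const std::string & tag) {
    static const std::regex re("^[A-Za-z0-9_][A-Za-z0-9_.-]{0,127}$");
    return std::regex_match(tag, re);
}
```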