2.2.5 Backend: Aphrodite Engine

Aphrodite Engine

Handle: aphrodite
URL: http://localhost:33921

aphrodite

PygmalionAI's large-scale inference engine

Starting

# [Optional] pre-pull the image, ~5GB
harbor pull aphrodite

# Start the service
harbor up aphrodite

# [Optional] When loading closed/gated models
# provision the token
harbor hf token <your-token>

Models

# Open HF Search to find the models
harbor find gptq awq

# Download model repo to the global HF cache
# user/repo format
harbor hf download infly/INF-34B-Chat-AWQ

# Get/set the model to run
# in the aphrodite engine
harbor aphrodite model infly/INF-34B-Chat-AWQ

Configuration

Official Engine Options docs

# See available options
harbor run aphrodite --help

# Get/Set the extra arguments for
# the aphrodite engine
harbor aphrodite args

Set specific version

You can adjust used version (docker image tag) of the engine:

# Get the current version - "latest" by default
harbor config get aphrodite.version

# Set the version
harbor config set aphrodite.version latest

Home | CLI Reference | Services | Adding New Service | Compatibility

Uh oh!

2.2.5 Backend: Aphrodite Engine

Aphrodite Engine

Starting

Models

Configuration

Set specific version

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!