Please see this [comment](https://github.com/ggerganov/llama.cpp/issues/3502#issuecomment-1753847439) by @jploski.