- Add an option to disable thinking (for models that have thinking capability).
Example usage:llm -m qwen3:30b-a3b-q4_K_M "Why is the sky blue?" -o think false
- Set the connection timeout to 1 second to quickly fail when the Ollama host is unreachable.