Add DeepSeek V3.1 thinking mode support #6

createthis · 2025-08-22T13:54:08Z

just for my own use

- Added `COMMON_CHAT_FORMAT_DEEPSEEK_V3_1` enum value
- Created `common_chat_params_init_deepseek_v3_1()` function (currently uses R1 implementation)
- Created `common_chat_parse_deepseek_v3_1()` function that handles V3.1 thinking format:
  - Extracts reasoning content before '</think>' tag into reasoning_content
  - Extracts regular content after '</think>' tag into content
  - No opening '<think>' tag in V3.1 format
- Added detection logic for V3.1 templates based on pattern: 'message['prefix'] is defined and message['prefix'] and thinking'
- Added V3.1 case to parsing switch statement

This addresses the issue where V3.1 outputs reasoning content followed by '</think>' and then regular content without the opening '<think>' tag.
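The split described in this PR can be illustrated with a small standalone sketch. It is not the code in this PR and does not use the llama.cpp parser classes: `split_result`, `split_deepseek_v3_1`, and the `thinking_enabled` flag are hypothetical names chosen for illustration. It assumes the caller knows whether the prompt put the model into thinking mode, and it treats text that has not yet reached '</think>' as reasoning.

```cpp
#include <string>

// Hypothetical result type for this sketch (not the llama.cpp message struct).
struct split_result {
    std::string reasoning_content;
    std::string content;
};

// Split raw DeepSeek V3.1 output at the first "</think>".
// V3.1 emits no opening "<think>" tag, so the only marker is the closing tag.
// thinking_enabled is an assumed flag: true when the prompt enabled thinking.
static split_result split_deepseek_v3_1(const std::string & raw, bool thinking_enabled) {
    static const std::string end_tag = "</think>";
    split_result out;

    if (!thinking_enabled) {
        // Non-thinking mode: everything the model produced is regular content.
        out.content = raw;
        return out;
    }

    const std::string::size_type pos = raw.find(end_tag);
    if (pos == std::string::npos) {
        // Closing tag not seen (for example, a partial stream): still reasoning.
        out.reasoning_content = raw;
        return out;
    }

    // Text before the tag is reasoning; text after it is regular content.
    out.reasoning_content = raw.substr(0, pos);
    out.content           = raw.substr(pos + end_tag.size());
    return out;
}
```

Calling `split_deepseek_v3_1("I should greet.</think>Hello!", true)` yields `reasoning_content == "I should greet."` and `content == "Hello!"`. Without some notion of whether thinking is active, a partial stream that has not yet reached '</think>' cannot be told apart from regular content, which is why the sketch takes the flag as an explicit argument.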

Tool calls work in thinking and non-thinking modes. However, I've introduced a regression in streaming mode where reasoning content initially comes through as regular content. I need to think about how to deal with this long term.

createthis self-assigned this Aug 22, 2025

createthis added 4 commits August 22, 2025 12:03

Another attempt by V3.1 non-thinking

3912fd3

Fix test, but it's not asserting anything.

bac6c99

Ignore vim swap files in tests dir

fe86282

Update the test

3d00d62

createthis mentioned this pull request Aug 22, 2025

Feature Request: Add DeepSeek-V3.1 ggml-org/llama.cpp#15496

Open

4 tasks

createthis added 23 commits August 23, 2025 10:42

Try using try_find_literal instead of regex

c50d887

passing test

3f319aa

Revert "Try using try_find_literal instead of regex"

79f7ca3

This reverts commit c50d887.

Remove unnecessary change

0d959ba

Remove comment

6223c1c

Add code to handle non-thinking mode.

0d372f4

Try to set message['prefix'] when thinking is enabled.

f0da116

This fixes reasoning, but breaks normal content. We need state in the chat parser.

56f7e38

DeepSeek V3.1 thinking is now the default. Disable with `--reasoning-budget 0`.

f4f0ddb

Simplify (DeepSeek V3.1 reasoning)

f7d2ee9

Fix sign inversion bug

7ac92ca

Add some tool calling code (not working).

be0b2b8

Tool calls working in non-reasoning mode.

776d95b

Attempt a unit test for tool call parsing.

a32cad1

Passing test

52d5488

Add tests for both happy path and broken fenced DeepSeek V3.1 tool call variants.

a839be7

Passing DeepSeek V3.1 tool call tests, but model is not working.

6ade60e

Revert assistance response prefill change. Not my monkeys.

79d4812

Add fenced_thinking unit test variant. Passes, but thinking tool calling still isn't working for some reason.

36b047c

Tests pass in reasoning mode. Also e2e tool test passes.

bdfa87f

Make a copy of the parse_json_tool_calls function for deepseek-v3.1 so as to not accidentally introduce regressions.

0e36761

Fix thinking_forced_open logic. tool calling broken. Need to add another test case.

ab22c76

github-actions bot added the testing label Aug 26, 2025

createthis and others added 20 commits August 26, 2025 14:22

That's what I get for cargo culting a newline.

4a2d17d

Add multi tool call test for deepseek v3.1 non-reasoning

b2d57ce

Merge branch 'master' into deepseek_3_1_thinking_mode

f422fe7

Move test, remove .gitignore change

7dc19e8

Place deepseek-v3.1 reasoning test directly into existing reasoning function per CISC's request.

380146e

Address whitespace CI failure.

9056707

Merge two assert_equals per CISC's request.

a406d6a

Add DeepSeek-V3.1 tests to tests/test-chat.cpp per CISC's request.

ec984da

Merge branch 'master' into deepseek_3_1_thinking_mode

92003d7

Merge deepseek V3.1 and regular parse_json_tool_calls() function behaviors by adding optional update_cursor argument.

f661dbe

Update tests/test-chat-parser.cpp

12b013f

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

800af00

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

80a7e1c

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

155852a

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

e587808

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

ac6ed1e

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

3843d94

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

6773708

Co-authored-by: Sigbjørn Skjæret <[email protected]>

Update tests/test-chat-parser.cpp

befa31c

Co-authored-by: Sigbjørn Skjæret <[email protected]>

DeepSeek V3.1 fix reasoning_format none

7795594
