make deepseekv3 renderer work with system messages, add renderer that forces thinking #79
base: main
Conversation
    Renderer that forces inclusion of a thinking block for DsV3 models.
    """

def _render_message(self, message: Message) -> tuple[list[int], list[int], list[int]]:
I copied this from the NoThinking renderer. Not sure exactly why it's here.
if new_content.startswith("</think>"):
    new_content = new_content[len("</think>") :]
if not new_content.startswith("<think>"):
    new_content = "<think>" + new_content
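Read on its own, the snippet normalizes assistant content so it always begins with an opening thinking tag. A minimal standalone sketch of the same logic (the function name `force_thinking_prefix` is illustrative, not from the PR):

```python
def force_thinking_prefix(new_content: str) -> str:
    """Normalize content so it starts with an opening <think> tag.

    If the model emitted a stray closing tag first, drop it; then
    prepend <think> whenever it is missing.
    """
    if new_content.startswith("</think>"):
        new_content = new_content[len("</think>") :]
    if not new_content.startswith("<think>"):
        new_content = "<think>" + new_content
    return new_content
```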
Do we want to support/use the Message's 'thinking' field here, like gpt-oss does? If we also want to support `<think>` directly in the content, should we enforce here that there is a matching `</think>` followed by an actual message after it? And for multi-message interactions, should the thinking tokens be rendered in all previous assistant messages, or be kept only in the last message as in gpt-oss?
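If we did enforce the closing-tag invariant raised here, it might look something like the sketch below (the helper name `split_thinking` and the ValueError policy are assumptions, not anything from the PR):

```python
import re


def split_thinking(content: str) -> tuple[str, str]:
    """Split '<think>...</think>answer' into (thinking, answer).

    Hypothetical validation helper: rejects content that lacks a
    closing </think>, or that has no actual message after it.
    """
    match = re.match(r"<think>(.*?)</think>(.*)", content, flags=re.DOTALL)
    if match is None:
        raise ValueError("content must contain <think>...</think>")
    thinking, answer = match.group(1), match.group(2)
    if not answer.strip():
        raise ValueError("expected a non-empty message after </think>")
    return thinking, answer
```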
Good questions. I haven't thought these through yet. No rush to merge this one -- I'm still experimenting with it in my use case.
Re: the thinking field, we might as well prioritize supporting the most advanced tool-use systems being released, which include gpt-oss and kimi-k2-thinking; both use interleaved CoT + tool calls. So I'm in favor of whatever improves the support for those models.
I'm not sure what the right policy is for possibly hiding CoT -- in gpt-oss, I assume we include all the thinking traces from the current turn (i.e., back until the last user message) in cases where we have multiple assistant messages, including tool calls?
Yes, in gpt-oss we include thinking tokens from the current turn, even if there are multiple assistant channel messages.
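The policy described here (keep CoT only for messages at or after the last user message) could be sketched roughly as follows. Plain-dict messages and the name `strip_stale_thinking` are assumptions for illustration, not the renderer's actual API:

```python
import re


def strip_stale_thinking(messages: list[dict]) -> list[dict]:
    """Drop <think>...</think> blocks from assistant messages that come
    before the last user message, keeping CoT only for the current turn
    (the gpt-oss-style policy discussed above)."""
    # Index of the last user message; everything from there on is the
    # "current turn" and keeps its thinking tokens.
    last_user = max(
        (i for i, m in enumerate(messages) if m["role"] == "user"), default=-1
    )
    out = []
    for i, m in enumerate(messages):
        if m["role"] == "assistant" and i < last_user:
            content = re.sub(r"<think>.*?</think>", "", m["content"], flags=re.DOTALL)
            m = {**m, "content": content}
        out.append(m)
    return out
```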