
Conversation

@Rose22 Rose22 commented Nov 21, 2025

As we've discussed in Discord,

this change alters the way koboldcpp determines which tool to use in some pretty drastic ways that vastly improve its accuracy, especially with small LLMs. Instead of making a single request to the LLM asking whether a tool should be used, forced down to 5 tokens with a grammar that only allows a simple yes/no answer, it now gives the LLM full freedom to write out its decision and the reasoning behind it, with the final decision always stated at the end of the response. Then we take that response and apply the yes/no grammar to it instead!
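The two-pass flow described above could be sketched roughly like this. Note that `should_use_tool`, the `generate` callable, and the prompts are all hypothetical stand-ins for illustration, not koboldcpp's actual API or the prompts in this PR:

```python
YESNO_GRAMMAR = 'root ::= ("yes" | "no")'  # GBNF-style grammar permitting only yes/no

def should_use_tool(conversation, tool_list, generate):
    """Two-pass tool decision.

    `generate(prompt, max_tokens, grammar=None)` is a stand-in for a call to
    the koboldcpp generation endpoint; it is NOT the project's real API.
    """
    # Pass 1: let the model reason freely and end with a stated decision.
    reasoning = generate(
        f"{tool_list}\n{conversation}\n"
        "Decide whether a tool should be used. Explain your reasoning, "
        "then state your final decision at the end.",
        max_tokens=256,
    )
    # Pass 2: re-read that free-form answer and force a one-word verdict.
    verdict = generate(
        f"{reasoning}\nBased on the text above, was the final decision "
        "to use a tool? Answer yes or no.",
        max_tokens=3,
        grammar=YESNO_GRAMMAR,
    )
    return verdict.strip().lower().startswith("yes")
```

The key point is that the grammar constraint only applies to the second, trivial classification call, so the model's actual decision-making is never token-starved.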

(this is a redo of the pull request because, due to inexperience with git and GitHub, I messed up my branch with too many merges from upstream)

@LostRuins
Owner

Alright, as requested I moved the tool list into memory so it won't be hurt by context shifting.

Did a bit of tidying of the prompts, but no real functional changes except for removing "If there was no final decision stated, default to no.", which I don't think is necessary.

If everything works well for you we can merge this.

@Rose22
Author

Rose22 commented Nov 22, 2025

> did a bit of tidying of the prompts but no real functional changes except for removing If there was no final decision stated, default to no. which I don't think is necessary.

The reason I had that line is that the LLM would sometimes output something that wasn't reasoning but basically just a standard answer, as if it were inside a conversation. So I decided to mitigate that using JSON. An added benefit is that we can skip most of the other calls to the LLM and get it down to just one!

It will need further testing again, but I believe this is better than before.
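The single-pass JSON approach might look roughly like this. The schema, the `decide_tool_call` helper, and the `generate` callable are illustrative assumptions, not necessarily what was merged:

```python
import json

def decide_tool_call(conversation, tool_list, generate):
    """Single-pass tool decision via a JSON-constrained response.

    `generate` stands in for a JSON-grammar-constrained call to the
    koboldcpp generation endpoint; the schema here is hypothetical.
    """
    raw = generate(
        f"{tool_list}\n{conversation}\n"
        'Reply ONLY with a JSON object of the form '
        '{"reasoning": "...", "tool_needed": true|false, "tool_name": "..."|null}',
        max_tokens=256,
    )
    # With grammar enforcement the output is guaranteed to be valid JSON,
    # so there is no need for a non-JSON fallback path.
    decision = json.loads(raw)
    return bool(decision["tool_needed"]), decision.get("tool_name")
```

Because the reasoning, the yes/no verdict, and the chosen tool all arrive in one structured response, the earlier second pass (and the stray "conversational" answers it had to guard against) disappears entirely.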

@LostRuins
Owner

I gave your new method a try and I think it does work better than before. I have added the JSON enforcement and removed the non-JSON fallback, as it's no longer triggerable.
From my tests it's working just as well as before, if not better; single-pass is also faster.
Do take a look and see if you are happy with this version or if something is still lacking.

I used your prompts but added one more for "required" mode instead of "auto" (as per the OpenAI spec).
I also compacted the text down to a single line, as it was kinda messy (visual change only, same prompt).

Secondary issue: I noticed some problems with the "always send tools at the start" approach you swapped to previously. This is not an issue with tool calls per se, but it's affecting the quality of the no-tool output.

Previously: when tools are not called, they are excluded from the context.

Now: when tools are not called, they are still included at the top.

Result: the AI talks about tools in response to simple questions. For example, if I have a tool called "get_menu" and I ask "what is the meaning of life", it tends to reply "I cannot answer that question as I only have access to the food menu, which does not include the philosophy of life".
If a tool call is NOT needed, we might have to remove tools from the context like before, in order to avoid poisoning the AI's normal replies.
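The suggested fix amounts to making the tool list conditional on the decision. A minimal sketch, assuming a hypothetical `build_prompt` helper (koboldcpp's real prompt assembly, including the tools-in-memory change from this PR, differs):

```python
def build_prompt(conversation, tool_list, tool_call_needed):
    """Prepend the tool list only when a tool call was actually decided on.

    Hypothetical helper for illustration: when no tool is needed, the tool
    descriptions are left out entirely, so they can't leak into (or
    "poison") the model's ordinary conversational replies.
    """
    if tool_call_needed:
        return f"{tool_list}\n{conversation}"
    return conversation
```

The trade-off is that the no-tool path no longer shares a prompt prefix with the tool path, which can cost prompt-cache reuse, but it keeps tool chatter out of answers to unrelated questions like "what is the meaning of life".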

@LostRuins LostRuins left a comment

merging
merging

@LostRuins LostRuins merged commit eeb7363 into LostRuins:concedo_experimental Nov 23, 2025