Explore Help

Dark-Alex-17/loki

1

0

You've already forked loki

Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity

Files

63b6678e73057df8f90ad0704e83b719943eec44

loki/docs/testing/plans/06-tool-evaluation.md

Alex Clarke 63b6678e73

testing

2026-04-15 12:56:00 -06:00

2.2 KiB

Raw Blame History

Test Plan: Tool Evaluation

Feature description

When the LLM returns tool calls, eval_tool_calls dispatches each call to the appropriate handler. Handlers include: shell tools (bash/python/ts scripts), MCP tools, supervisor tools (agent spawn), todo tools, and user interaction tools.

Behaviors to test

eval_tool_calls dispatch

Calls dispatched to correct handler by function name prefix
Tool results returned for each call
Multiple concurrent tool calls processed
Tool call tracker updated (chain length, repeats)
Root agent (depth 0) checks escalation queue after eval
Escalation notifications injected into results

ToolCall::eval routing

agent__* → handle_supervisor_tool
todo__* → handle_todo_tool
user__* → handle_user_tool (depth 0) or escalate (depth > 0)
mcp_invoke_* → invoke_mcp_tool
mcp_search_* → search_mcp_tools
mcp_describe_* → describe_mcp_tool
Other → shell tool execution

Shell tool execution

Tool binary found and executed
Arguments passed correctly
Environment variables set (LLM_OUTPUT, etc.)
Tool output returned as result
Tool failure → error returned as tool result (not panic)

Tool call tracking

Tracker counts consecutive identical calls
Max repeats triggers warning
Chain length tracked across turns
Tracker state preserved across tool-result loops

Function selection

select_functions filters by role's enabled_tools
select_functions includes MCP meta functions for enabled servers
select_functions includes agent functions when agent active
"all" enables all functions
Comma-separated list enables specific functions

Context switching scenarios

Tool calls during agent → agent tools available
Tool calls during role → role tools available
Tool calls with MCP → MCP invoke/search/describe work
No agent → no agent__/todo__ tools in declarations

Old code reference

src/function/mod.rs — eval_tool_calls, ToolCall::eval
src/function/supervisor.rs — handle_supervisor_tool
src/function/todo.rs — handle_todo_tool
src/function/user_interaction.rs — handle_user_tool

Reference in New Issue View Git Blame Copy Permalink

Powered by Gitea Version: 1.26.0+dev-405-g45ee571693 Page: 52ms Template: 5ms

Auto

English

Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語简体中文繁體中文（台灣）繁體中文（香港） 한국어

Licenses API