Move command requests to card message matrix events and add support for multiple commands per message #2145

lukemelia · 2025-02-13T17:09:29Z

This PR is breaking in a sense -- previous commands added to matrix rooms will no longer be visible since this PR changes the way we are encoding command requests (tool calls) into matrix events.

The motivation for this change is that tool-call-enabled LLMs have the ability to make multiple tool calls in a single response, and our modeling made that quite awkward. This PR makes that case normal with full support.

This PR also changes the way that debouncing/throttling works in the aibot. It handles response streaming better, ensuring an event is sent to matrix at most once every 250ms and the response is encoded and delivered in full.
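A minimal sketch of the throttling behavior described above, not the aibot's actual code. The class name `ThrottledSender`, its methods, and the injectable clock are all hypothetical. It sends the accumulated response at most once per interval while chunks stream in, then flushes whatever is pending when the stream ends so the full response is always delivered.

```typescript
// Hypothetical names throughout; this is an illustration, not the aibot implementation.
type Send = (body: string) => void;

class ThrottledSender {
  private lastSentAt = -Infinity;
  private pending: string | undefined;

  constructor(
    private send: Send,
    private intervalMs = 250,
    // Injectable clock so the behavior can be tested without real timers.
    private now: () => number = Date.now,
  ) {}

  // Call for every streamed chunk, passing the full response accumulated so far.
  update(body: string): void {
    this.pending = body;
    if (this.now() - this.lastSentAt >= this.intervalMs) {
      this.flush();
    }
  }

  // Call once when the stream ends so the final content is never dropped.
  finalize(): void {
    this.flush();
  }

  private flush(): void {
    if (this.pending === undefined) return;
    this.send(this.pending);
    this.pending = undefined;
    this.lastSentAt = this.now();
  }
}
```

In this sketch the final flush goes out immediately, even if the previous send was less than 250ms ago. A stricter version could delay it until the interval has elapsed.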

Finally, type-checking and linting in the aibot package is improved in this PR too. It was a little loose before.

UPDATE: ~~This PR has some issues consuming the stream from OpenRouter. I'm investigating.~~ Resolved!

- Add host tests for multiple tool calls
- Add aibot tests for multiple tool calls with results
- Test coverage / implementation for command failures
- Ensure code TODOs are addressed

github-actions · 2025-02-13T17:21:15Z

Host Test Results

    1 files ±0     1 suites ±0 25m 25s ⏱️ + 1m 21s
795 tests +2 793 ✔️ +2 2 💤 ±0 0 ❌ ±0
800 runs +2 798 ✔️ +2 2 💤 ±0 0 ❌ ±0

Results for commit 8b4e305. ± Comparison against base commit 0e42fb1.

♻️ This comment has been updated with latest results.

backspace

I’m approving after having read this over, with the caveat that I couldn’t get it running locally. The bot was crashing like this:

/Users/b/Documents/Cardstack/Code/boxel-motion/packages/ai-bot/helpers.ts:196
      let { attachedCardsEventIds } = event.content.data;
            ^
TypeError: Cannot destructure property 'attachedCardsEventIds' of 'event.content.data' as it is undefined.
    at constructHistory (/Users/b/Documents/Cardstack/Code/boxel-motion/packages/ai-bot/helpers.ts:196:13)

Is this the breaking change mentioned in the PR description? Do I need to clear my Matrix history?

lukemelia · 2025-03-04T22:10:34Z

> Is this the breaking change of the PR description, do I need to clear my Matrix history?

It's not supposed to crash. Thanks for the error message. I'll see if I can fix it.
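For illustration, one way to guard against older events whose content has no `data` node, which is the case the stack trace above points at. This is a sketch under assumptions: the `RoomMessageEvent` shape and the helper name `getAttachedCardsEventIds` are hypothetical and simplified, and the actual fix in the PR may look different.

```typescript
// Hypothetical, simplified event shape; the real types live in the repository.
interface RoomMessageEvent {
  content: {
    body?: string;
    data?: { attachedCardsEventIds?: string[] };
  };
}

// Returns the attached card event ids, or an empty list for older events
// whose content was written without a `data` node.
function getAttachedCardsEventIds(event: RoomMessageEvent): string[] {
  return event.content.data?.attachedCardsEventIds ?? [];
}
```

Optional chaining plus a default keeps history construction working across both the old and new encodings instead of throwing while destructuring.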

IanCal · 2025-03-05T22:52:00Z

I ran this and managed to get 3.5 Sonnet to return two calls for a command on a skill. I noticed three things: there wasn't a description with the options; it replied when the first one was applied (unsure if those two were intentional here, I know we spoke about different options); and it didn't like sending the reaction event for the second command.

Looks like we are using m.annotation, and while that doesn't seem to be against the spec, Synapse flags it as two reactions.

IanCal · 2025-03-06T11:06:14Z

Replacing m.annotation with a custom rel_type seems to work for that issue.
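A rough sketch of what a command result relation with a custom relation type could look like. The `m.relates_to` object with `rel_type`, `event_id`, and `key` follows the Matrix relation format that `m.annotation` uses. Everything else is an assumption: the identifier `com.example.command_result`, the status values, and the helper name are hypothetical, and the source does not say what rel_type was actually adopted.

```typescript
// Hypothetical relation type; the source does not name the real identifier.
const COMMAND_RESULT_REL_TYPE = 'com.example.command_result';

// Hypothetical status values used as the relation key.
type CommandStatus = 'applied' | 'failed';

interface CommandResultContent {
  'm.relates_to': { rel_type: string; event_id: string; key: CommandStatus };
  // Identifies which command request in the message this result belongs to.
  commandRequestId: string;
}

// Builds the content for a result event that points back at the message
// containing the command requests.
function buildCommandResultContent(
  messageEventId: string,
  commandRequestId: string,
  status: CommandStatus,
): CommandResultContent {
  return {
    'm.relates_to': {
      rel_type: COMMAND_RESULT_REL_TYPE,
      event_id: messageEventId,
      key: status,
    },
    commandRequestId,
  };
}
```

Because the relation is no longer an `m.annotation`, two results that target the same message with the same key are not interpreted as a repeated reaction.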

jurgenwerk · 2025-03-06T12:10:23Z

backspace · 2025-03-06T21:17:03Z

I copied Matic’s example and it seems to have worked:

[Screenshot: California Beach Volleyball Sunset, Beach Decor, Beach Wall Art 2025-03-06 13-14-56]

For that screenshot I pressed “Apply” on the first. The blankness in the response seems confusing but I did inspect the Matrix message and saw the command was properly in an array.

Is there a more thorough way to exercise this, to get multiple commands in a response?

lukemelia · 2025-03-06T21:38:40Z

> Is there a more thorough way to exercise this, to get multiple commands in a response?

Interesting -- that prompt was enough to get multiple commands in one message for me

jurgenwerk · 2025-03-07T09:18:14Z

I tested this again and I'm a little confused about how multiple commands per message should work. I would expect there would be multiple commands within the same message, but when I test it, the commands will be in multiple messages. Is this expected? Or is there an issue with the test prompt?

Here is my prompt:

jurgenwerk · 2025-03-07T11:20:57Z

Should there be a description of the action to be applied?

Convert tool calls to CommandRequests on boxel message events, and support multiple command requests per message

b40c860

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch 2 times, most recently from b762a09 to 8d52ce0 Compare February 14, 2025 04:07

WIP Update host types for changes to how command requests are encoded

c7a2bd6

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch from 8d52ce0 to c7a2bd6 Compare February 14, 2025 04:31

lukemelia added 8 commits February 17, 2025 11:24

Add aibot test coverage for handling multiple tool calls in one response

e927512

aibot should not respond to CommandResultEvent until all tool calls have results and at least one has output

899da9b
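The rule in the commit title above can be sketched as a small predicate. This is an illustration only: the `CommandRequest` and `CommandResult` shapes and the function name `shouldRespondToCommandResults` are hypothetical, not the repository's types.

```typescript
// Hypothetical, simplified shapes for illustration.
interface CommandRequest {
  id: string;
  name: string;
}

interface CommandResult {
  commandRequestId: string;
  // Present only when the command produced output worth reporting back to the LLM.
  output?: string;
}

// True only when every request in the message has a result and at least one
// of those results carries output.
function shouldRespondToCommandResults(
  requests: CommandRequest[],
  results: CommandResult[],
): boolean {
  const byRequestId = new Map<string, CommandResult>();
  for (const result of results) {
    byRequestId.set(result.commandRequestId, result);
  }
  const allHaveResults = requests.every((req) => byRequestId.has(req.id));
  const anyHasOutput = requests.some(
    (req) => byRequestId.get(req.id)?.output !== undefined,
  );
  return allHaveResults && anyHasOutput;
}
```

Waiting for every result avoids the bot replying after the first of several commands completes.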

Merge branch 'return-of-the-skill-commands' into cs-7993-support-multiple-tool-calls-per-ai-response

69acba1

Merge branch 'main' into cs-7993-support-multiple-tool-calls-per-ai-response

66cc8cc

Fix aibot tests

622d887

Fix and improve test of executing command that came from a skill card

5d4dcaf

host app support for multiple command requests in one message

b9bcbec

Before trying to load command code refs, we need to ensure they are using absolute URLs

bbbf2c4

lukemelia commented Mar 3, 2025

packages/runtime-common/helpers/ai.ts (outdated, resolved)

lukemelia commented Mar 3, 2025

packages/base/matrix-event.gts (outdated, resolved)

lukemelia commented Mar 3, 2025

packages/base/matrix-event.gts (outdated, resolved)

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch from ddeea6e to 1f5b5c6 Compare March 3, 2025 16:21

Merge branch 'main' into cs-7993-support-multiple-tool-calls-per-ai-response

9979d51

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch from 1f5b5c6 to 9979d51 Compare March 3, 2025 16:56

lukemelia added 6 commits March 3, 2025 16:28

WIP retry failed command

6a87a81

Retrying failed commands and failed command state

af729d4

Move commandRequestId out of data node

264eebc

Rename and move CommandRequestContent interface

141eb81

Minor tweaks

3911dc8

Merge branch 'main' into cs-7993-support-multiple-tool-calls-per-ai-response

3ad64b3

lukemelia changed the title ~~WIP move tool calling to boxel message events and add support for multiple tool calls/commands~~ Move command requests to card message matrix events and add support for multiple commands per message Mar 4, 2025

lukemelia requested review from IanCal and a team March 4, 2025 17:46

lukemelia marked this pull request as ready for review March 4, 2025 17:46

backspace approved these changes Mar 4, 2025

packages/host/tests/acceptance/commands-test.gts (resolved)

packages/host/tests/acceptance/commands-test.gts (outdated, resolved)

packages/host/app/components/matrix/room-message-command.gts (outdated, resolved)

lukemelia added 3 commits March 4, 2025 17:32

Defensive check for attachedCardEventIds for backwards compat.

44bfe64

Fix real-world streaming

1ff046c

Restore scroll into view behavior

bdfcfc2

lukemelia requested a review from backspace March 5, 2025 18:37

Fix assertion message

c5ec57a

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch from 2fda108 to c5ec57a Compare March 5, 2025 18:42

lukemelia mentioned this pull request Mar 5, 2025

Show ApplyButton's preparing state when a command is being prepared #2120

Merged

2 tasks

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch 2 times, most recently from 8795782 to 03c51ea Compare March 6, 2025 17:00

Adopt custom rel_type for command results

53a898d

lukemelia force-pushed the cs-7993-support-multiple-tool-calls-per-ai-response branch from 03c51ea to 53a898d Compare March 6, 2025 17:36

lukemelia added 3 commits March 6, 2025 14:02

Fix how command results are added to prompt when there is no body

0d84b50

Add coverage for responder.ensureThinkingMessageSent behavior

8d2d4c6

Merge branch 'main' into cs-7993-support-multiple-tool-calls-per-ai-response

8b4e305

IanCal approved these changes Mar 10, 2025

lukemelia merged commit 160dab0 into main Mar 10, 2025
53 checks passed
