Replay & Fork: rerun any past conversation through new rules

Pick a past inbound, optionally override the system prompt or append a candidate rule, and see what the agent would have said. Pure dry run — no sends, no writes.

Updated June 26, 2026

Tuning agents in production is risky if you're flying blind. Replay
lets you re-run any past inbound through any version of your prompt or
rules and see, side-by-side, exactly what the agent would have said.

Open it from any agent → Replay tab.

What it does

For any MessageLog entry, replay:

Reconstructs the conversation up to that inbound
Re-runs the agent with current settings (or your overrides)
Shows the original reply alongside the new reply, plus tools each

It's a complete dry run:

No SMS / email / chat sends
No CRM writes (every write tool no-ops with a sandbox marker)
No MessageLog row created
No charges to your monthly usage cap (other than the LLM tokens for

Workflow: tuning a system prompt

Open Replay
Pick an inbound the agent handled poorly
Click Overrides → Override system prompt
Paste your candidate prompt
Click Replay
Compare the original vs new reply

Iterate until you're happy, then update the agent's actual system prompt
with the winning version.

Workflow: testing a candidate rule

You wrote a new rule (e.g. "Always offer Tuesday slots first") and want
to know if it would actually fire on real conversations.

Open Replay
Pick a conversation where the rule should have fired
Overrides → Append additional instructions
Paste the candidate rule
Click Replay and check if the agent followed it

This is faster than adding the rule to the agent, hoping the next
inbound exercises it, and reverting if not.

Workflow: spotting regressions

Before pushing a big prompt change, replay 5–10 of your most recent
conversations. If the new prompt produces worse replies on the easy
cases, that's a red flag — even if it fixes the hard one you started
with.

Limits

Replay uses message history reconstructed from MessageLog rows, not
Tools that need real CRM data still hit your CRM (read-only) so the
Long conversations only include the last ~8 turns of context — same

Tips

The contactId is mangled (playground-replay-…) so write tools
The "tools used" chip lists each tool the new agent called; useful