Replay & Fork: rerun any past conversation through new rules
Pick a past inbound, optionally override the system prompt or append a candidate rule, and see what the agent would have said. Pure dry run — no sends, no writes.
Updated June 26, 2026
Tuning agents in production is risky if you're flying blind. Replay
lets you re-run any past inbound through any version of your prompt or
rules and see, side-by-side, exactly what the agent would have said.
Open it from any agent → Replay tab.
What it does
For any MessageLog entry, replay:
- Reconstructs the conversation up to that inbound
- Re-runs the agent with current settings (or your overrides)
- Shows the original reply alongside the new reply, plus tools each
It's a complete dry run:
- No SMS / email / chat sends
- No CRM writes (every write tool no-ops with a sandbox marker)
- No MessageLog row created
- No charges to your monthly usage cap (other than the LLM tokens for
Workflow: tuning a system prompt
- Open Replay
- Pick an inbound the agent handled poorly
- Click Overrides → Override system prompt
- Paste your candidate prompt
- Click Replay
- Compare the original vs new reply
Iterate until you're happy, then update the agent's actual system prompt
with the winning version.
Workflow: testing a candidate rule
You wrote a new rule (e.g. "Always offer Tuesday slots first") and want
to know if it would actually fire on real conversations.
- Open Replay
- Pick a conversation where the rule should have fired
- Overrides → Append additional instructions
- Paste the candidate rule
- Click Replay and check if the agent followed it
This is faster than adding the rule to the agent, hoping the next
inbound exercises it, and reverting if not.
Workflow: spotting regressions
Before pushing a big prompt change, replay 5–10 of your most recent
conversations. If the new prompt produces worse replies on the easy
cases, that's a red flag — even if it fixes the hard one you started
with.
Limits
- Replay uses message history reconstructed from MessageLog rows, not
- Tools that need real CRM data still hit your CRM (read-only) so the
- Long conversations only include the last ~8 turns of context — same
Tips
- The contactId is mangled (
playground-replay-…) so write tools - The "tools used" chip lists each tool the new agent called; useful