Deterministic Refunds: Agent Script Guardrails in Practice

The scenario

Falcon Retail's refund policy has three rules: the order was delivered within the last 30 days, the item is not final-sale, and a customer gets at most two refunds per year. A purely instruction-driven agent gets this mostly right, and "mostly" is unacceptable when money moves. A sweet-talking customer, an ambiguous date, a long conversation that pushes the policy out of context… and the model approves a refund it shouldn't.

The principle: let the LLM handle the conversation, let code handle the consequences. That's exactly what hybrid reasoning with Agent Script (public beta since November 2025) is for.

Instructions vs. script: the crucial difference

A natural-language instruction like "Only refund if the order was delivered within 30 days" is a suggestion the model usually follows. An Agent Script condition is a branch the model cannot skip — the Atlas Reasoning Engine executes it deterministically, like code, because it is code.

The refund topic in Agent Script

topic refund_request:
  description: "Handle a customer's request to refund a delivered order"
  reasoning:
    instruction: |
      The customer wants a refund for order {{@variable.order_number}}.
      Be empathetic. Gather the order number if missing, then check
      eligibility before promising anything:
      {{@action.Check_Refund_Eligibility}}
    actions:
      Check_Refund_Eligibility:
        with order_number = @variable.order_number
        set @variable.eligible     = @result.is_eligible
        set @variable.deny_reason  = @result.reason

  # ── Deterministic guardrails — not up for negotiation ──
  if @variable.eligible == false:
    @utils.transition to @topic.explain_denial

  if @variable.eligible == true:
    @utils.transition to @topic.process_refund

topic process_refund:
  description: "Execute an approved refund"
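  reasoning:
    instruction: |
      Confirm the refund amount with the customer, then execute:
      {{@action.Issue_Refund}}
      Share the confirmation number and expected timeline.
    actions:
      # Declared only in this topic, so no earlier topic can call it
      Issue_Refund:
        with order_number = @variable.order_number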

The eligibility rules (30 days, final-sale, refund count) live inside the Check_Refund_Eligibility action — a Flow or Apex invocable you can unit-test.
The two if conditions and their transitions are evaluated programmatically. No phrasing, mood, or prompt injection changes which branch is taken.
process_refund is only reachable through an approved check — the money-moving action isn't even in the first topic's toolbox.

Design rule worth memorizing: if a step has consequences (payments, deletions, permissions), gate it behind a scripted condition and keep its action out of reach of earlier topics.
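
The design rule above can be modeled in a few lines of ordinary code. The sketch below is a hypothetical Python model, not Agentforce's implementation: TOPIC_ACTIONS, next_topic, and invoke are illustrative names, and the empty toolbox for explain_denial is an assumption, since that topic is not shown in the script. What it illustrates is that the transition looks only at the action result and that a topic can call only the actions in its own toolbox.

```python
# Hypothetical model of topic-scoped toolboxes. Topic and action names mirror
# the script above; the explain_denial toolbox is assumed to be empty.
TOPIC_ACTIONS = {
    "refund_request": {"Check_Refund_Eligibility"},
    "explain_denial": set(),
    "process_refund": {"Issue_Refund"},
}


def next_topic(eligible):
    """Scripted transition: depends only on the action result, never on chat text."""
    if eligible is True:
        return "process_refund"
    if eligible is False:
        return "explain_denial"
    return "refund_request"  # eligibility check has not run yet, so stay put


def invoke(topic, action):
    """Refuse any action that is not in the current topic's toolbox."""
    if action not in TOPIC_ACTIONS[topic]:
        raise PermissionError(f"{action} is not available in topic {topic}")
    return f"{action} executed"


# The money-moving action is out of reach from the first topic.
try:
    invoke("refund_request", "Issue_Refund")
except PermissionError as err:
    print(err)

# It becomes callable only after the scripted branch selects process_refund.
print(invoke(next_topic(True), "Issue_Refund"))
```

Because next_topic never sees the conversation, persuasive wording has nothing to act on. The only way to change the branch is to change the eligibility result.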

Testing the guardrails

Builder test panel: try to social-engineer your own agent — "my grandma bought this 90 days ago but she's sick" — and verify the denial branch fires with empathy but no refund.
Testing Center: load a suite of edge-case utterances (day 29, day 31, final-sale, third refund of the year) and assert which topic and action fired for each.
Apex tests: the eligibility logic is plain code — test it exhaustively there, where it's cheap.
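
To make the edge cases concrete, here is a minimal sketch of the eligibility rules with a table-driven test. It is written in Python for illustration only; a real implementation would be the Flow or Apex invocable behind Check_Refund_Eligibility. The function signature, the reason strings, the order in which rules are checked, and the choice to treat day 30 as still inside the window are all assumptions, not Falcon Retail's documented behavior.

```python
from datetime import date, timedelta

REFUND_WINDOW_DAYS = 30    # "delivered within 30 days"
MAX_REFUNDS_PER_YEAR = 2   # "max two refunds per customer per year"


def check_refund_eligibility(delivered_on, today, final_sale, refunds_this_year):
    """Return (is_eligible, reason). Assumes day 30 itself still qualifies."""
    days = (today - delivered_on).days
    if days < 0:
        return False, "not_yet_delivered"   # assumed reason code
    if days > REFUND_WINDOW_DAYS:
        return False, "outside_window"
    if final_sale:
        return False, "final_sale"
    if refunds_this_year >= MAX_REFUNDS_PER_YEAR:
        return False, "refund_limit_reached"
    return True, ""


TODAY = date(2026, 3, 31)  # fixed date so the suite is repeatable

CASES = [
    # (days since delivery, final_sale, refunds_this_year, expected result)
    (29, False, 0, (True, "")),                       # just inside the window
    (30, False, 0, (True, "")),                       # boundary day, assumed inclusive
    (31, False, 0, (False, "outside_window")),        # one day too late
    (5,  True,  0, (False, "final_sale")),            # final-sale item
    (5,  False, 1, (True, "")),                       # second refund of the year
    (5,  False, 2, (False, "refund_limit_reached")),  # third refund of the year
]

for days_ago, final_sale, refunds, expected in CASES:
    delivered = TODAY - timedelta(days=days_ago)
    actual = check_refund_eligibility(delivered, TODAY, final_sale, refunds)
    assert actual == expected, (days_ago, final_sale, refunds, actual)

print(f"{len(CASES)} edge cases passed")
```

Passing today in as a parameter, rather than reading the clock inside the function, is what keeps the day 29, 30, and 31 cases stable from one test run to the next.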

Want working examples to poke at? Salesforce published an Agent Script Recipes sample app with runnable hybrid-reasoning patterns.

What you learned

Hybrid reasoning = LLM flexibility for conversation + programmatic certainty for policy.
Agent Script conditions and transitions are enforced by the reasoning engine, not "followed" by the model.
Beta caveat: the language is still evolving, so keep scripts in version control and re-test after every release upgrade.

Sources: Introducing Hybrid Reasoning with Agent Script · Agent Script — Agentforce Developer Guide
