Open-AutoGLM can tap, type, and submit forms. That power requires guardrails. This tutorial collects best practices for preventing destructive actions during evaluation.
Examples include:
If the action cannot be easily undone, treat it as destructive.
Create a simple risk scale to decide how strict your guardrails should be:
Require stronger approvals as risk increases.
Require a human confirmation step before any destructive action. The agent should explicitly ask:
I am about to submit a payment. Do you want me to proceed? (yes/no)In dry run, the agent lists its intended actions without executing them. This is ideal for early evaluation:
Plan: tap Settings > Billing > Cancel Subscription.
Awaiting confirmation to execute.Limit which screens the agent can interact with. If the agent navigates outside allowed areas, stop the run.
Ask the agent to read back the action before executing:
I am about to tap "Delete account". Confirm yes/no.This catches mis‑read UI elements.
Always log:
This helps you reproduce and review any unexpected behavior.
Whenever possible:
This reduces the blast radius of mistakes.
Use checkpoints at:
See the human‑in‑the‑loop guide for patterns.
The agent must:
1) Ask before tapping any button labeled Delete, Remove, or Purchase.
2) Stop if a payment form is detected.
3) Require a human to enter credentials or OTP codes.Before shipping a workflow:
Waitlist
Get notified when guided Android regression testing workflows and safety checklists are ready.
We only use your email for the waitlist. You can opt out anytime.