Back to Registry

aiv.shell.state_probe

AGENTIC

Executes a sandboxed read-only shell command and compares stdout to an expected value, probing whether an agent action actually changed system state.

Scorecard

Determinismdeterministic
Evidence Qualityhard-state
Intended Useeval-and-train
Gating RequiredNo
Permissions
subprocess:readonly

Attack Surface

injection risk
low
format gaming risk
low
tool spoofing risk
low

Test Fixtures

9 total
TypeCount
Positive3
Negative3
Adversarial3

Metadata

Version0.1.0
Domainaiv
Task Typeshell_state_verification
Contributorvr.dev
SourcearXiv:2602.00575

Use in SDK

# CLI
vr verify --verifier aiv.shell.state_probe --ground-truth '{"order_id": "ORD-42"}'

# Python
from vrdev import verify
result = verify("aiv.shell.state_probe", ground_truth={"order_id": "ORD-42"})

# API
curl -X POST https://api.vr.dev/v1/verify \
  -H "X-API-Key: vr_live_..." \
  -d '{"verifier": "aiv.shell.state_probe", "ground_truth": {"order_id": "ORD-42"}}'
Verifier Registry | vr.dev