code.python.tests_pass

HARD

Writes agent-generated Python code to a temp directory and runs pytest via the sandbox runner. Score is based on the fraction of tests that pass.
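The flow described above can be sketched roughly as follows. This is an illustrative approximation, not the verifier's actual implementation: the function names `run_and_score` and `pass_fraction`, the file names `solution.py` and `test_solution.py`, and the 60-second timeout are all assumptions, and a plain `subprocess` call stands in for the sandbox runner (it provides no isolation).

```python
import re
import subprocess
import sys
import tempfile
from pathlib import Path


def pass_fraction(pytest_output: str) -> float:
    """Parse pytest's final summary line and return passed / total."""
    lines = [ln for ln in pytest_output.splitlines() if ln.strip()]
    summary = lines[-1] if lines else ""
    counts = {}
    for outcome in ("passed", "failed", "error"):
        # Matches e.g. "2 passed", "1 failed", "1 error" or "3 errors".
        match = re.search(rf"(\d+) {outcome}", summary)
        counts[outcome] = int(match.group(1)) if match else 0
    total = sum(counts.values())
    # No recognizable results (e.g. nothing collected) scores 0.0.
    return counts["passed"] / total if total else 0.0


def run_and_score(code: str, tests: str) -> float:
    """Write code and tests to a temp directory, run pytest, score the run.

    Hypothetical stand-in for the sandbox runner: requires pytest to be
    installed and does NOT isolate the executed code.
    """
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "solution.py").write_text(code)
        Path(tmp, "test_solution.py").write_text(tests)
        proc = subprocess.run(
            [sys.executable, "-m", "pytest", "-q", tmp],
            capture_output=True,
            text=True,
            cwd=tmp,
            timeout=60,  # assumed limit; the real runner's limit is not documented here
        )
    return pass_fraction(proc.stdout)
```

For a run whose summary line reads "1 failed, 2 passed", `pass_fraction` returns 2/3. Parsing the summary line keeps the sketch short; a sturdier variant might read pytest's `--junitxml` report instead.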

Scorecard

Determinism: deterministic
Evidence Quality: hard-state
Intended Use: eval-and-train
Gating Required: No
Permissions: fs:write_tmp, exec:pytest

Test Fixtures

9 total
Type         Count
Positive     3
Negative     3
Adversarial  3

Metadata

Version: 0.1.0
Domain: code
Task Type: test_execution
Contributor: vr.dev
Source: code-quality

Use in SDK

# CLI
vr verify --verifier code.python.tests_pass --ground-truth '{"order_id": "ORD-42"}'

# Python
from vrdev import verify
result = verify("code.python.tests_pass", ground_truth={"order_id": "ORD-42"})

# API
curl -X POST https://api.vr.dev/v1/verify \
  -H "X-API-Key: vr_live_..." \
  -H "Content-Type: application/json" \
  -d '{"verifier": "code.python.tests_pass", "ground_truth": {"order_id": "ORD-42"}}'