Evals · A/B tests
A/B tests.
Compare two agent-version snapshots against the same visitor traffic. The orchestrator pins each visitor to a variant with a sticky hash; concluding the test optionally promotes the winner's version as the agent's current snapshot.
Filter by agent
No A/B tests match these filters.