You can now A/B test prompts in production. Publish two or more prompt versions, start an experiment with configurable traffic weights, and let Grepture route requests automatically. An auto-created Relevance evaluator scores each variant so you can pick the winner based on real data — not gut feel.