Design the multi-arm test
Arms & target conversion rates
Specify the expected conversion or response rate for the control and each variant. The planner will compute the minimum per-arm sample size to detect the differences you have entered at your chosen confidence and power.
Arms, target means, and variability
Specify the expected mean outcome for the control and each variant, along with a common standard deviation. The planner assumes roughly equal variability across arms.
Rough measure of spread around each arm’s mean (for example, dollars or points).
Lift vs. control focuses on detecting a meaningful improvement for each variant separately. Omnibus focuses on detecting that at least one arm differs meaningfully from control.
Confidence = 1 - alpha for a two-sided test.
Common choices are 80% or 90% power.
Advanced settings
Two-sided tests are standard when any increase or decrease matters. One-sided tests can be used when you only care about improvements relative to control, but they ignore strong evidence of a decrease.