A/B Test Calculator

Significance · sample size · confidence interval · power

Control (A)

Variant (B)

Not Significant

Not enough evidence at the 95% confidence level. Consider running longer.

Control Rate

3.00%

Variant Rate

3.70%

Relative Lift

+23.3%

Absolute: +0.70%

p-value

0.3432

Z = 1.945

95% Confidence Interval for the Difference

0
-0.01%+0.70%1.41%

The interval crosses zero. The true effect could be positive, negative, or zero.

Statistical Power Achieved

49.4%

Low power (< 80%). The test may not have had enough visitors to reliably detect this effect. Consider a larger sample.

Show the math
// Conversion rates
p_control = 3.0000%
p_variant = 3.7000%
// Pooled proportion
p_pooled = 0.033500
// Pooled standard error
SE_pooled = sqrt(p_pooled * (1 - p_pooled) * (1/n1 + 1/n2))
SE_pooled = 0.003599
// Z-score
Z = (p_variant - p_control) / SE_pooled
Z = 1.945112
// Two-tailed p-value
p_value = 2 * (1 - Phi(|Z|))
p_value = 0.343221
// Confidence interval
SE_diff = sqrt(p1*(1-p1)/n1 + p2*(1-p2)/n2)
SE_diff = 0.003598
CI = (0.7000%) +/- 1.9600 * 0.003598
CI = [-0.0052%, 1.4052%]

About this tool

What does the A/B test calculator tell me?
It calculates statistical significance for your experiment: whether the difference between variant A and variant B is likely real or due to chance. Enter your control and variant visitors and conversions to get a p-value, confidence level, and relative lift.
What p-value should I target for a valid A/B test?
Most practitioners use p ≤ 0.05 (95% confidence) as the minimum bar. High-stakes decisions (pricing, checkout) often require p ≤ 0.01 (99%). The calculator shows both.
How do I use the sample size planner?
Enter your current baseline conversion rate, the minimum detectable effect (the smallest lift you care about), and your desired confidence level. The planner returns the number of visitors each variant needs before you can trust the result.
Is my test data sent to a server?
No. All calculations run client-side in your browser. Paste sensitive conversion data without concern.
Can I run it with just visitors and conversions?
Yes. The Results tab accepts visitors + conversions for control and variant. The Sample Size tab works from baseline rate + MDE alone.
About

Two modes: Significance (paste visitor + conversion counts to see if the lift is real) and Sample size (plan how many visitors you need).

  • p-value: probability the difference is random.
  • Power: ability to detect a real effect. 80% is standard.
  • MDE: smallest lift you can reliably catch.

Tip: never peek; never test 20 metrics; run for a full week minimum.