Conversion Rate Optimization Signup | Features | Pricing | Case Studies | Blog | Login

A/B testing significance calculator (spreadsheet in Excel)

The statistics of A/B testing results can be confusing unless you know exact formulas. Earlier, we had published articles related to mathematics of A/B testing and also have a free A/B testing calculator on the site to see if your results are significant or not. The articles provides an introduction and calculator simply provides an interface; the real formulas used for calculate statistical significance of split testing results are still missing.

Excel sheet with A/B testing formulas

So, we have come up with a FREE spreadsheet which details how exactly the significance is calculated. You just need to provide e thnumber of visitors and conversions for control and variations. The spreadsheet will automatically calculate for you significance, p-value, z-value and other relevant metrics for any kind of split testing (including Adwords). Of course, you can see the relevant formulas in the spreadsheet. Click the screenshot below to download the calculator (spreadsheet):

Click here to download A/B testing significance calculator (excel sheet)

Please feel free to share the file with your friends and colleagues or post it on your blog / twitter.

PS: By the way, if you want to do quick calculations, we have a version of this calculator hosted on Google Docs (please make a copy of the Google Doc sheet into your own account before you make any changes to it).

  1. A/B test duration calculator (Excel spreadsheet)
  2. How many visitors do you need for your A/B test? [Presentation]
  3. What you really need to know about mathematics of A/B split testing
  4. Killer idea: play the split testing game with your boss and colleagues
  5. How to A/B test AdSense revenue using custom channels

28 Comments »

  1. Aren’t all non-converting visitors a mistrial?

    i.e., if I multiple the number of visitors by 10x, but keep the conversions the same, the statistical significance of the results should not change.

    See http://blog.asmartbear.com/easy-statistics-for-adwords-ab-testing-and-hamsters.html

    Comment by Portman — September 28, 2010 @ 12:23 am

  2. @Portman: No, the number of visitors in the test influence the standard deviation and hence the significance. Suppose you have 10 visitors and 2 conversions v/s 1000 visitors and 200 conversions. You have a much better idea of conversion rate in the latter than the former.

    Comment by Paras Chopra — September 28, 2010 @ 12:32 am

  3. [...] A/B testing significance calculator (spreadsheet in Excel) « I love split testing – Visual Websit… You just need to provide number of visitors and conversions for control and variations. The spreadsheet will automatically calculate for you significance, p-value, z-value and other relevant metrics for any kind of split testing (including Adwords). (tags: a/b testing metrics calculator spreadsheet free tools abtesting conversion) [...]

    Pingback by links for 2010-09-30 « Köszönjük, Emese!September 30, 2010 @ 4:40 pm

  4. [...] a site seal, I strongly recommend performing an A-B test for a few months, or at least until some statistical significance is reached, to see if it will be worth spending the money on the seal again. Also make sure you are [...]

    Pingback by What is a trust seal actually worth? « The Ecommerce BlogOctober 29, 2010 @ 12:51 am

  5. http://www.cliffsnotes.com/study_guide/Point-Estimates-and-Confidence-Intervals.topicArticleId-25951,articleId-25932.html

    When you are using the values 1.65 and 1.96 to calculate significance isn’t that the niveau for 90% and 95 % respectively? At least, that’s what I take from the other website.

    Comment by Benjamin DagerothOctober 29, 2010 @ 7:53 pm

  6. @Benjamin: you will notice that it is +/- 1.65 * SE so that covers the full 95% of area of normal curve.

    Comment by Paras Chopra — October 29, 2010 @ 8:38 pm

  7. Fail. Your Conversion rate limits overlap at the 95% level but you say that they are significant. This is inconsistant.

    Comment by Dennis — November 9, 2010 @ 4:35 pm

  8. @Dennis: not sure if I got your point. Can you elaborate?

    Comment by Paras Chopra — November 9, 2010 @ 4:51 pm

  9. Sure, in your spreadsheet your 95% conversion rate limit for the control is between 5.68% and 7.62% while the conversion rate for the variation is between 4.81% and 6.89%. These two ranges overlap and thus you have failed to find a significant difference as the conversion rate may be 6% for the control and 6% for the variation.

    However you have listed in another box that your conversion rate at 95% confidence is significant.

    This result contradicts your 95& conversion rate limits results.

    Comment by Dennis — November 10, 2010 @ 3:40 pm

  10. Could you please respond to the last comment posted by Dennis? It does seem your worksheet contradicts itself. I would like to use it, but I want to make sure it is accurate.

    Comment by JoeNovember 17, 2010 @ 10:09 am

  11. @Joe and @Dennis: actually, 95% range of conversion rate is different from being significant at 95% confidence level. If you visualize conversion rate ranges as a normal curves, then the overlap in 95% range constitutes a tiny area and that’s why the resultant z-value becomes significant at 95% confidence level.

    I hope I am clear. If not, let me know. Will try to clarify.

    Comment by Paras Chopra — November 17, 2010 @ 12:48 pm

  12. Isn’t it because you are using 1.65 instead of 1.96? If you are doing a two-tailed test, 1.65 only gives you a 90% range. 1.96 is required for a 95% range, again on a two-tailed view. If you define it as checking if variation is better than control (pvariation-pcontrol<=0), then you could use a one tail range maybe. But it seems your calculator is just trying to show if they are different (i.e., you care if either one is larger than the other).

    Comment by JoeNovember 17, 2010 @ 11:33 pm

  13. @Joe: Yes, you are right. It isn’t a one-tailed test. It depends on how you are interpreting the result but I am glad you clarified.

    Comment by Paras Chopra — November 17, 2010 @ 11:37 pm

  14. For the online version of the calculator, you set the minimum N as being 15. Does n=15 have an special relevance?

    Comment by BartekJanuary 10, 2011 @ 10:06 pm

  15. @Bartek: which N are you talking about?

    Comment by Paras Chopra — January 11, 2011 @ 12:56 am

  16. Could I use this tool for evaluating responses to a survey?

    E.g. 1000 respondents, 600 are satisfied, 400 are not satisfied. is the difference statistically significant?

    Comment by Dave — February 23, 2011 @ 10:07 pm

  17. [...] User There's a handy spreadsheet with statistical significance formulas on http://visualwebsiteoptimizer.co…And like Michelle Wyatt said, you can (and should) start testing with any amount of traffic, it may [...]

    Pingback by How many hits to your landing page do you need to start A/B testing it? - QuoraApril 2, 2011 @ 4:26 am

  18. [...] Word of caution. Be aware of premature e-finalization. Don’t end tests before data is finalized (aka statistically significant). [...]

    Pingback by Appsumo reveals its A/B testing secret: only 1 out of 8 tests produce resultsMay 13, 2011 @ 8:10 pm

  19. [...] Paras Chopra from Visual Website Optimizer has some helpful tips on figuring out when you have reached “Statistical confidence.” [...]

    Pingback by A/B Testing Ad Text for Better PPC Results : Amadeus ConsultingJune 1, 2011 @ 4:05 am

  20. What’s the best way to measure statistical significance of revenue improvements. I have my split test feeding data into Analytics but I’m interested in knowing at what point my Per Visit Value (which may not correlate well with raw conversions) becomes statistically relevant. Is their a way of calculating this? To me, the answer isn’t at what point the number of conversions becomes statistically relevant it’s at what point the £ or $ becomes relevant.

    Comment by TW — June 13, 2011 @ 6:03 pm

  21. @Tim: mathematically, the basis for calculating significance on revenue improvement is similar. You simply need to input the mean and standard deviation of revenue and rest of math remains the same. We already do it for revenue tracking feature in VWO: http://visualwebsiteoptimizer.com/split-testing-blog/revenue-tracking-for-ab-testing/

    Comment by Paras Chopra — June 13, 2011 @ 6:16 pm

  22. That looks great and would def. give me a reason to use VWO next time. I have a lot of data at the moment in GWO / Analytics for this test that we’ve run so in this particular instance I’ll need to find a way of calculating that significance with the data I’ve got.

    Comment by TW — June 13, 2011 @ 8:35 pm

  23. [...] it takes for you to know which one is the best. You need statistical significance. There’s a significance calculator spreadsheet in Excel you can [...]

    Pingback by What you have to know about conversion optimization - ConversionXLNovember 4, 2011 @ 9:24 pm

  24. [...] it takes for you to know which one is the best. You need statistical significance. There’s a significance calculator spreadsheet in Excel you can [...]

    Pingback by What you have to know about conversion optimization | Traindom BlogNovember 6, 2011 @ 4:46 am

  25. The significance level of the test is not determined by the p-value, nor is it the probability that the null hypothesis is true.

    One rejects the null hypothesis when the p-value is less than the significance level alpha, which is often 0.05

    The p-value is based on the assumption that a result is the product of chance alone, it therefore cannot also be used to gauge the probability of that assumption being true.

    The significance level of a test is a value that should be decided upon by the person interpreting the data before the data are viewed,and is compared against the p-value or any other statistic calculated after the test has been performed.

    The real meaning is that the p-value is the chance of obtaining such results if the null hypothesis is true.

    Comment by EricDecember 16, 2011 @ 8:18 am

  26. [...] A/B testing significance calculator (spreadsheet in Excel) Tweet This entry was posted in Uncategorized. Bookmark the permalink. ← Skal kundens navn med i emnelinjen på dit nyhedsbrev? [...]

    Pingback by A/B spilttest giver ikke et brugbart resultatDecember 31, 2011 @ 7:27 pm

  27. [...] lite datagrunnlag – bruk denne enkle A/B-testing kalkulatoren for å sikre [...]

    Pingback by A/B-testing av annonserJanuary 12, 2012 @ 4:59 pm

  28. Can anybody tell me how to derivate the formula of the Z-Score? I need this formula for my thesis, so it would be good if I could explain the correctness of this formula with mathematical literature. Does anybody now books or any other scientifical papers that describe this forumla?
    Thank you very much!

    Comment by idnA — January 19, 2012 @ 4:45 pm

RSS feed for comments on this post. TrackBack URL

Leave a comment

Get email updates (it's free)

Get email updates

Or subscribe blog via RSS



Search Blog


Latest Posts



Visual Website Optimizer


Latest Tweets Follow Wingify on Twitter

Follow Wingify on Twitter