Numbered Balls: Multiple cumulative probabilities












1












$begingroup$


I have 5 bags, each with 20 balls. In each bag one of the balls is special - it's red, the others are white. The game is to pick balls without replacing until I choose the special ball.



My friend says he is psychic and can pick a special ball statistically significantly greater than chance.



He picks the chosen ball (for example):
2nd on bag 1,
3rd on bag 2,
1st on bag 3,
4th on bag 4, and
2nd on bag 5.



This average (2.2) is obviously quite a good result and much lower than the expected value (10.5) - but can I show that he is very unlikely to have done so well by chance?



I feel it's the same problem as calculating to the probability of say rolling a total of 11 or lower on 5, 20 sided dice.










share|cite|improve this question











$endgroup$

















    1












    $begingroup$


    I have 5 bags, each with 20 balls. In each bag one of the balls is special - it's red, the others are white. The game is to pick balls without replacing until I choose the special ball.



    My friend says he is psychic and can pick a special ball statistically significantly greater than chance.



    He picks the chosen ball (for example):
    2nd on bag 1,
    3rd on bag 2,
    1st on bag 3,
    4th on bag 4, and
    2nd on bag 5.



    This average (2.2) is obviously quite a good result and much lower than the expected value (10.5) - but can I show that he is very unlikely to have done so well by chance?



    I feel it's the same problem as calculating to the probability of say rolling a total of 11 or lower on 5, 20 sided dice.










    share|cite|improve this question











    $endgroup$















      1












      1








      1





      $begingroup$


      I have 5 bags, each with 20 balls. In each bag one of the balls is special - it's red, the others are white. The game is to pick balls without replacing until I choose the special ball.



      My friend says he is psychic and can pick a special ball statistically significantly greater than chance.



      He picks the chosen ball (for example):
      2nd on bag 1,
      3rd on bag 2,
      1st on bag 3,
      4th on bag 4, and
      2nd on bag 5.



      This average (2.2) is obviously quite a good result and much lower than the expected value (10.5) - but can I show that he is very unlikely to have done so well by chance?



      I feel it's the same problem as calculating to the probability of say rolling a total of 11 or lower on 5, 20 sided dice.










      share|cite|improve this question











      $endgroup$




      I have 5 bags, each with 20 balls. In each bag one of the balls is special - it's red, the others are white. The game is to pick balls without replacing until I choose the special ball.



      My friend says he is psychic and can pick a special ball statistically significantly greater than chance.



      He picks the chosen ball (for example):
      2nd on bag 1,
      3rd on bag 2,
      1st on bag 3,
      4th on bag 4, and
      2nd on bag 5.



      This average (2.2) is obviously quite a good result and much lower than the expected value (10.5) - but can I show that he is very unlikely to have done so well by chance?



      I feel it's the same problem as calculating to the probability of say rolling a total of 11 or lower on 5, 20 sided dice.







      probability combinatorics






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Nov 29 '18 at 11:13







      james_alvarez

















      asked Nov 28 '18 at 19:24









      james_alvarezjames_alvarez

      1266




      1266






















          1 Answer
          1






          active

          oldest

          votes


















          2












          $begingroup$

          For a person picking at random, chance of getting the special on first pick is $ frac 1 {20}$. On second try it is $frac {19} {20} * frac 1 {19}$ = $ frac 1 {20}$. Following the same logic, we get for a single bag probability to pick the special ball on $n^{th}$ pick is



          $P(n) = frac 1 {20}$



          Using this we can construct a discrete uniform distribution for the number of pick we get the special ball.



          $E(n) = 10.5$



          $Var(n) = frac {(b - a + 1 ) ^ 2 -1} {12} = frac {399} {12}$



          Now that we have a distribution, we need to do a goodness of fit test. Using a one sided chi-squared test we can show whether the results are meaningful or not. Since you only have 5 observations your degree of freedom would be really low. You can test it with different significance levels (or look at p-value)



          However, as you pointed out in the comment,Chi-squared tests only test whether it's from the same distribution, not whether on lower end. We can do a monte carlo simulation to get cdf and get the answer that way, however since we are working on the tail, it might be slow to converge (importance sampling could be used ).



          We can also use brute force to calculate it.



          Theoretically, the results of the 5 pick can come up in $20^5$ ways. Your psychic friend got a total of 12. Now lets get the distribution of the sum.



          To get a score of 5 (getting all at 1st try) there is only 1 way. We have to choose all of them at the first pick.



          To get a score of 6 or less (get one at second try all others on the first) there are 5 + 1 = 6 ways.



          To get a score of 7, and more lets come up with a general solution



          Imagine the total is 11. (eg. he got the right ball on picks 2/3/2/2/2)
          How many total ways there can be to arrange that? Let 0 denote a wrong pick and x denote the right pick. the example could be described as:



          0x00x0x0x0x



          This could be arranged in $frac {11!} {6!5!} $ = $11 choose 5$ ways
          Since there is always going to be exactly 5 x and there could be no more than 19 consecutive 0, we can use $n choose 5$ up to 20 safely. If the arrangement happens something like the following



          0xxx0xx0000



          It means that we got the right ball on picks (2,1,1,2,1) and ended with 7, which is an arrangement included in $11 choose 5$



          Since your friend got 12 points, it would be $frac {12 choose 5} {20^5} = 0.0002475. $ Since this is smaller than 1%, we can say he is statistically better at picking.



          Hope it helps






          share|cite|improve this answer











          $endgroup$









          • 1




            $begingroup$
            This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
            $endgroup$
            – james_alvarez
            Nov 28 '18 at 23:22






          • 1




            $begingroup$
            For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
            $endgroup$
            – james_alvarez
            Nov 29 '18 at 7:28










          • $begingroup$
            Edited in lights of your comment
            $endgroup$
            – Ofya
            Nov 29 '18 at 18:31






          • 1




            $begingroup$
            Thanks that's brilliant
            $endgroup$
            – james_alvarez
            Nov 30 '18 at 9:36











          Your Answer





          StackExchange.ifUsing("editor", function () {
          return StackExchange.using("mathjaxEditing", function () {
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          });
          });
          }, "mathjax-editing");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "69"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3017575%2fnumbered-balls-multiple-cumulative-probabilities%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          2












          $begingroup$

          For a person picking at random, chance of getting the special on first pick is $ frac 1 {20}$. On second try it is $frac {19} {20} * frac 1 {19}$ = $ frac 1 {20}$. Following the same logic, we get for a single bag probability to pick the special ball on $n^{th}$ pick is



          $P(n) = frac 1 {20}$



          Using this we can construct a discrete uniform distribution for the number of pick we get the special ball.



          $E(n) = 10.5$



          $Var(n) = frac {(b - a + 1 ) ^ 2 -1} {12} = frac {399} {12}$



          Now that we have a distribution, we need to do a goodness of fit test. Using a one sided chi-squared test we can show whether the results are meaningful or not. Since you only have 5 observations your degree of freedom would be really low. You can test it with different significance levels (or look at p-value)



          However, as you pointed out in the comment,Chi-squared tests only test whether it's from the same distribution, not whether on lower end. We can do a monte carlo simulation to get cdf and get the answer that way, however since we are working on the tail, it might be slow to converge (importance sampling could be used ).



          We can also use brute force to calculate it.



          Theoretically, the results of the 5 pick can come up in $20^5$ ways. Your psychic friend got a total of 12. Now lets get the distribution of the sum.



          To get a score of 5 (getting all at 1st try) there is only 1 way. We have to choose all of them at the first pick.



          To get a score of 6 or less (get one at second try all others on the first) there are 5 + 1 = 6 ways.



          To get a score of 7, and more lets come up with a general solution



          Imagine the total is 11. (eg. he got the right ball on picks 2/3/2/2/2)
          How many total ways there can be to arrange that? Let 0 denote a wrong pick and x denote the right pick. the example could be described as:



          0x00x0x0x0x



          This could be arranged in $frac {11!} {6!5!} $ = $11 choose 5$ ways
          Since there is always going to be exactly 5 x and there could be no more than 19 consecutive 0, we can use $n choose 5$ up to 20 safely. If the arrangement happens something like the following



          0xxx0xx0000



          It means that we got the right ball on picks (2,1,1,2,1) and ended with 7, which is an arrangement included in $11 choose 5$



          Since your friend got 12 points, it would be $frac {12 choose 5} {20^5} = 0.0002475. $ Since this is smaller than 1%, we can say he is statistically better at picking.



          Hope it helps






          share|cite|improve this answer











          $endgroup$









          • 1




            $begingroup$
            This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
            $endgroup$
            – james_alvarez
            Nov 28 '18 at 23:22






          • 1




            $begingroup$
            For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
            $endgroup$
            – james_alvarez
            Nov 29 '18 at 7:28










          • $begingroup$
            Edited in lights of your comment
            $endgroup$
            – Ofya
            Nov 29 '18 at 18:31






          • 1




            $begingroup$
            Thanks that's brilliant
            $endgroup$
            – james_alvarez
            Nov 30 '18 at 9:36
















          2












          $begingroup$

          For a person picking at random, chance of getting the special on first pick is $ frac 1 {20}$. On second try it is $frac {19} {20} * frac 1 {19}$ = $ frac 1 {20}$. Following the same logic, we get for a single bag probability to pick the special ball on $n^{th}$ pick is



          $P(n) = frac 1 {20}$



          Using this we can construct a discrete uniform distribution for the number of pick we get the special ball.



          $E(n) = 10.5$



          $Var(n) = frac {(b - a + 1 ) ^ 2 -1} {12} = frac {399} {12}$



          Now that we have a distribution, we need to do a goodness of fit test. Using a one sided chi-squared test we can show whether the results are meaningful or not. Since you only have 5 observations your degree of freedom would be really low. You can test it with different significance levels (or look at p-value)



          However, as you pointed out in the comment,Chi-squared tests only test whether it's from the same distribution, not whether on lower end. We can do a monte carlo simulation to get cdf and get the answer that way, however since we are working on the tail, it might be slow to converge (importance sampling could be used ).



          We can also use brute force to calculate it.



          Theoretically, the results of the 5 pick can come up in $20^5$ ways. Your psychic friend got a total of 12. Now lets get the distribution of the sum.



          To get a score of 5 (getting all at 1st try) there is only 1 way. We have to choose all of them at the first pick.



          To get a score of 6 or less (get one at second try all others on the first) there are 5 + 1 = 6 ways.



          To get a score of 7, and more lets come up with a general solution



          Imagine the total is 11. (eg. he got the right ball on picks 2/3/2/2/2)
          How many total ways there can be to arrange that? Let 0 denote a wrong pick and x denote the right pick. the example could be described as:



          0x00x0x0x0x



          This could be arranged in $frac {11!} {6!5!} $ = $11 choose 5$ ways
          Since there is always going to be exactly 5 x and there could be no more than 19 consecutive 0, we can use $n choose 5$ up to 20 safely. If the arrangement happens something like the following



          0xxx0xx0000



          It means that we got the right ball on picks (2,1,1,2,1) and ended with 7, which is an arrangement included in $11 choose 5$



          Since your friend got 12 points, it would be $frac {12 choose 5} {20^5} = 0.0002475. $ Since this is smaller than 1%, we can say he is statistically better at picking.



          Hope it helps






          share|cite|improve this answer











          $endgroup$









          • 1




            $begingroup$
            This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
            $endgroup$
            – james_alvarez
            Nov 28 '18 at 23:22






          • 1




            $begingroup$
            For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
            $endgroup$
            – james_alvarez
            Nov 29 '18 at 7:28










          • $begingroup$
            Edited in lights of your comment
            $endgroup$
            – Ofya
            Nov 29 '18 at 18:31






          • 1




            $begingroup$
            Thanks that's brilliant
            $endgroup$
            – james_alvarez
            Nov 30 '18 at 9:36














          2












          2








          2





          $begingroup$

          For a person picking at random, chance of getting the special on first pick is $ frac 1 {20}$. On second try it is $frac {19} {20} * frac 1 {19}$ = $ frac 1 {20}$. Following the same logic, we get for a single bag probability to pick the special ball on $n^{th}$ pick is



          $P(n) = frac 1 {20}$



          Using this we can construct a discrete uniform distribution for the number of pick we get the special ball.



          $E(n) = 10.5$



          $Var(n) = frac {(b - a + 1 ) ^ 2 -1} {12} = frac {399} {12}$



          Now that we have a distribution, we need to do a goodness of fit test. Using a one sided chi-squared test we can show whether the results are meaningful or not. Since you only have 5 observations your degree of freedom would be really low. You can test it with different significance levels (or look at p-value)



          However, as you pointed out in the comment,Chi-squared tests only test whether it's from the same distribution, not whether on lower end. We can do a monte carlo simulation to get cdf and get the answer that way, however since we are working on the tail, it might be slow to converge (importance sampling could be used ).



          We can also use brute force to calculate it.



          Theoretically, the results of the 5 pick can come up in $20^5$ ways. Your psychic friend got a total of 12. Now lets get the distribution of the sum.



          To get a score of 5 (getting all at 1st try) there is only 1 way. We have to choose all of them at the first pick.



          To get a score of 6 or less (get one at second try all others on the first) there are 5 + 1 = 6 ways.



          To get a score of 7, and more lets come up with a general solution



          Imagine the total is 11. (eg. he got the right ball on picks 2/3/2/2/2)
          How many total ways there can be to arrange that? Let 0 denote a wrong pick and x denote the right pick. the example could be described as:



          0x00x0x0x0x



          This could be arranged in $frac {11!} {6!5!} $ = $11 choose 5$ ways
          Since there is always going to be exactly 5 x and there could be no more than 19 consecutive 0, we can use $n choose 5$ up to 20 safely. If the arrangement happens something like the following



          0xxx0xx0000



          It means that we got the right ball on picks (2,1,1,2,1) and ended with 7, which is an arrangement included in $11 choose 5$



          Since your friend got 12 points, it would be $frac {12 choose 5} {20^5} = 0.0002475. $ Since this is smaller than 1%, we can say he is statistically better at picking.



          Hope it helps






          share|cite|improve this answer











          $endgroup$



          For a person picking at random, chance of getting the special on first pick is $ frac 1 {20}$. On second try it is $frac {19} {20} * frac 1 {19}$ = $ frac 1 {20}$. Following the same logic, we get for a single bag probability to pick the special ball on $n^{th}$ pick is



          $P(n) = frac 1 {20}$



          Using this we can construct a discrete uniform distribution for the number of pick we get the special ball.



          $E(n) = 10.5$



          $Var(n) = frac {(b - a + 1 ) ^ 2 -1} {12} = frac {399} {12}$



          Now that we have a distribution, we need to do a goodness of fit test. Using a one sided chi-squared test we can show whether the results are meaningful or not. Since you only have 5 observations your degree of freedom would be really low. You can test it with different significance levels (or look at p-value)



          However, as you pointed out in the comment,Chi-squared tests only test whether it's from the same distribution, not whether on lower end. We can do a monte carlo simulation to get cdf and get the answer that way, however since we are working on the tail, it might be slow to converge (importance sampling could be used ).



          We can also use brute force to calculate it.



          Theoretically, the results of the 5 pick can come up in $20^5$ ways. Your psychic friend got a total of 12. Now lets get the distribution of the sum.



          To get a score of 5 (getting all at 1st try) there is only 1 way. We have to choose all of them at the first pick.



          To get a score of 6 or less (get one at second try all others on the first) there are 5 + 1 = 6 ways.



          To get a score of 7, and more lets come up with a general solution



          Imagine the total is 11. (eg. he got the right ball on picks 2/3/2/2/2)
          How many total ways there can be to arrange that? Let 0 denote a wrong pick and x denote the right pick. the example could be described as:



          0x00x0x0x0x



          This could be arranged in $frac {11!} {6!5!} $ = $11 choose 5$ ways
          Since there is always going to be exactly 5 x and there could be no more than 19 consecutive 0, we can use $n choose 5$ up to 20 safely. If the arrangement happens something like the following



          0xxx0xx0000



          It means that we got the right ball on picks (2,1,1,2,1) and ended with 7, which is an arrangement included in $11 choose 5$



          Since your friend got 12 points, it would be $frac {12 choose 5} {20^5} = 0.0002475. $ Since this is smaller than 1%, we can say he is statistically better at picking.



          Hope it helps







          share|cite|improve this answer














          share|cite|improve this answer



          share|cite|improve this answer








          edited Nov 30 '18 at 9:45









          james_alvarez

          1266




          1266










          answered Nov 28 '18 at 20:43









          OfyaOfya

          5048




          5048








          • 1




            $begingroup$
            This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
            $endgroup$
            – james_alvarez
            Nov 28 '18 at 23:22






          • 1




            $begingroup$
            For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
            $endgroup$
            – james_alvarez
            Nov 29 '18 at 7:28










          • $begingroup$
            Edited in lights of your comment
            $endgroup$
            – Ofya
            Nov 29 '18 at 18:31






          • 1




            $begingroup$
            Thanks that's brilliant
            $endgroup$
            – james_alvarez
            Nov 30 '18 at 9:36














          • 1




            $begingroup$
            This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
            $endgroup$
            – james_alvarez
            Nov 28 '18 at 23:22






          • 1




            $begingroup$
            For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
            $endgroup$
            – james_alvarez
            Nov 29 '18 at 7:28










          • $begingroup$
            Edited in lights of your comment
            $endgroup$
            – Ofya
            Nov 29 '18 at 18:31






          • 1




            $begingroup$
            Thanks that's brilliant
            $endgroup$
            – james_alvarez
            Nov 30 '18 at 9:36








          1




          1




          $begingroup$
          This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
          $endgroup$
          – james_alvarez
          Nov 28 '18 at 23:22




          $begingroup$
          This is along the right lines, but chi-squared tests for any deviation from uniform, where as I am only interested in a specifc case, where lower numbers are more probable... Is there anyway to check that the true E(n) is lower than 10.5 for example - as per a t-test if had normal distributed data for example?
          $endgroup$
          – james_alvarez
          Nov 28 '18 at 23:22




          1




          1




          $begingroup$
          For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
          $endgroup$
          – james_alvarez
          Nov 29 '18 at 7:28




          $begingroup$
          For example with a Monte-Carlo analysis, getting an average of 2.2 (as per the question example) or less, is coming out as p = 0.00015 - but is there a way of deriving this?
          $endgroup$
          – james_alvarez
          Nov 29 '18 at 7:28












          $begingroup$
          Edited in lights of your comment
          $endgroup$
          – Ofya
          Nov 29 '18 at 18:31




          $begingroup$
          Edited in lights of your comment
          $endgroup$
          – Ofya
          Nov 29 '18 at 18:31




          1




          1




          $begingroup$
          Thanks that's brilliant
          $endgroup$
          – james_alvarez
          Nov 30 '18 at 9:36




          $begingroup$
          Thanks that's brilliant
          $endgroup$
          – james_alvarez
          Nov 30 '18 at 9:36


















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Mathematics Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3017575%2fnumbered-balls-multiple-cumulative-probabilities%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Plaza Victoria

          Puebla de Zaragoza

          Musa