Interpreting an agreement function












1












$begingroup$


Consider the following function:
$$
P = frac{1}{n(n - 1)} sum_{j=1}^k n_{j} (n_{j} - 1)
$$

where for $n = sum_{j=1}^k n_{j}$.



Intuitively, this function measure concentration of values the vector $(n_1, ..., n_k)$. Take the edge cases:




  • Values concentrated: $exists j, n_j = n$ (in other words $forall i neq j, n_i = 0$) $Rightarrow P = 1.0$


  • Least concentration (uniformly distribution): $n_1 = n_2 = ... = n_k Rightarrow P = 0.0$



The formula is relatively simple-looking, but it's not obvious to interpret. I am looking for better/simpler ways to explain this formula to people who are not mathematically sophisticated (say in my psychology department, who know the basics of statistics like mean and variance). Would appreciate if you provide me with any suggestions on this.










share|cite|improve this question











$endgroup$

















    1












    $begingroup$


    Consider the following function:
    $$
    P = frac{1}{n(n - 1)} sum_{j=1}^k n_{j} (n_{j} - 1)
    $$

    where for $n = sum_{j=1}^k n_{j}$.



    Intuitively, this function measure concentration of values the vector $(n_1, ..., n_k)$. Take the edge cases:




    • Values concentrated: $exists j, n_j = n$ (in other words $forall i neq j, n_i = 0$) $Rightarrow P = 1.0$


    • Least concentration (uniformly distribution): $n_1 = n_2 = ... = n_k Rightarrow P = 0.0$



    The formula is relatively simple-looking, but it's not obvious to interpret. I am looking for better/simpler ways to explain this formula to people who are not mathematically sophisticated (say in my psychology department, who know the basics of statistics like mean and variance). Would appreciate if you provide me with any suggestions on this.










    share|cite|improve this question











    $endgroup$















      1












      1








      1





      $begingroup$


      Consider the following function:
      $$
      P = frac{1}{n(n - 1)} sum_{j=1}^k n_{j} (n_{j} - 1)
      $$

      where for $n = sum_{j=1}^k n_{j}$.



      Intuitively, this function measure concentration of values the vector $(n_1, ..., n_k)$. Take the edge cases:




      • Values concentrated: $exists j, n_j = n$ (in other words $forall i neq j, n_i = 0$) $Rightarrow P = 1.0$


      • Least concentration (uniformly distribution): $n_1 = n_2 = ... = n_k Rightarrow P = 0.0$



      The formula is relatively simple-looking, but it's not obvious to interpret. I am looking for better/simpler ways to explain this formula to people who are not mathematically sophisticated (say in my psychology department, who know the basics of statistics like mean and variance). Would appreciate if you provide me with any suggestions on this.










      share|cite|improve this question











      $endgroup$




      Consider the following function:
      $$
      P = frac{1}{n(n - 1)} sum_{j=1}^k n_{j} (n_{j} - 1)
      $$

      where for $n = sum_{j=1}^k n_{j}$.



      Intuitively, this function measure concentration of values the vector $(n_1, ..., n_k)$. Take the edge cases:




      • Values concentrated: $exists j, n_j = n$ (in other words $forall i neq j, n_i = 0$) $Rightarrow P = 1.0$


      • Least concentration (uniformly distribution): $n_1 = n_2 = ... = n_k Rightarrow P = 0.0$



      The formula is relatively simple-looking, but it's not obvious to interpret. I am looking for better/simpler ways to explain this formula to people who are not mathematically sophisticated (say in my psychology department, who know the basics of statistics like mean and variance). Would appreciate if you provide me with any suggestions on this.







      probability combinatorics intuition education






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Nov 28 '18 at 21:00









      greedoid

      38.7k114797




      38.7k114797










      asked Nov 28 '18 at 17:57









      DanielDaniel

      1,1311021




      1,1311021






















          3 Answers
          3






          active

          oldest

          votes


















          3












          $begingroup$

          Supppose we have sets $A_1,...A_k$ which are pairvise disjunct and for each $j$ let $n_j = |A_j|$.



          We choose randomly two elements from $A= A_1cup A_2cup...cup A_k$. Then the probability that they are from the same set is the expresion you wrote.






          share|cite|improve this answer









          $endgroup$





















            2












            $begingroup$

            You can write your $P$ in terms of $C$, and vice versa, where $C=sum_{i=1}^k (n_i-n/k)^2/(n/k)$ is the conventional chi-square statistic for testing if the vector of $n_i$ values comes from the all-categories-equally-likely flat multinomial model. So $P$ tells you no more or less than $C$ does, but your audience might be familiar with the chi-square statistic as commonly used to measure unevenness of category counts.






            share|cite|improve this answer









            $endgroup$





















              1












              $begingroup$

              Probability to take the same element $j$ two times:
              $$
              p_j=left(frac{n_j}{n}right)left(frac{n_j-1}{n-1}right)=frac{n_j(n_j-1)}{n(n-1)}
              $$

              Now we do not care about $j$ anymore and therefore marginalize:
              $$
              p=sum_j p_j=frac{1}{n(n-1)}sum_j n_j(n_j-1)
              $$



              which is the probability to take the same element two times (independently of $j$)






              share|cite|improve this answer











              $endgroup$













                Your Answer





                StackExchange.ifUsing("editor", function () {
                return StackExchange.using("mathjaxEditing", function () {
                StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
                StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
                });
                });
                }, "mathjax-editing");

                StackExchange.ready(function() {
                var channelOptions = {
                tags: "".split(" "),
                id: "69"
                };
                initTagRenderer("".split(" "), "".split(" "), channelOptions);

                StackExchange.using("externalEditor", function() {
                // Have to fire editor after snippets, if snippets enabled
                if (StackExchange.settings.snippets.snippetsEnabled) {
                StackExchange.using("snippets", function() {
                createEditor();
                });
                }
                else {
                createEditor();
                }
                });

                function createEditor() {
                StackExchange.prepareEditor({
                heartbeatType: 'answer',
                autoActivateHeartbeat: false,
                convertImagesToLinks: true,
                noModals: true,
                showLowRepImageUploadWarning: true,
                reputationToPostImages: 10,
                bindNavPrevention: true,
                postfix: "",
                imageUploader: {
                brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
                contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
                allowUrls: true
                },
                noCode: true, onDemand: true,
                discardSelector: ".discard-answer"
                ,immediatelyShowMarkdownHelp:true
                });


                }
                });














                draft saved

                draft discarded


















                StackExchange.ready(
                function () {
                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3017476%2finterpreting-an-agreement-function%23new-answer', 'question_page');
                }
                );

                Post as a guest















                Required, but never shown

























                3 Answers
                3






                active

                oldest

                votes








                3 Answers
                3






                active

                oldest

                votes









                active

                oldest

                votes






                active

                oldest

                votes









                3












                $begingroup$

                Supppose we have sets $A_1,...A_k$ which are pairvise disjunct and for each $j$ let $n_j = |A_j|$.



                We choose randomly two elements from $A= A_1cup A_2cup...cup A_k$. Then the probability that they are from the same set is the expresion you wrote.






                share|cite|improve this answer









                $endgroup$


















                  3












                  $begingroup$

                  Supppose we have sets $A_1,...A_k$ which are pairvise disjunct and for each $j$ let $n_j = |A_j|$.



                  We choose randomly two elements from $A= A_1cup A_2cup...cup A_k$. Then the probability that they are from the same set is the expresion you wrote.






                  share|cite|improve this answer









                  $endgroup$
















                    3












                    3








                    3





                    $begingroup$

                    Supppose we have sets $A_1,...A_k$ which are pairvise disjunct and for each $j$ let $n_j = |A_j|$.



                    We choose randomly two elements from $A= A_1cup A_2cup...cup A_k$. Then the probability that they are from the same set is the expresion you wrote.






                    share|cite|improve this answer









                    $endgroup$



                    Supppose we have sets $A_1,...A_k$ which are pairvise disjunct and for each $j$ let $n_j = |A_j|$.



                    We choose randomly two elements from $A= A_1cup A_2cup...cup A_k$. Then the probability that they are from the same set is the expresion you wrote.







                    share|cite|improve this answer












                    share|cite|improve this answer



                    share|cite|improve this answer










                    answered Nov 28 '18 at 19:59









                    greedoidgreedoid

                    38.7k114797




                    38.7k114797























                        2












                        $begingroup$

                        You can write your $P$ in terms of $C$, and vice versa, where $C=sum_{i=1}^k (n_i-n/k)^2/(n/k)$ is the conventional chi-square statistic for testing if the vector of $n_i$ values comes from the all-categories-equally-likely flat multinomial model. So $P$ tells you no more or less than $C$ does, but your audience might be familiar with the chi-square statistic as commonly used to measure unevenness of category counts.






                        share|cite|improve this answer









                        $endgroup$


















                          2












                          $begingroup$

                          You can write your $P$ in terms of $C$, and vice versa, where $C=sum_{i=1}^k (n_i-n/k)^2/(n/k)$ is the conventional chi-square statistic for testing if the vector of $n_i$ values comes from the all-categories-equally-likely flat multinomial model. So $P$ tells you no more or less than $C$ does, but your audience might be familiar with the chi-square statistic as commonly used to measure unevenness of category counts.






                          share|cite|improve this answer









                          $endgroup$
















                            2












                            2








                            2





                            $begingroup$

                            You can write your $P$ in terms of $C$, and vice versa, where $C=sum_{i=1}^k (n_i-n/k)^2/(n/k)$ is the conventional chi-square statistic for testing if the vector of $n_i$ values comes from the all-categories-equally-likely flat multinomial model. So $P$ tells you no more or less than $C$ does, but your audience might be familiar with the chi-square statistic as commonly used to measure unevenness of category counts.






                            share|cite|improve this answer









                            $endgroup$



                            You can write your $P$ in terms of $C$, and vice versa, where $C=sum_{i=1}^k (n_i-n/k)^2/(n/k)$ is the conventional chi-square statistic for testing if the vector of $n_i$ values comes from the all-categories-equally-likely flat multinomial model. So $P$ tells you no more or less than $C$ does, but your audience might be familiar with the chi-square statistic as commonly used to measure unevenness of category counts.







                            share|cite|improve this answer












                            share|cite|improve this answer



                            share|cite|improve this answer










                            answered Nov 28 '18 at 19:52









                            kimchi loverkimchi lover

                            9,69631128




                            9,69631128























                                1












                                $begingroup$

                                Probability to take the same element $j$ two times:
                                $$
                                p_j=left(frac{n_j}{n}right)left(frac{n_j-1}{n-1}right)=frac{n_j(n_j-1)}{n(n-1)}
                                $$

                                Now we do not care about $j$ anymore and therefore marginalize:
                                $$
                                p=sum_j p_j=frac{1}{n(n-1)}sum_j n_j(n_j-1)
                                $$



                                which is the probability to take the same element two times (independently of $j$)






                                share|cite|improve this answer











                                $endgroup$


















                                  1












                                  $begingroup$

                                  Probability to take the same element $j$ two times:
                                  $$
                                  p_j=left(frac{n_j}{n}right)left(frac{n_j-1}{n-1}right)=frac{n_j(n_j-1)}{n(n-1)}
                                  $$

                                  Now we do not care about $j$ anymore and therefore marginalize:
                                  $$
                                  p=sum_j p_j=frac{1}{n(n-1)}sum_j n_j(n_j-1)
                                  $$



                                  which is the probability to take the same element two times (independently of $j$)






                                  share|cite|improve this answer











                                  $endgroup$
















                                    1












                                    1








                                    1





                                    $begingroup$

                                    Probability to take the same element $j$ two times:
                                    $$
                                    p_j=left(frac{n_j}{n}right)left(frac{n_j-1}{n-1}right)=frac{n_j(n_j-1)}{n(n-1)}
                                    $$

                                    Now we do not care about $j$ anymore and therefore marginalize:
                                    $$
                                    p=sum_j p_j=frac{1}{n(n-1)}sum_j n_j(n_j-1)
                                    $$



                                    which is the probability to take the same element two times (independently of $j$)






                                    share|cite|improve this answer











                                    $endgroup$



                                    Probability to take the same element $j$ two times:
                                    $$
                                    p_j=left(frac{n_j}{n}right)left(frac{n_j-1}{n-1}right)=frac{n_j(n_j-1)}{n(n-1)}
                                    $$

                                    Now we do not care about $j$ anymore and therefore marginalize:
                                    $$
                                    p=sum_j p_j=frac{1}{n(n-1)}sum_j n_j(n_j-1)
                                    $$



                                    which is the probability to take the same element two times (independently of $j$)







                                    share|cite|improve this answer














                                    share|cite|improve this answer



                                    share|cite|improve this answer








                                    edited Dec 16 '18 at 20:06

























                                    answered Nov 28 '18 at 21:51









                                    Picaud VincentPicaud Vincent

                                    1,33439




                                    1,33439






























                                        draft saved

                                        draft discarded




















































                                        Thanks for contributing an answer to Mathematics Stack Exchange!


                                        • Please be sure to answer the question. Provide details and share your research!

                                        But avoid



                                        • Asking for help, clarification, or responding to other answers.

                                        • Making statements based on opinion; back them up with references or personal experience.


                                        Use MathJax to format equations. MathJax reference.


                                        To learn more, see our tips on writing great answers.




                                        draft saved


                                        draft discarded














                                        StackExchange.ready(
                                        function () {
                                        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3017476%2finterpreting-an-agreement-function%23new-answer', 'question_page');
                                        }
                                        );

                                        Post as a guest















                                        Required, but never shown





















































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown

































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown







                                        Popular posts from this blog

                                        Plaza Victoria

                                        Brian Clough

                                        Cáceres