Q.7.43

Question

Let X1,X2,,Xn be independent random variables having an unknown continuous distribution function F and let Y1,Y2,,Ym be independent random variables having an unknown continuous distribution function G. Now order those n+mvariables, and let

Ii=1     if the i th smallest of the n+m     variables is from the X sample 0     otherwise 

The random variable R=i=1n+miIi is the sum of the ranks of the X sample and is the basis of a standard statistical procedure (called the Wilcoxon sum-of-ranks test) for testing whether F and G are identical distributions. This test accepts the hypothesis that F=G when R is neither too large nor too small. Assuming that the hypothesis of equality is in fact correct, compute the mean and variance of R.

Hint: Use the results of Example 3e.

Step-by-Step Solution

Verified
Answer

The mean of R is E(R)=n(n+m+1)2

The Variance of R is Var(R)=nm(n+m+1)12

1Step 1: Given Information

The independent random variables having an unknown continuous distribution function F=X1,X2,,Xn

The independent random variables having an unknown continuous distribution function 

G=Y1,Y2,,Ym

Given functionIi=1     if the i th smallest of the n+m     variables is from the X sample 0     otherwise 

Random variable R=i=1n+miIj

The mean and variance of R=?

2Step 2: Explanation

We have the random variable,

R=i=1n+miIi


Applying the result of example3e

E(R)=ni¯ 

And Var(R)=n(n+m-n)n+m-1i=1n+mi2n+m-(i¯)2

Now

i¯=1+2+3++(n+m)(n+m)

=(n+m)(n+m+1)(n+m)·2

=n+m+12

i=1n+mi2=12+22+32+.+(n+m)2

=(n+m)(n+m+1)(2n+2m+1)6

E(R)=n(n+m+1)2

3Step 3: Explanation

Calculate the variance,

Var(R)=nmn+m--1(n+m+1)(2n+2m+1)6-(n+m+1)24

=nm(n+m+1)n+m-12n+2 m+16-n+m+14

=nm(n+m+1)n+m-14n+4m+2-3n-3m-312

=nm(n+m+1)n+m-1n+m-112

=nm(n+m+1)12

4Step 4: Final Answer

The mean value ofRis E(R)=n(n+m+1)2

The variance value of R is Var(R)=nm(n+m+1)12.