Moments and Cumulants of the Bivariate Mann-Whitney Statistic for Two-Stage Trials

Dewei Zhong; John Kolassa; Department of Statistics and Biostatistics, the State University of New Jersey, USA

Biomedical Journal of Scientific & Technical Research

April 2021, Volume 35, Issue 1, pp. 27353-27358

Case Report

Received: March 02, 2021 | Published: April 16, 2021

Corresponding author: John Kolassa, 565 Hill Center, 110 Frelinghuysen Road, Piscataway, NJ 08854, USA

DOI: 10.26717/BJSTR.2021.35.005654

Abstract

This paper applies multivariate Cornish-Fisher techniques to calculate the asymptotic critical values of the bivariate Mann-Whitney statistic, which is used in two-stage study designs.

Introduction to the Mann-Whitney Statistic and the Two-Stage Test

Consider the problem of testing a difference in two groups. Suppose that the continuous responses X₁, . . . , X_M are from a control group, and the continuous responses Y₁, . . . , Y_N are from a treatment group. The Mann-Whitney U test [1], equivalent to the Wilcoxon rank sum test [2], uses the statistic

$$U = \sum_{i=1}^{M} \sum_{j=1}^{N} I(X_i < Y_j). \qquad (1)$$

Here I(X_i < Y_j) is 1 when X_i < Y_j holds and 0 otherwise. The statistic U is designed to test the null hypothesis that the distribution of X_i is the same as that of Y_j, vs. the alternative hypothesis that P[Y_j ≥ X_i] > 0.5, at level α. A critical value c is selected as the smallest value such that P₀[U ≥ c] ≤ α, where P₀ denotes probability under the null hypothesis. If U meets or exceeds the critical value, the treatment group is determined to be superior to the control group.
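As a concrete illustration of the statistic and the critical value rule just described, the following Python sketch computes U by direct counting and finds the exact null critical value by enumerating the rank sets that the control sample can occupy. The function names are illustrative and do not come from the paper, and the enumeration is practical only for small M + N.

```python
from fractions import Fraction
from itertools import combinations


def mann_whitney_u(x, y):
    """Count the pairs (i, j) with x[i] < y[j]."""
    return sum(1 for xi in x for yj in y if xi < yj)


def null_distribution(M, N):
    """Exact null pmf of U for control size M and treatment size N.

    Under the null hypothesis every choice of M ranks (out of M + N)
    for the control sample is equally likely, so we enumerate them all.
    """
    counts = {}
    total = 0
    for x_ranks in combinations(range(M + N), M):
        x_set = set(x_ranks)
        y_ranks = [r for r in range(M + N) if r not in x_set]
        u = mann_whitney_u(x_ranks, y_ranks)
        counts[u] = counts.get(u, 0) + 1
        total += 1
    return {u: Fraction(c, total) for u, c in counts.items()}


def critical_value(M, N, alpha):
    """Smallest c with P0[U >= c] <= alpha."""
    pmf = null_distribution(M, N)
    tail = Fraction(0)
    c = M * N + 1  # P0[U >= M*N + 1] = 0, so this bound always qualifies
    for u in range(M * N, -1, -1):
        tail += pmf.get(u, Fraction(0))
        if tail > alpha:
            break
        c = u
    return c
```

Passing alpha as a `Fraction` keeps the comparison exact. For M = N = 3 there are only 20 equally likely rank sets, so the attainable levels are multiples of 1/20.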

Due to ethical concerns and resource management, common designs allow for early stopping in the presence of strong, early evidence. Spurrier and Hewett [3] provide a two-stage test based on the Mann-Whitney statistic. Wilding, et al. [4] discuss such a procedure in the context of clinical trials.

The two-stage test has two critical values, c₁ and c₂. First, gather m observations from the control group and n observations from the treatment group. Define

$$U_1 = \sum_{i=1}^{m} \sum_{j=1}^{n} I(X_i < Y_j). \qquad (2)$$

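If U₁ meets or exceeds the first critical value c₁, stop the trial early and declare the treatment group superior to the control group. If U₁ is less than c₁, gather M − m further observations from the control group and N − n further observations from the treatment group, where M and N denote the total control and treatment sample sizes. Define

$$U_2 = \sum_{i=1}^{M} \sum_{j=1}^{N} I(X_i < Y_j). \qquad (3)$$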

If U₂ is greater than or equal to the second critical value c₂, declare the treatment group superior to the control group.
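The decision rule above can be sketched in Python as follows. This is a minimal illustration rather than the authors' software: the function names are hypothetical, and for simplicity the function receives the full samples even though, in a real trial, the stage-two data would be collected only when the first stage does not reject. The helper `exact_size` computes the null rejection probability by brute-force enumeration of all rank orderings, which is feasible only for very small samples.

```python
from fractions import Fraction
from itertools import permutations


def mann_whitney_u(x, y):
    """Count the pairs (i, j) with x[i] < y[j]."""
    return sum(1 for xi in x for yj in y if xi < yj)


def two_stage_test(x, y, m, n, c1, c2):
    """Apply the two-stage rule.

    x, y : control and treatment responses in order of collection
           (lengths M and N); the first m and n form stage one.
    """
    u1 = mann_whitney_u(x[:m], y[:n])
    if u1 >= c1:
        # Early stop: strong evidence at the interim analysis.
        return {"stage": 1, "U1": u1, "U2": None, "reject": True}
    u2 = mann_whitney_u(x, y)
    return {"stage": 2, "U1": u1, "U2": u2, "reject": u2 >= c2}


def exact_size(M, N, m, n, c1, c2):
    """Exact null probability of rejecting at either stage.

    Under the null, all (M + N)! orderings of the pooled ranks are
    equally likely; the first M positions are the control sample.
    """
    hits = 0
    total = 0
    for ranks in permutations(range(M + N)):
        x, y = ranks[:M], ranks[M:]
        if two_stage_test(x, y, m, n, c1, c2)["reject"]:
            hits += 1
        total += 1
    return Fraction(hits, total)
```

The factorial growth of this enumeration is one way to see why exact critical values quickly become expensive and why approximations are attractive.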

The critical value of the one-dimensional Mann-Whitney statistic is easily calculated. The critical values for the two-stage test are more difficult to calculate. Due to the complexity of the mass function of the two-dimensional Mann-Whitney statistic, obtaining exact critical values is computationally intensive. Kolassa, et al. [5] present a plan for approximating these critical values using a bivariate Cornish–Fisher expansion; this expansion requires bivariate cumulants of U₁ and U₂. Furthermore, they use a bivariate Edgeworth expansion to approximate power; this expansion also requires bivariate cumulants, in this case for an alternative distribution. This manuscript provides tools for calculating these bivariate moments, and hence bivariate cumulants. Under the null hypothesis, the X_i and Y_j are jointly independent and identically distributed.

The second section defines certain indicator functions and gives their null expectations. The third section presents first- and second-order joint moments of the Mann-Whitney statistics. The fourth section presents third- and fourth-order mixed moments. All of these moments are calculated under conditions general enough to encompass both the null and alternative distributions. The fifth section discusses the calculation of cumulants from moments.

Indicator Function Definitions and Expectations

Let I_ij take the value 1 if X_i < Y_j, and 0 otherwise. Products of these indicators are indicators of more complicated events. For example, I_ij I_il I_kj = 1 means that all of X_i < Y_j, X_i < Y_l, and X_k < Y_j hold, and I_ij I_il I_kj = 0 means that at least one of them does not hold. Below, moments of U = (U₁, U₂) will be expressed as sums of such products. Each term will be split into factors with non-overlapping indices; such factors are independent, so the expectation of a term is the product of the expectations of its factors. Table 1 summarizes expectations of these factors. Zhong and Kolassa [6] perform these calculations in detail. Null values can be calculated using symmetry properties.
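To make the symmetry argument for null values concrete, the sketch below computes the null expectation of a product of indicators by enumerating every ordering of the distinct observations involved, all of which are equally likely when the responses are continuous and identically distributed. The function name and the labeling of observations by strings are illustrative choices, not notation from the paper.

```python
from fractions import Fraction
from itertools import permutations


def null_expectation(pairs):
    """Null expectation of the product of I(X_a < Y_b) over (a, b) in pairs.

    pairs : list of (x_label, y_label) tuples, e.g. [("i", "j"), ("i", "l")].
    """
    x_labels = sorted({a for a, _ in pairs})
    y_labels = sorted({b for _, b in pairs})
    variables = [("x", a) for a in x_labels] + [("y", b) for b in y_labels]
    hits = 0
    total = 0
    # Every ranking of the distinct variables is equally likely under the null.
    for ranks in permutations(range(len(variables))):
        rank = dict(zip(variables, ranks))
        if all(rank[("x", a)] < rank[("y", b)] for a, b in pairs):
            hits += 1
        total += 1
    return Fraction(hits, total)
```

For instance, `null_expectation([("i", "j")])` is 1/2, `null_expectation([("i", "j"), ("i", "l")])` is 1/3, and the three-factor product I_ij I_il I_kj used as an example above has null expectation 5/24.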

First- and Second-Order Moments

In general, using Table 1,

Table 1: Expectations of Products of Indicators.

Note that

By the same reasoning,

Higher Moments

As a tool for calculating E[U₂³] and E[U₂⁴], first define some sums that make up parts of these products. Let

Expectations of these sums of products of indicators can be calculated by separating the sums into quantities with indices replicated and independent quantities whose expectations are given in Table 1, to obtain:

Moments of U₁ are calculated by substituting m and n for M and N, respectively. Mixed moments are found using conditional expectations. To this end, introduce indicators of whether the observation ranked i in the first sample falls among those collected before the interim analysis, and similarly for the observation ranked j in the second sample:

$$A_i = I\bigl(X_{(i)} \in \{X_1, \ldots, X_m\}\bigr), \qquad B_j = I\bigl(Y_{(j)} \in \{Y_1, \ldots, Y_n\}\bigr).$$

Then the Mann–Whitney statistic calculated using data before the interim analysis is

$$U_1 = \sum_{i=1}^{M} \sum_{j=1}^{N} A_i B_j \, I(X_{(i)} < Y_{(j)}).$$

The law of iterated expectations will be used to calculate mixed moments, by first conditioning on order statistics of the two samples ordered separately:

Z = (X₍₁₎, …, X₍M₎, Y₍₁₎, …, Y₍N₎).

Calculation of mixed moments will proceed by expressing U₁^T U₂^S in terms of quantities from U₁, C, D, E, F, G, G*, H, H*, K, and K* as above, times the indicators A_i, one such quantity attached to each distinct value of the first index, and times the indicators B_j, one such quantity attached to each distinct value of the second index. Then the expectations of products such as A_i A_k with i ≠ k are expectations of products from a multinomial, and similarly with the B indicators.

Then λ_x, λ^*_x, and λ^†_x are the expectations of products of one, two, and three such A, respectively, and λ_y, λ^*_y, and λ^†_y are the expectations of products of one, two, and three such B. Then

E[U₁U₂] = E[E[U₁U₂ | Z]] = E[U₂²] λ_x λ_y.
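The identity above can be checked numerically in the null case. The sketch below enumerates all orderings of a small pooled sample and compares E[U₁U₂] with λ_x λ_y E[U₂²] using exact rational arithmetic. It assumes λ_x = m/M and λ_y = n/N, which is what exchangeability within each sample suggests for the expectation of a single A or B indicator; these values and the function names are assumptions of this illustration, not taken from the paper.

```python
from fractions import Fraction
from itertools import permutations


def mann_whitney_u(x, y):
    """Count the pairs (i, j) with x[i] < y[j]."""
    return sum(1 for xi in x for yj in y if xi < yj)


def null_mixed_moments(M, N, m, n):
    """Return exact null values of (E[U1 * U2], E[U2 ** 2]).

    All (M + N)! orderings of the pooled ranks are equally likely under
    the null; the first M positions are controls in collection order.
    """
    sum_u1u2 = 0
    sum_u2sq = 0
    total = 0
    for ranks in permutations(range(M + N)):
        x, y = ranks[:M], ranks[M:]
        u1 = mann_whitney_u(x[:m], y[:n])  # interim statistic
        u2 = mann_whitney_u(x, y)          # final statistic
        sum_u1u2 += u1 * u2
        sum_u2sq += u2 * u2
        total += 1
    return Fraction(sum_u1u2, total), Fraction(sum_u2sq, total)


M, N, m, n = 3, 3, 2, 2
e_u1u2, e_u2sq = null_mixed_moments(M, N, m, n)
lam_x, lam_y = Fraction(m, M), Fraction(n, N)  # assumed values of lambda_x, lambda_y
identity_holds = e_u1u2 == lam_x * lam_y * e_u2sq
```

With M = N = 3 and m = n = 2 the enumeration covers 720 orderings, and `identity_holds` evaluates to `True`.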

Also,

Next,

Multivariate Cumulants

Multivariate cumulants can then be calculated from these moments. Let

μ_{ij…k} = E[U_i U_j ⋯ U_k],

for indices i, j, …, k taking values in {1, 2}. Define the moment generating function

such that coefficients with indices permuted are equal. Analytic expressions for cumulants in terms of moments are simple in one dimension but are complex enough to be unusable in as few as two dimensions. Kolassa [7] presents software that performs these calculations numerically; its numerical code was generated directly by a symbolic calculus tool.
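One way to carry out the moment-to-cumulant conversion numerically is the standard set-partition formula, in which a joint cumulant is a signed sum over all partitions of the index positions, each term being a product of joint moments over the blocks. The Python sketch below implements that formula; it is not the method used by the software in [7], and the function names and the dictionary of moments are illustrative. The sample moments in the usage lines are null values for M = N = 3, m = n = 2, taken as E[U₁] = mn/2 = 2, E[U₂] = MN/2 = 9/2, and E[U₁U₂] = 34/3.

```python
from fractions import Fraction
from math import factorial


def set_partitions(items):
    """Yield every partition of the list `items` into non-empty blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Either `first` forms its own block ...
        yield [[first]] + partition
        # ... or it joins one of the existing blocks.
        for k in range(len(partition)):
            yield partition[:k] + [[first] + partition[k]] + partition[k + 1:]


def joint_cumulant(indices, moment):
    """Joint cumulant of U_i, U_j, ..., U_k for `indices` = (i, j, ..., k).

    moment : function mapping a sorted tuple of component labels, such as
             (1, 1, 2), to the joint moment E[U_1 U_1 U_2].
    """
    total = 0
    for partition in set_partitions(list(range(len(indices)))):
        b = len(partition)
        term = (-1) ** (b - 1) * factorial(b - 1)
        for block in partition:
            term *= moment(tuple(sorted(indices[p] for p in block)))
        total += term
    return total


# Usage: the second-order mixed cumulant (covariance) of U1 and U2.
moments = {(1,): Fraction(2), (2,): Fraction(9, 2), (1, 2): Fraction(34, 3)}
kappa_12 = joint_cumulant((1, 2), moments.__getitem__)
```

Here `kappa_12` equals 34/3 − 2 · 9/2 = 7/3. The number of partitions grows quickly with the order, which is acceptable for the third- and fourth-order cumulants needed here.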

References

[1] Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Statist 18(1): 50-60.

[2] Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics Bulletin 1(6): 80-83.

[3] Spurrier JD, Hewett JE (1976) Two-stage Wilcoxon tests of hypotheses. Journal of the American Statistical Association 71(356): 982-987.

[4] Wilding GE, Shan G, Hutson AD (2012) Exact two-stage designs for phase II activity trials with rank-based endpoints. Contemporary Clinical Trials 33(2): 332-341.

[5] Kolassa J, Chen X, Seifu Y, Zhong D (2020) Power calculations and critical values for two-stage nonparametric testing regimes. Under review.

[6] Zhong D, Kolassa J (2017) Moments and Cumulants of The Two-Stage Mann-Whitney Statistic. Technical report.

[7] Kolassa J (2018) Two Stage: Two Stage MWW. R package version 1.0.
