Kybernetika 54 no. 5, 908-920, 2018

A homogeneity test of large dimensional covariance matrices under non-normality

M. Rauf AhmadDOI: 10.14736/kyb-2018-5-0908

Abstract:

A test statistic for homogeneity of two or more covariance matrices is presented when the distributions may be non-normal and the dimension may exceed the sample size. Using the Frobenius norm of the difference of null and alternative hypotheses, the statistic is constructed as a linear combination of consistent, location-invariant, estimators of trace functions that constitute the norm. These estimators are defined as $U$-statistics and the corresponding theory is exploited to derive the normal limit of the statistic under a few mild assumptions as both sample size and dimension grow large. Simulations are used to assess the accuracy of the statistic.

Keywords:

high-dimensional inference, covariance testing, $U$-statistics, non-normality

Classification:

62H15

References:

  1. M. R. Ahmad: Location-invariant multi-sample $U$-tests for covariance matrices with large dimension. Scand. J. Stat. 44 (2017b), 500-523.   DOI:10.1111/sjos.12262
  2. M. R. Ahmad: Testing homogeneity of several covariance matrices and multi-sample sphericity for high-dimensional data under non-normality. Comm. Stat. Theory Methods 46 (2017a), 3738-3753.   DOI:10.1080/03610926.2015.1073310
  3. M. R. Ahmad: On testing sphericity and identity of a covariance matrix with large dimensions. Math. Meth. Stat. 25 (2016), 121-132.   DOI:10.3103/s1066530716020034
  4. T.W. Anderson: An Introduction to Multivariate Statistical Analysis. Third edition. Wiley, NY 2003.   CrossRef
  5. M. Aoshima and K. Yata: Two-stage procedures for high-dimensional data. Seq. An. 30 (2011), 356-399.   DOI:10.1080/07474946.2011.619088
  6. T. Cai, W. Liu and Y. Xia: Two-sample covariance matrix testing and support recovery in high-dimensional and sparse settings. J. Amer. Statist. Assoc. 108 (2013), 265-277.   DOI:10.1080/01621459.2012.758041
  7. Y. Fujikoshi, V. V. Ulyanov and R. Shimizu: Multivariate statistics: High-dimensional and large-sample approximations. Wiley, NY 2010.   CrossRef
  8. J. Hájek, Z. Šidák and P. K. Sen: Theory of Rank Tests. Academic Press, SD 1999.   CrossRef
  9. T. Y. Kim, Z-M. Luo and C. Kim: The central limit theorem for degenerate variable $U$-statistics under dependence. J. Nonparam. Stat. 23 (2011), 683-699.   DOI:10.1080/10485252.2011.556193
  10. V. S. Koroljuk and Y. V. Borovskich: Theory of $U$-statistics. Kluwer Academic Press, Dordrecht 1994.   DOI:10.1007/978-94-017-3515-5
  11. A. J. Lee: $U$-statistics: Theory and Practice. CRC Press, Boca Raton 1990.   CrossRef
  12. E. L. Lehmann: Elements of Large-sample Theory. Springer, NY 1999.   DOI:10.1007/b98855
  13. J. Li and S. X. Chen: Two sample tests for high-dimensional covariance matrices. Ann. Stat. 40 (2012), 908-940.   DOI:10.1214/12-aos993
  14. B. Liu, L. Xu, S. Zheng and G-L. Tian: A new test for the proportionality of two large-dimensional covariance matrices. J. Multiv. An. 131 (2014), 293-308.   DOI:10.1016/j.jmva.2014.06.008
  15. T. Mikosch: Weak invariance principles for weighted $U$-statistics. J. Theoret. Prob. 7 (1991), 147-173.   DOI:10.1007/bf02213365
  16. T. Mikosch: A weak invariance principle for weighted $U$-statistics with varying kernels. J. Multiv. An. 47 (1993), 82-102.   DOI:10.1006/jmva.1993.1072
  17. R. J. Muirhead: Aspects of Multivariate Statistical Theory. Wiley, NY 2005   DOI:10.1002/9780470316559
  18. A. Pinheiro, K. Sen and H. P. Pinheiro: Decomposibility of high-dimensional diversity measures: Quasi-$U$-statistics, martigales, and nonstandard asymptotics. J. Multiv. An. 100 (2009), 1645-1656.   DOI:10.1016/j.jmva.2009.01.007
  19. Y. Qiu and S. X. Chen: Test for bandedness of high-dimensional covariance matrices and bandwidth estimation. Ann. Stat. 40 (2012) 1285-1314.   DOI:10.1214/12-aos1002
  20. J. R. Schott: A test for the equality of covariance matrices when the dimension is large relative to the sample size. Computat. Statist. Data Analysis 51 (2007), 6535-6542.   DOI:10.1016/j.csda.2007.03.004
  21. G. A. F. Seber: Multivariate Observations. Wiley, NY 2004.   DOI:10.1002/9780470316641
  22. P. K. Sen: Robust statistical inference for high-dimensional data models with applications in genomics. Aust. J. Stat. 35 (2006), 197-214.   CrossRef
  23. R. J. Serfling: Approximation Theorems of Mathematical Statistics. Wiley, Weinheim 1980.   DOI:10.1002/9780470316481
  24. M. S. Srivastava and H. Yanagihara: Testing the equality of several covariance matrices with fewer observations than the dimension. J. Multiv. An. 101, 1319-1329.   DOI:10.1016/j.jmva.2009.12.010
  25. A. W. van der Vaart: Asymptotic Statistics. Cambridge University Press, 1998.   DOI:10.1017/cbo9780511802256
  26. P-S. Zhong and S. X. Chen: Tests for high-dimensional regression coefficients with factorial designs. J. Amer. Statist. Assoc. 106 (2011), 260-274.   DOI:10.1198/jasa.2011.tm10284