Kybernetika 47 no. 1, 110-122, 2011

Goodman-Kruskal Measure of Association for Fuzzy-Categorized Variables

S.~M. Taheri and G. Hesamian


The Goodman-Kruskal measure, which is a well-known measure of dependence for contingency tables, is generalized to the case when the variables of interest are categorized by linguistic terms rather than crisp sets. In addition, to test the hypothesis of independence in such contingency tables, a novel method of decision making is developed based on a concept of fuzzy $p$-value. The applicability of the proposed approach is explained using a numerical example.


fuzzy frequency, fuzzy category, fuzzy Goodman--Kruskal statistic, fuzzy $p$-value, fuzzy significance level, NSD index


93E12, 62A10


  1. \bibitem {agresti} A.~Agresti: Categorical Data Analysis. Second Edition. J. Wiley, New York 2002.   CrossRef
  2. M. B.~Brown and J. K.~Benedetti: Sampling behavior of tests for correlation in two-way contingency tables. J. Amer. Statist. Assoc. 72 (1977), 309-315.   CrossRef
  3. T.~Denoeux, M. H.~Masson and P. H.~Herbert: Non-parametric rank-based statistics and significance tests for fuzzy data. Fuzzy Sets and Systems 153 (2005), 1-28.   CrossRef
  4. D.~Dubois and H.~Prade: Ranking of fuzzy numbers in the setting of possibility theory. Inform. Sci. 30 (1983), 183-224.   CrossRef
  5. M. M.~Engelgau, T. J.~Thompson, W. H.~Herman, J. P.~Boyle, R. E.~Aubert, S. J.~Kenny, A.~Badran, E. S.~Sous and M. A.~Ali: Comparison of fasting and 2-hour glucose and HbA1c levels for diagnosing diabetes: diagnostic critera and performance revisited. Diabetes Care 20 (1997), 785-791.} \bibitem {gib} J. D.~Gibbons: \newblock{Nonparametric Measures of Association.} \newblock{Sage Publication, Newbury Park 1993.} \bibitem {goodman} L. A.~Goodman and W. H.~Kruskal: \newblock{Measures of association for cross classifications.} \newblock{J. Amer. Statist. Assoc. 49 (1954), 732-764.} \bibitem {goodman1} L. A.~Goodman and W. H.~Kruskal: \newblock{Measures of Association for Cross Classifications.} \newblock{Springer, New York 1979.   CrossRef
  6. P.~Grzegorzewski: Statistical inference about the median from vague data. Control Cybernet. 27 (1998), 447-464.   CrossRef
  7. P.~Grzegorzewski: Distribution-free tests for vague data. In: Soft Methodology and Random Information Systems (M.~Lopez-Diaz et al., eds.), Springer, Heidelberg 2004, pp. 495-502.   CrossRef
  8. P.~Grzegorzewski: Two-sample median test for vague data. In: Proc. $4$th Conf. European Society for Fuzzy Logic and Technology-Eusflat, Barcelona 2005, pp. 621-626.   CrossRef
  9. P.~Grzegorzewski: K-sample median test for vague data. Internat. J. Intelligent Systems 24 (2009), 529-539.   CrossRef
  10. M.~Holena: Fuzzy hypotheses testing in a framework of fuzzy logic. Fuzzy Sets and Systems 145 (2004), 229-252.} \bibitem {herin2} O.~ Hryniewicz: \newblock{Selection of variables for systems analysis, application of a fuzzy statistical test for independence.} \newblock{Proc. IPMU, Perugia 3 (2004), 2197-2204.   CrossRef
  11. O. Hryniewicz: Goodman-Kruskal $\gamma$ measure of dependence for fuzzy ordered categorical data. Comput. Statist. Data Anal. 51 (2006), 323-334.   CrossRef
  12. O.~Hryniewicz: Possibilistic decisions and fuzzy statistical tests. Fuzzy Sets and Systems 157 (2006), 2665-2673.   CrossRef
  13. C.~Kahranam, C. F.~Bozdag and D.~Ruan: Fuzzy sets approaches to statistical parametric and non-parametric tests. Internat. J. Intelligent Systems 19 (2004), 1069-1078.} \bibitem {kros} R.~Kruse and K. D.~ Meyer: \newblock{Statistics with Vague Data.} \newblock{Reidel Publishing, New York 1987.} \bibitem {kowang} K.~H.~Lee: \newblock{First Course on Fuzzy Theory and Applications.} \newblock{Springer, Heidelberg 2005.} \bibitem {mar} M.~Mare\v{s}: \newblock{Fuzzy data in statistics.} \newblock{Kybernetika 43 (2007), 491-502.   CrossRef
  14. S.~Pourahmad, S. M. T.~Ayatollahi and S.~M.~Taheri: Fuzzy logistic regression, a new possibilistic model and its application in clinical diagnosis. Iranian J. Fuzzy Systems, to appear.   CrossRef
  15. B. P.~Tabaei and W. H.~Herman: A multivariate logistic regression equation to screen for diabetes. Diabetes Care 25 (2002), 1999-2003.} \bibitem {ven} P.~Venkataraman: \newblock{Applied Optimization with MATLAB Programming.} \newblock{J. Wiley, New York 2002.   CrossRef
  16. X.~Wang and E.~Kerre: Reasonable properties for the ordering of fuzzy quantities (II). Fuzzy Sets and Systems 118 (2001), 387-405.   CrossRef
  17. Y.~Yoan: Criteria for evaluating fuzzy ranking methods. Fuzzy Sets and Systems 43 (1991), 139-157.   CrossRef
  18. R.~Viertl: Statistical Methods for Non-Precise Data. CRC Press, Boca Raton 1996.   CrossRef