Re: Shapiro-Wilks Statistic

Posted by Mike on
URL: http://spssx-discussion.165.s1.nabble.com/Shapiro-Wilks-Statistic-tp5739693p5739714.html

Peripherally related to the main topic of this thread but related to Rich's
mention of using IRT is the following article:

Harwell, M. R., & Gatti, G. G. (2001). Rescaling ordinal data to interval data
in educational research. Review of Educational Research, 71(1), 105-131.

Abstract
Many statistical procedures used in educational research are described as
requiring that dependent variables follow a normal distribution, implying an
interval scale of measurement. Despite the desirability of interval scales,
many dependent variables possess an ordinal scale of measurement in
which the differences among values composing the scale are unequal in
terms of what is being measured, permitting only a rank ordering of scores.
This means that data possessing an ordinal scale will not satisfy the assumption
of normality needed in many statistical procedures and may produce biased
statistical results that threaten the validity of inferences. This article shows
how the measurement technique known as item response theory can be used
to rescale ordinal data to an interval scale. The authors provide examples of
rescaling using student performance data and argue that educational
researchers should routinely consider rescaling ordinal data using item response theory.

Relative to the use of polychoric correlation, I suggest looking at the section
"Manifest and Latent Variables" on page 110 (6th page of the PDF).

The article is available at the following link:
https://journals.sagepub.com/doi/pdf/10.3102/00346543071001105?casa_token=3qrofVBrC-EAAAAA:bmhjh32Kds4PKnNwEQchySWqWCpycMjGCukaR1teK_-V_mBebbaxS7H4Pe845CYmjVg8TA2QRIm4

-Mike Palij
New York University
[hidden email]


On Thu, Oct 1, 2020 at 10:38 PM Rich Ulrich <[hidden email]> wrote:
Thanks -
I agree that univariate normality should not be at issue, and I
wonder anew at what your reviewer had in mind.  Showing the
extreme kurtosis should be enough (if there were any question).

From Bruce's citation - I have doubts that the assumptions of
underlying bivariate normality are met, when it comes to polychoric.
If the polychoric r's are larger, then they also have larger standard
errors than the Pearson r's.  I think I had one set of data that I ever
played with, testing polychoric for factoring, and I ended up using
Pearson's.

What also came to mind was Item (or Latent) Response Theory, which
uses logistic transformations. I know IRT from reading, too, rather than
hands-on experience.  I don't know if they have examples with U-shapes.

(a) Underlying Logistic is shaped like Normal with slightly fatter tails, so
it might not meet assumptions any better for U shaped data.  (b) If both
tails are always fat, the correlations won't change by much.  (c) The
people who perform IRT have some interesting ancillary statistics. 

--
Rich Ulrich

From: Baker, Harley <[hidden email]>
Sent: Thursday, October 1, 2020 2:36 PM
To: Rich Ulrich <[hidden email]>
Cc: [hidden email] <[hidden email]>
Subject: Re: Shapiro-Wilks Statistic
 
I have a number of religious identity labels that participants rate in terms of how well they fit. There are five response categories from ‘not at all’ to ‘very well.’ As is the case in most research like this, the distribution of responses is basically U-shaped. (Basically, a label either fits or it does not, very few in-between responses.) Mardia’s statistic shows a huge violation of multivariate normality, as one would expect. As such, univariate normality is not an issue, which is what I should have said originally. Pearson correlations underestimate the degree of linear relationship in these situations, and we often turn to polychoric correlations instead. (I am loathe to use nonparametric correlations when perfectly good parametric methods exist. When categorical data are well-behaved, I also prefer to use parametric models. I agree with one of my mentors, Fred Lord: the data don’t know where they came from. So, if they act parametric, treat them that way. If not, don’t.) Polychoric correlations provide a more accurate estimate of the true degree of linear association under these conditions. The matrix was then submitted to an EFA to see what patterns of self-labeling emerge, if any.

Hope this clarifies the situation a bit better.

Harley


Sent from my iPad

On Oct 1, 2020, at 10:52 AM, Rich Ulrich <[hidden email]> wrote:


CAUTION: This email originated from outside of CSUCI. Do not click links or open attachments unless you validate the sender and know the content is safe. Please forward this email to [hidden email] if you believe this email is suspicious. For more information on how to detect Phishing scams, please visit https://www.csuci.edu/its/security/phishing.htm

I'm a little confused.  You say,
"The data are ordinal in nature, so the issue of normality doesn't even
apply to my situation."

"... the issue doesn't even apply ... "
That implies to me that you are doing analyses after rank-transform
("nonparametric") without considering ordinary ANOVA -- and the
reviewer asks you to justify abandoning the raw scores.

If I were reviewer, I might ask the same.  I hope I would put the
request more clearly than what you seem to have received.

Bruce writes, unclear about the situation -
If it is to justify use of a parametric test, my advice would be, "Don't
bother!"  Like me, Bruce is biased against non-parametric testing for
decently-behaved ordinal data.
--
Rich Ulrich

From: SPSSX(r) Discussion <[hidden email]> on behalf of Baker, Harley <[hidden email]>
Sent: Thursday, October 1, 2020 10:38 AM
To: [hidden email] <[hidden email]>
Subject: Re: Shapiro-Wilks Statistic
 
Hi Bruce,

An editorial reviewer is requiring me to do so. Otherwise, I would not. The data are ordinal in nature, so the issue of normality doesn't even apply to my situation. Regardless, I am complying . . . 

Harley

Dr. Harley Baker
Professor Emeritus of Psychology
California State University Channel Islands
One University Drive
Camarillo, CA 93012
 


From: SPSSX(r) Discussion <[hidden email]> on behalf of Bruce Weaver <[hidden email]>
Sent: Thursday, October 1, 2020 5:26 AM
To: [hidden email] <[hidden email]>
Subject: Re: Shapiro-Wilks Statistic
 
CAUTION: This email originated from outside of CSUCI. Do not click links or open attachments unless you validate the sender and know the content is safe. Please forward this email to [hidden email] if you believe this email is suspicious. For more information on how to detect Phishing scams, please visit https://www.csuci.edu/its/security/phishing.htm


I'll repeat the question I asked earlier:

> Why do you [Harley & 3J LEMA] want to test for normality?

If it is to justify use of a parametric test, my advice would be, "Don't
bother!"   ;-)



3J LEMA wrote
> Any tips on when to use the Shapiro-Wilk statistic over the
> Kolmogorov-Smirnov Statistic in testing normality?
>
> Thank you.





-----
--
Bruce Weaver
[hidden email]
https://nam10.safelinks.protection.outlook.com/?url=http%3A%2F%2Fsites.google.com%2Fa%2Flakeheadu.ca%2Fbweaver%2F&amp;data=02%7C01%7CHarley.Baker%40CSUCI.EDU%7C009a2a7e1919432bf31108d866053d26%7Ce30f5bdb7f18435b84369d84aa7b96dd%7C1%7C1%7C637371519978360100&amp;sdata=Ix%2BBFHa0yn6LxO34w0WvuZtCNdoa5yokfx6gEGNiXWQ%3D&amp;reserved=0

"When all else fails, RTFM."

NOTE: My Hotmail account is not monitored regularly.
To send me an e-mail, please use the address shown above.

--
Sent from: https://nam10.safelinks.protection.outlook.com/?url=http%3A%2F%2Fspssx-discussion.1045642.n5.nabble.com%2F&amp;data=02%7C01%7CHarley.Baker%40CSUCI.EDU%7C009a2a7e1919432bf31108d866053d26%7Ce30f5bdb7f18435b84369d84aa7b96dd%7C1%7C1%7C637371519978360100&amp;sdata=LQAFSVr%2FSx2BNd18u642DIIbTYXsq4PG2plFjzpkWKg%3D&amp;reserved=0

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD
===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD
===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD