t-test with small n

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

t-test with small n

aschoff
Hello,

we are dealing with survey data for two independent groups we would
like to compare. Due to missing values we have very small and unevenly
distributed data for some parts of the survey (e.g. 5 responses for
one group, 8 responses for the control group, answers on likert
scales). We have a discussion whether a t-test can be used to compare
these two groups.

We assume that is is not appropriate to use the t-test but we would
like to understand the exact nature of the problem. We are aware of
the requirments for the t-test like normal distribution, equal
variances etc. We are also aware that this test has a low test-power,
however when we try it out the test turns out to be significant.

What would be the correct argument to refuse using the t-test for this
situation (if it is to be refused?). Is it just because we are having
difficulties to prove that we meet the requirements? What would you
consider a lower boundary that justifies to use a t-test? Would it be
more appropriate to use a non parametric test or is it just impossible
to show a systematic difference with a small N like this?

Thanks a lot for your input.
Robinson

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Reply | Threaded
Open this post in threaded view
|

Re: t-test with small n

Bruce Weaver
Administrator
aschoff wrote
Hello,

we are dealing with survey data for two independent groups we would
like to compare. Due to missing values we have very small and unevenly
distributed data for some parts of the survey (e.g. 5 responses for
one group, 8 responses for the control group, answers on likert
scales). We have a discussion whether a t-test can be used to compare
these two groups.

We assume that is is not appropriate to use the t-test but we would
like to understand the exact nature of the problem. We are aware of
the requirments for the t-test like normal distribution, equal
variances etc. We are also aware that this test has a low test-power,
however when we try it out the test turns out to be significant.

What would be the correct argument to refuse using the t-test for this
situation (if it is to be refused?). Is it just because we are having
difficulties to prove that we meet the requirements? What would you
consider a lower boundary that justifies to use a t-test? Would it be
more appropriate to use a non parametric test or is it just impossible
to show a systematic difference with a small N like this?

Thanks a lot for your input.
Robinson
You might find this BMJ note useful.

  http://www.bmj.com/cgi/content/full/338/apr06_1/a3166

--
Bruce Weaver
bweaver@lakeheadu.ca
http://sites.google.com/a/lakeheadu.ca/bweaver/

"When all else fails, RTFM."

PLEASE NOTE THE FOLLOWING: 
1. My Hotmail account is not monitored regularly. To send me an e-mail, please use the address shown above.
2. The SPSSX Discussion forum on Nabble is no longer linked to the SPSSX-L listserv administered by UGA (https://listserv.uga.edu/).
Reply | Threaded
Open this post in threaded view
|

Re: t-test with small n

David Hitchin
In reply to this post by aschoff
On Tue, 26 Jan 2010 10:56:23 +0100, Robinson Aschoff <[hidden email]>
wrote:
 Due to missing values we have very small and unevenly
> distributed data for some parts of the survey (e.g. 5 responses for
> one group, 8 responses for the control group, answers on likert
> scales). We have a discussion whether a t-test can be used to compare
> these two groups.

The whole point of the t-test is that it is designed for SMALL samples. For
larger samples, perhaps beginning at 30 or so, tests based directly on the
normal distribution are adequate, and as samples get larger, normal-based
tests become even more appropriate. As the t distribution approximates the
normal distribution when samples get very large, it is quite OK to continue
using the t distribution.

One of the apparent paradoxes in this situation is that you need much
larger samples to check that the test is appropriate than to actually do
the test. Often there may be reasons to assume that the populations are
normal - or reasonably close to it - without having to check this on the
particular samples. The tests might have been used many times before in
similar situations, and therefore there might be good reason to assume that
the distributions are the same in the new samples, with the exception of
possibly different means and variances.

As tests of normality are not very powerful with small samples, you might
be tempted to eye-ball the data. BEWARE! You need quite a lot of experience
before you can judge whether your samples deviate much from normality. To
check my assertion, you can use SPSS to generate many samples from normal
distributions, and draw the histograms. See how many YOU think are normal,
or even better, ask a friend to judge. Of course this experiment needs to
be tried several times with different sample sizes. Most people who haven't
worked systematically through a lot of teaching data reject far too many
samples as "non-normal".

The t-test is pretty robust, but there are exceptions. One case is where
the distributions are highly skewed. Another is where the variances are
very different - in which case SPSS provides a modified version of the test
which takes this into account.

If you are doing the test to see if there is significance, say at the 5%
level, and you get results better than 1% or worse than 10% your
conclusions should be quite clear - even if the p-value isn't right to four
decimal places. If the results are so obviously significant or not
significant, there is little point in seeking more refined tests to get
more accurate p-values.

David Hitchin

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD