basic que-Factor Analysis

classic Classic list List threaded Threaded
12 messages Options
Reply | Threaded
Open this post in threaded view
|

basic que-Factor Analysis

Mehul Pajwani
Hi,

I am trying to run (for the first time) factor analysis in SPSS.18 and
would appreciate any feedback and recommendations from seasoned SPSS
users and statisticians on the following

I started with 93 continuous variables (5-point, 6-point and 7-point
rating scale type questions) and trying to reach to a reasonable
factor solution. Here are key questions I am struggling with.

1) I started with 93 variables. Are these too many variables to start
with? Is there any guideline about (minimum and maximum) number of
variables?

2) Though all variables I am using are continuous (rating scale) they
are of different scale points and types (e.g. 5-point ratings where
1=Strongly Disagree and 5=Strongly Agree; 6-point scale where 1=Will
not influence at all and 6=Will strongly influence etc.).  I
understand that this will have very different variance and therefore
can possible affect the results. Having said that do I need to
standardize these variables or factors analysis will automatically
take care of it?

3) I selected an option for Rotated Component Matrix but it failed to
produce that with the message “Rotation failed to converge in 25
iterations. (Convergence = .001).” Surprisingly, I ran the factor
analysis with the same data couple days before (with 92 variables, I
guess) and it did produce Rotated Component Matrix.  Any thoughts on
why it is showing that error and how to handle this?

4) My ultimate goal is to identify broader dimensions based on factor
analysis and then use these composite variables for the segmentation
by doing cluster analysis. Is this the right approach for the
segmentation?

I would appreciate any help on this.

Regards,

Mehul

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Garry Gelade
Mehul

A few suggestions:

1) The maximum number of variables you can analyse to give a robust solution
depends on the number of cases (subjects) and partly on how clear the factor
structure is.  A common suggestion is at least 10 cases per variable, though
5 can work fine in some situations.

2) If you choose Analyze Correlation Matrix on the Extraction submenu, the
standardization will be done automatically.

3) Your 93rd variable may be problematic. Eg if it has many missing values,
and you are using the default Listwise (on the Options submenu) it will
reduce the number of cases available for analysis. Or it may simply be a few
more iterations are needed. Try increasing the number of iterations on the
Rotation submenu to say 100. If the solution still doesn't converge, it is
likely your extra variable is causing the problem.

4) Yes you can cluster on the factors, provided your factor solution "makes
sense" (i.e. is substantively interpretable).

HTH

Garry Gelade


-----Original Message-----
From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of
Mehul Pajwani
Sent: 17 August 2011 03:17
To: [hidden email]
Subject: basic que-Factor Analysis

Hi,

I am trying to run (for the first time) factor analysis in SPSS.18 and
would appreciate any feedback and recommendations from seasoned SPSS
users and statisticians on the following

I started with 93 continuous variables (5-point, 6-point and 7-point
rating scale type questions) and trying to reach to a reasonable
factor solution. Here are key questions I am struggling with.

1) I started with 93 variables. Are these too many variables to start
with? Is there any guideline about (minimum and maximum) number of
variables?

2) Though all variables I am using are continuous (rating scale) they
are of different scale points and types (e.g. 5-point ratings where
1=Strongly Disagree and 5=Strongly Agree; 6-point scale where 1=Will
not influence at all and 6=Will strongly influence etc.).  I
understand that this will have very different variance and therefore
can possible affect the results. Having said that do I need to
standardize these variables or factors analysis will automatically
take care of it?

3) I selected an option for Rotated Component Matrix but it failed to
produce that with the message "Rotation failed to converge in 25
iterations. (Convergence = .001)." Surprisingly, I ran the factor
analysis with the same data couple days before (with 92 variables, I
guess) and it did produce Rotated Component Matrix.  Any thoughts on
why it is showing that error and how to handle this?

4) My ultimate goal is to identify broader dimensions based on factor
analysis and then use these composite variables for the segmentation
by doing cluster analysis. Is this the right approach for the
segmentation?

I would appreciate any help on this.

Regards,

Mehul

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

news
In reply to this post by Mehul Pajwani
Mehul,

You might look at

Anna B. Costello and Jason W. Osborne:
Best Practices in Exploratory Factor Analysis: Four Recommendations for
Getting the Most From Your Analysis
http://pareonline.net/pdf/v10n7.pdf

the tutorials of A.J.Schwab , U of Texas :
http://www.utexas.edu/courses/schwab/sw388r7/SolvingProblems/PrincipalComponentAnalysis.ppt
http://www.utexas.edu/courses/schwab/sw388r7/SolvingProblems/PrincipalComponentAnalysis_Outliers_Validation_Reliability.ppt

Jae-On Kim, Charles W. Mueller :
Factor Analysis. Statistical Methods and Practical Issues
Quantitative Applications in the Social Sciences
Volume 14
Sage

Jae-On Kim,  Charles W. Mueller:
Introduction to Factor Analysis. What It Is and How To Do It
Quantitative Applications in the Social Sciences
1978
to get some ideas of the conditions and limits.

HTH
Dr Frank Thomas
FTR Internet Research
Rosny-sous-Bois
France


On 17/08/2011 04:16, Mehul Pajwani wrote:

> Hi,
>
> I am trying to run (for the first time) factor analysis in SPSS.18 and
> would appreciate any feedback and recommendations from seasoned SPSS
> users and statisticians on the following
>
> I started with 93 continuous variables (5-point, 6-point and 7-point
> rating scale type questions) and trying to reach to a reasonable
> factor solution. Here are key questions I am struggling with.
>
> 1) I started with 93 variables. Are these too many variables to start
> with? Is there any guideline about (minimum and maximum) number of
> variables?
>
> 2) Though all variables I am using are continuous (rating scale) they
> are of different scale points and types (e.g. 5-point ratings where
> 1=Strongly Disagree and 5=Strongly Agree; 6-point scale where 1=Will
> not influence at all and 6=Will strongly influence etc.).  I
> understand that this will have very different variance and therefore
> can possible affect the results. Having said that do I need to
> standardize these variables or factors analysis will automatically
> take care of it?
>
> 3) I selected an option for Rotated Component Matrix but it failed to
> produce that with the message “Rotation failed to converge in 25
> iterations. (Convergence = .001).” Surprisingly, I ran the factor
> analysis with the same data couple days before (with 92 variables, I
> guess) and it did produce Rotated Component Matrix.  Any thoughts on
> why it is showing that error and how to handle this?
>
> 4) My ultimate goal is to identify broader dimensions based on factor
> analysis and then use these composite variables for the segmentation
> by doing cluster analysis. Is this the right approach for the
> segmentation?
>
> I would appreciate any help on this.
>
> Regards,
>
> Mehul
>
> =====================
> To manage your subscription to SPSSX-L, send a message to
> [hidden email] (not to SPSSX-L), with no body text except the
> command. To leave the list, send the command
> SIGNOFF SPSSX-L
> For a list of commands to manage subscriptions, send the command
> INFO REFCARD
>

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Art Kendall
In reply to this post by Mehul Pajwani
Do you mean Q Factor analysis which is an old way of clustering cases?

Art Kendall
Social Research Consultants

On 8/16/2011 10:16 PM, Mehul Pajwani wrote:
Hi,

I am trying to run (for the first time) factor analysis in SPSS.18 and
would appreciate any feedback and recommendations from seasoned SPSS
users and statisticians on the following

I started with 93 continuous variables (5-point, 6-point and 7-point
rating scale type questions) and trying to reach to a reasonable
factor solution. Here are key questions I am struggling with.

1) I started with 93 variables. Are these too many variables to start
with? Is there any guideline about (minimum and maximum) number of
variables?

2) Though all variables I am using are continuous (rating scale) they
are of different scale points and types (e.g. 5-point ratings where
1=Strongly Disagree and 5=Strongly Agree; 6-point scale where 1=Will
not influence at all and 6=Will strongly influence etc.).  I
understand that this will have very different variance and therefore
can possible affect the results. Having said that do I need to
standardize these variables or factors analysis will automatically
take care of it?

3) I selected an option for Rotated Component Matrix but it failed to
produce that with the message “Rotation failed to converge in 25
iterations. (Convergence = .001).” Surprisingly, I ran the factor
analysis with the same data couple days before (with 92 variables, I
guess) and it did produce Rotated Component Matrix.  Any thoughts on
why it is showing that error and how to handle this?

4) My ultimate goal is to identify broader dimensions based on factor
analysis and then use these composite variables for the segmentation
by doing cluster analysis. Is this the right approach for the
segmentation?

I would appreciate any help on this.

Regards,

Mehul

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD
Art Kendall
Social Research Consultants
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Mehul Pajwani
by first three que, I am referring to R Factor Analysis (grouping variables) and once I am done with that, I intend to use those composite variables for cluster analysis.
 
Thanks,
 
Mehul

On Wed, Aug 17, 2011 at 11:22 AM, Art Kendall <[hidden email]> wrote:
Do you mean Q Factor analysis  which is an old way of clustering cases?

Art Kendall
Social Research Consultants

On 8/16/2011 10:16 PM, Mehul Pajwani wrote:
Hi,

I am trying to run (for the first time) factor analysis in SPSS.18 and
would appreciate any feedback and recommendations from seasoned SPSS
users and statisticians on the following

I started with 93 continuous variables (5-point, 6-point and 7-point
rating scale type questions) and trying to reach to a reasonable
factor solution. Here are key questions I am struggling with.

1) I started with 93 variables. Are these too many variables to start
with? Is there any guideline about (minimum and maximum) number of
variables?

2) Though all variables I am using are continuous (rating scale) they
are of different scale points and types (e.g. 5-point ratings where
1=Strongly Disagree and 5=Strongly Agree; 6-point scale where 1=Will
not influence at all and 6=Will strongly influence etc.).  I
understand that this will have very different variance and therefore
can possible affect the results. Having said that do I need to
standardize these variables or factors analysis will automatically
take care of it?

3) I selected an option for Rotated Component Matrix but it failed to
produce that with the message “Rotation failed to converge in 25
iterations. (Convergence = .001).” Surprisingly, I ran the factor
analysis with the same data couple days before (with 92 variables, I
guess) and it did produce Rotated Component Matrix.  Any thoughts on
why it is showing that error and how to handle this?

4) My ultimate goal is to identify broader dimensions based on factor
analysis and then use these composite variables for the segmentation
by doing cluster analysis. Is this the right approach for the
segmentation?

I would appreciate any help on this.

Regards,

Mehul

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD

Reply | Threaded
Open this post in threaded view
|

LOOKING FOR FREELANCE ANALYSTS

Lorin Drake

We are looking for experienced freelance quantitative research analysts to write quantitative reports for an upcoming health care study.

Must have your own copy of SPSS, PowerPoint, and Excel. Health care experience a plus. Experience with correlation, factor/regression helpful.

Timeframe is Sept/Oct. For more information, please send resume to [hidden email]

 

 

Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Rich Ulrich
In reply to this post by Mehul Pajwani
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich

Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Mehul Pajwani
Makes sense!  Thanks very much to Frank, Farry and Rich!
 
 
Finally, i ran the factor analys and (Hope, I haven't missed anything) here is what I did.  I would appreciate if you can take a quick look and let me know if I have missed anything or could have done this in a better way.  I can send you my output in Excel or SPSS file, if required.
 
 
Iteration-1 (98 variables)
- Ran factor analysis with 98 items. Prior to performing PCA, the sutability of data for factor analysis was assessed. Inspection of r-matrix revealed the presence of many coefficients of 0.3 and above. KMO value was 0.961 and the Bartlett's Test of Sphericity as significant.
- Variables were validated based on communalities and 19 variables with loadings of less than 0.55 were dropped. 2nd Iteration was ran with 79 variables (98-19=79)
 
Iteration-2 (79 variables)
-Communalities- All 79 variables have communilities more than 0.50
-Measures of Sampling Adequacy (KMO- on the diagonal of the anti-image correlation matrix) was checked. KMO for all 79 variables were greater than 0.50.
-After removing variables with low communalities and checking KMOs, the factor solution was examined to remove any components that have only a single variable loading on them.  Component 13 was removed since it had only one variable. Componont 14 has no variables at all.
 
Iteration-3  (78 variables)
-Communalities- All 78 variables have communilities more than 0.50
- The measures of sampling adequacy (KMO) for all the variables are greater than 0.50. (actually, greater than 0.7, which is good)
- All 13 componants have 2 to 24 variables
 
Final Solution:
13 compononts with variables loadings as below. I got 24 variables for Componant-1 and 15 for compononant-2. Are these too many?
Componant  Number of Variables
1   -24
2   -15
3   -6
4   -7
5   -5
6   -6
7   -3
8   -2
9   -2
10  - 2
11  - 2
12  - 2
13  - 2
Total-    78
 
 


 
On Thu, Aug 18, 2011 at 6:30 PM, Rich Ulrich <[hidden email]> wrote:
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich


Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Art Kendall
What are you using the factor analysis for? (to create a summative score?)  Is this a pre-existing instrument or did you create it? If you are creating summative scales did you remove "splitters" from the final key?


What did you use for a stopping rule? 

Do the factors make sense?


try running parallel analysis.  With moderate Ns (several hundred to a few thousand), I have usually retained the number of factors where the obtained eigenvalue is one more than what would come from random data or random permutations of the same data.
http://flash.lakeheadu.ca/~boconno2/nfactors.html



Art Kendall
Social Research Consultants

On 8/19/2011 5:20 PM, Mehul Pajwani wrote:
Makes sense!  Thanks very much to Frank, Farry and Rich!
 
 
Finally, i ran the factor analys and (Hope, I haven't missed anything) here is what I did.  I would appreciate if you can take a quick look and let me know if I have missed anything or could have done this in a better way.  I can send you my output in Excel or SPSS file, if required.
 
 
Iteration-1 (98 variables)
- Ran factor analysis with 98 items. Prior to performing PCA, the sutability of data for factor analysis was assessed. Inspection of r-matrix revealed the presence of many coefficients of 0.3 and above. KMO value was 0.961 and the Bartlett's Test of Sphericity as significant.
- Variables were validated based on communalities and 19 variables with loadings of less than 0.55 were dropped. 2nd Iteration was ran with 79 variables (98-19=79)
 
Iteration-2 (79 variables)
-Communalities- All 79 variables have communilities more than 0.50
-Measures of Sampling Adequacy (KMO- on the diagonal of the anti-image correlation matrix) was checked. KMO for all 79 variables were greater than 0.50.
-After removing variables with low communalities and checking KMOs, the factor solution was examined to remove any components that have only a single variable loading on them.  Component 13 was removed since it had only one variable. Componont 14 has no variables at all.
 
Iteration-3  (78 variables)
-Communalities- All 78 variables have communilities more than 0.50
- The measures of sampling adequacy (KMO) for all the variables are greater than 0.50. (actually, greater than 0.7, which is good)
- All 13 componants have 2 to 24 variables
 
Final Solution:
13 compononts with variables loadings as below. I got 24 variables for Componant-1 and 15 for compononant-2. Are these too many?
Componant  Number of Variables
1   -24
2   -15
3   -6
4   -7
5   -5
6   -6
7   -3
8   -2
9   -2
10  - 2
11  - 2
12  - 2
13  - 2
Total-    78
 
 


 
On Thu, Aug 18, 2011 at 6:30 PM, Rich Ulrich <[hidden email]> wrote:
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich


===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD
Art Kendall
Social Research Consultants
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Mehul Pajwani
The ultimate objective is to segment the respondents based on their perception about current and future economy and how they are/will be dealing with the changes in the economy.

I intend to define the broad dimensions based on the factor analysis and then to use the same for cluster analysis.

Yes, factors do make sense and are seem to be meaningful.

BTW, all these 14 factors have eigenvalues more than 1. Would parallel analysis help me reduce the number of variables loading under a given factor? Would it be a good idea to use the current factor solution?

Please let me know your thoughts on this

Thanks,

Mehul

On Sat, Aug 20, 2011 at 7:00 AM, Art Kendall <[hidden email]> wrote:
What are you using the factor analysis for? (to create a summative score?)  Is this a pre-existing instrument or did you create it? If you are creating summative scales did you remove "splitters" from the final key?


What did you use for a stopping rule? 

Do the factors make sense?


try running parallel analysis.  With moderate Ns (several hundred to a few thousand), I have usually retained the number of factors where the obtained eigenvalue is one more than what would come from random data or random permutations of the same data.
http://flash.lakeheadu.ca/~boconno2/nfactors.html



Art Kendall
Social Research Consultants

On 8/19/2011 5:20 PM, Mehul Pajwani wrote:
Makes sense!  Thanks very much to Frank, Farry and Rich!
 
 
Finally, i ran the factor analys and (Hope, I haven't missed anything) here is what I did.  I would appreciate if you can take a quick look and let me know if I have missed anything or could have done this in a better way.  I can send you my output in Excel or SPSS file, if required.
 
 
Iteration-1 (98 variables)
- Ran factor analysis with 98 items. Prior to performing PCA, the sutability of data for factor analysis was assessed. Inspection of r-matrix revealed the presence of many coefficients of 0.3 and above. KMO value was 0.961 and the Bartlett's Test of Sphericity as significant.
- Variables were validated based on communalities and 19 variables with loadings of less than 0.55 were dropped. 2nd Iteration was ran with 79 variables (98-19=79)
 
Iteration-2 (79 variables)
-Communalities- All 79 variables have communilities more than 0.50
-Measures of Sampling Adequacy (KMO- on the diagonal of the anti-image correlation matrix) was checked. KMO for all 79 variables were greater than 0.50.
-After removing variables with low communalities and checking KMOs, the factor solution was examined to remove any components that have only a single variable loading on them.  Component 13 was removed since it had only one variable. Componont 14 has no variables at all.
 
Iteration-3  (78 variables)
-Communalities- All 78 variables have communilities more than 0.50
- The measures of sampling adequacy (KMO) for all the variables are greater than 0.50. (actually, greater than 0.7, which is good)
- All 13 componants have 2 to 24 variables
 
Final Solution:
13 compononts with variables loadings as below. I got 24 variables for Componant-1 and 15 for compononant-2. Are these too many?
Componant  Number of Variables
1   -24
2   -15
3   -6
4   -7
5   -5
6   -6
7   -3
8   -2
9   -2
10  - 2
11  - 2
12  - 2
13  - 2
Total-    78
 
 


 
On Thu, Aug 18, 2011 at 6:30 PM, Rich Ulrich <[hidden email]> wrote:
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich



Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Art Kendall
It would not be a good idea to use the current factor solution.

The Kaiser criterion is used to limit the number of factors to extract.  It is based on the idea that one would not be interested in a new dimension that accounted for on one "variables worth" of the variance. It merely avoids inefficient use of computer time.

The parallel analysis test and scree test will give you a ball park number of factors to retain.  The final number of retained factors is based on substantive interpretation of the constructs that might underlie set of clean loading items.  You need to look at the interpretability of the overall factor solution and of each candidate factor.


Art Kendall
Social Research Consultants

On 8/21/2011 12:40 AM, Mehul Pajwani wrote:
The ultimate objective is to segment the respondents based on their perception about current and future economy and how they are/will be dealing with the changes in the economy.

I intend to define the broad dimensions based on the factor analysis and then to use the same for cluster analysis.

Yes, factors do make sense and are seem to be meaningful.

BTW, all these 14 factors have eigenvalues more than 1. Would parallel analysis help me reduce the number of variables loading under a given factor? Would it be a good idea to use the current factor solution?

Please let me know your thoughts on this

Thanks,

Mehul

On Sat, Aug 20, 2011 at 7:00 AM, Art Kendall <[hidden email]> wrote:
What are you using the factor analysis for? (to create a summative score?)  Is this a pre-existing instrument or did you create it? If you are creating summative scales did you remove "splitters" from the final key?


What did you use for a stopping rule? 

Do the factors make sense?


try running parallel analysis.  With moderate Ns (several hundred to a few thousand), I have usually retained the number of factors where the obtained eigenvalue is one more than what would come from random data or random permutations of the same data.
http://flash.lakeheadu.ca/~boconno2/nfactors.html



Art Kendall
Social Research Consultants

On 8/19/2011 5:20 PM, Mehul Pajwani wrote:
Makes sense!  Thanks very much to Frank, Farry and Rich!
 
 
Finally, i ran the factor analys and (Hope, I haven't missed anything) here is what I did.  I would appreciate if you can take a quick look and let me know if I have missed anything or could have done this in a better way.  I can send you my output in Excel or SPSS file, if required.
 
 
Iteration-1 (98 variables)
- Ran factor analysis with 98 items. Prior to performing PCA, the sutability of data for factor analysis was assessed. Inspection of r-matrix revealed the presence of many coefficients of 0.3 and above. KMO value was 0.961 and the Bartlett's Test of Sphericity as significant.
- Variables were validated based on communalities and 19 variables with loadings of less than 0.55 were dropped. 2nd Iteration was ran with 79 variables (98-19=79)
 
Iteration-2 (79 variables)
-Communalities- All 79 variables have communilities more than 0.50
-Measures of Sampling Adequacy (KMO- on the diagonal of the anti-image correlation matrix) was checked. KMO for all 79 variables were greater than 0.50.
-After removing variables with low communalities and checking KMOs, the factor solution was examined to remove any components that have only a single variable loading on them.  Component 13 was removed since it had only one variable. Componont 14 has no variables at all.
 
Iteration-3  (78 variables)
-Communalities- All 78 variables have communilities more than 0.50
- The measures of sampling adequacy (KMO) for all the variables are greater than 0.50. (actually, greater than 0.7, which is good)
- All 13 componants have 2 to 24 variables
 
Final Solution:
13 compononts with variables loadings as below. I got 24 variables for Componant-1 and 15 for compononant-2. Are these too many?
Componant  Number of Variables
1   -24
2   -15
3   -6
4   -7
5   -5
6   -6
7   -3
8   -2
9   -2
10  - 2
11  - 2
12  - 2
13  - 2
Total-    78
 
 


 
On Thu, Aug 18, 2011 at 6:30 PM, Rich Ulrich <[hidden email]> wrote:
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich



===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD
Art Kendall
Social Research Consultants
Reply | Threaded
Open this post in threaded view
|

Re: basic que-Factor Analysis

Mehul Pajwani
Thanks for your advise Art!

May I ask you the main reason why it is not a good idea to use the current factor solution?  I mean, is it because of too many variables on first two factors?

I also looked at the scree plot and it tells to retain 4-5 factors 

Would you like to see the scree plot and/or Rotated Component Matrix?   I would appreciate your help on this.

thanks,

Mehul

On Tue, Aug 23, 2011 at 4:26 PM, Art Kendall <[hidden email]> wrote:
It would not be a good idea to use the current factor solution.

The Kaiser criterion is used to limit the number of factors to extract.  It is based on the idea that one would not be interested in a new dimension that accounted for on one "variables worth" of the variance. It merely avoids inefficient use of computer time.

The parallel analysis test and scree test will give you a ball park number of factors to retain.  The final number of retained factors is based on substantive interpretation of the constructs that might underlie set of clean loading items.  You need to look at the interpretability of the overall factor solution and of each candidate factor.



Art Kendall
Social Research Consultants

On 8/21/2011 12:40 AM, Mehul Pajwani wrote:
The ultimate objective is to segment the respondents based on their perception about current and future economy and how they are/will be dealing with the changes in the economy.

I intend to define the broad dimensions based on the factor analysis and then to use the same for cluster analysis.

Yes, factors do make sense and are seem to be meaningful.

BTW, all these 14 factors have eigenvalues more than 1. Would parallel analysis help me reduce the number of variables loading under a given factor? Would it be a good idea to use the current factor solution?

Please let me know your thoughts on this

Thanks,

Mehul

On Sat, Aug 20, 2011 at 7:00 AM, Art Kendall <[hidden email]> wrote:
What are you using the factor analysis for? (to create a summative score?)  Is this a pre-existing instrument or did you create it? If you are creating summative scales did you remove "splitters" from the final key?


What did you use for a stopping rule? 

Do the factors make sense?


try running parallel analysis.  With moderate Ns (several hundred to a few thousand), I have usually retained the number of factors where the obtained eigenvalue is one more than what would come from random data or random permutations of the same data.
http://flash.lakeheadu.ca/~boconno2/nfactors.html



Art Kendall
Social Research Consultants

On 8/19/2011 5:20 PM, Mehul Pajwani wrote:
Makes sense!  Thanks very much to Frank, Farry and Rich!
 
 
Finally, i ran the factor analys and (Hope, I haven't missed anything) here is what I did.  I would appreciate if you can take a quick look and let me know if I have missed anything or could have done this in a better way.  I can send you my output in Excel or SPSS file, if required.
 
 
Iteration-1 (98 variables)
- Ran factor analysis with 98 items. Prior to performing PCA, the sutability of data for factor analysis was assessed. Inspection of r-matrix revealed the presence of many coefficients of 0.3 and above. KMO value was 0.961 and the Bartlett's Test of Sphericity as significant.
- Variables were validated based on communalities and 19 variables with loadings of less than 0.55 were dropped. 2nd Iteration was ran with 79 variables (98-19=79)
 
Iteration-2 (79 variables)
-Communalities- All 79 variables have communilities more than 0.50
-Measures of Sampling Adequacy (KMO- on the diagonal of the anti-image correlation matrix) was checked. KMO for all 79 variables were greater than 0.50.
-After removing variables with low communalities and checking KMOs, the factor solution was examined to remove any components that have only a single variable loading on them.  Component 13 was removed since it had only one variable. Componont 14 has no variables at all.
 
Iteration-3  (78 variables)
-Communalities- All 78 variables have communilities more than 0.50
- The measures of sampling adequacy (KMO) for all the variables are greater than 0.50. (actually, greater than 0.7, which is good)
- All 13 componants have 2 to 24 variables
 
Final Solution:
13 compononts with variables loadings as below. I got 24 variables for Componant-1 and 15 for compononant-2. Are these too many?
Componant  Number of Variables
1   -24
2   -15
3   -6
4   -7
5   -5
6   -6
7   -3
8   -2
9   -2
10  - 2
11  - 2
12  - 2
13  - 2
Total-    78
 
 


 
On Thu, Aug 18, 2011 at 6:30 PM, Rich Ulrich <[hidden email]> wrote:
"Failure to converge in 25 interations"  is not fatal.

It might be an indication that you need 50 or 100 iterations,
when two competing structures seem to  fit. 

Of course, it could also indicate that var 93 had a lot of
missing (as someone suggested), so that the analysis is a lot
less stable than the analysis that worked, using 92 vars.

--
Rich Ulrich