SPSS-Cluster Analysis-Query

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

SPSS-Cluster Analysis-Query

VIKRAMPATIL
Hi,

We are having data of 1,20,000 records

We need to run cluster analysis with 3 variables

Now for deciding the number of cluster when I am running the Hierarchical cluster analysis ,getting the msg. iinsufficient memory

Can you please let us know any solution 'How can we decide the number of clusters for 1,20,000 recirds and 3 variables

Regards,
Vikram

Reply | Threaded
Open this post in threaded view
|

Re: SPSS-Cluster Analysis-Query

Poling, Taylor Leigh
Try the Two-Step cluster algorithm in SPSS - I'm pretty sure it was designed specifically to handle large datasets.
 
Taylor


From: SPSSX(r) Discussion on behalf of VIKRAMPATIL
Sent: Tue 7/21/2009 10:24 AM
To: [hidden email]
Subject: SPSS-Cluster Analysis-Query

Hi,

We are having data of 1,20,000 records

We need to run cluster analysis with 3 variables

Now for deciding the number of cluster when I am running the Hierarchical cluster analysis ,getting the msg. iinsufficient memory

Can you please let us know any solution 'How can we decide the number of clusters for 1,20,000 recirds and 3 variables

Regards,
Vikram

Reply | Threaded
Open this post in threaded view
|

Re: SPSS-Cluster Analysis-Query

Oliver, Richard

Yes, Twostep Cluster is designed to work with large datasets. By default, it will also automatically determine the number of clusters.

 


From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of Poling, Taylor Leigh
Sent: Tuesday, July 21, 2009 11:55 AM
To: [hidden email]
Subject: Re: SPSS-Cluster Analysis-Query

 

Try the Two-Step cluster algorithm in SPSS - I'm pretty sure it was designed specifically to handle large datasets.

 

Taylor

 


From: SPSSX(r) Discussion on behalf of VIKRAMPATIL
Sent: Tue 7/21/2009 10:24 AM
To: [hidden email]
Subject: SPSS-Cluster Analysis-Query

Hi,

We are having data of 1,20,000 records

We need to run cluster analysis with 3 variables

Now for deciding the number of cluster when I am running the Hierarchical cluster analysis ,getting the msg. iinsufficient memory

Can you please let us know any solution 'How can we decide the number of clusters for 1,20,000 recirds and 3 variables

Regards,
Vikram

 

Reply | Threaded
Open this post in threaded view
|

Re: SPSS-Cluster Analysis-Query

Guerrero, Rodrigo
In reply to this post by VIKRAMPATIL

Hello all,

 

I have two statistical questions about Logistic Regression, although it is not necessarily SPSS related.  I am running a model to predict response of our direct mail campaign and have both ratio as well as categorical independent variables.  Some of these categorical variables have missing data for cases. In some variables, more than half the data is missing.

 

1)       What do you think the implications are of treating missing as a valid value and including those records in the equation?

 

2)      I am concerned about over fitting the model.  How can I test the model to avoid this pitfall?

 

 

Thank you very much for your help.

 

 

Rodrigo

 

 

Rodrigo A. Guerrero | Director Of Marketing Research and Analysis | The Scooter Store | 830.627.4317

 

 


The information transmitted is intended only for the addressee(s) and may contain confidential or privileged material, or both. Any review, receipt, dissemination or other use of this information by non-addressees is prohibited. If you received this in error or are a non-addressee, please contact the sender and delete the transmitted information.