Does anyone have experience reformatting an SPSS file for optimal usage in Tableau?
I’m currently struggling with the best ways to turn my variables into cases that doesn’t result in double counting of key concepts. My original file has one case for each entity. In the new file there are multiple cases per entity.
I have a series of binary indicators that, when resolved to cases, inflate numbers.
Example data of variables and cases Names Major Pool Minor Pool Sales Total sales Contact type 1 Contact Type 2 Contact type 3 Result 1 George Major Pool Minor Pool A 250 750 yes no No positive George Major Pool Minor Pool B 500 750 yes no No positive Michael Major Pool Minor Pool C 250 275 no no No negative Michael Major Pool Minor Pool D 25 275 no no No negative Joan Major Pool Minor Pool A 74 165 no no No negative Joan Major Pool Minor Pool B 91 165 no no No negative Clint Major Pool Minor Pool C 100 600 no no No negative Clint Major Pool Minor Pool D 500 600 no no No negative Kevin Major Pool Minor Pool A 250 750 no no No negative Kevin Major Pool Minor Pool B 500 750 no no No negative Jason Major Pool Minor Pool C 250 349 no yes No positive Jason Major Pool Minor Pool D 25 349 no yes No positive Jason Major Pool Minor Pool A 74 349 no yes No positive Anne Major Pool Minor Pool B 91 191 no no No negative Anne Major Pool Minor Pool C 100 191 no no No negative Marnie Major Pool Minor Pool D 500 750 no no No negative Marnie Major Pool Minor Pool A 250 750 no no No negative Natalie Major Pool Minor Pool B 500 775 no no No negative Natalie Major Pool Minor Pool C 250 775 no no No negative Natalie Major Pool Minor Pool D 25 775 no no No negative Here if I try to report on the number of people in the Major Pool who had a contact 1 type of “yes” I’m getting two (because there are two rows of data) not my expected result of one, because it’s only one unique person.
Right now I am working with a file of ~500k cases and 250 variables.
Thank you for any guidance or help you can provide Natalie |
Administrator
|
Maybe you need to explore aggregating the file?
Please reply to the list and not to my personal email.
Those desiring my consulting or training services please feel free to email me. --- "Nolite dare sanctum canibus neque mittatis margaritas vestras ante porcos ne forte conculcent eas pedibus suis." Cum es damnatorum possederunt porcos iens ut salire off sanguinum cliff in abyssum?" |
I don't think I follow.
Aggregate the variables or the cases? The original SPSS file has one case per entity and many indicator variables (500k cases, 250ish variables). However, because I have 16 groups (currently binary indicators) to which any entity can belong (and they could belong to any combination of any groups) the presentation in Tableau becomes unwieldy. I can run the necessary data in SPSS, but am looking to use Tableau for reporting because of the ease of uniformly reporting on characteristics of all 16 subgroups. Another emailed me off list suggesting casetovars. The syntax within SPSS isn't the problem, it's the way to alter the layout/construction of the data file that is currently in SPSS (And works...) in such a way that Tableau can properly use it. If anyone else is currently using a combination of SPSS and Tableau it would be most helpful to chat. Natalie -----Original Message----- From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of David Marso Sent: Friday, June 01, 2012 10:21 AM To: [hidden email] Subject: Re: SPSS and Tableau Maybe you need to explore aggregating the file? -- View this message in context: http://spssx-discussion.1045642.n5.nabble.com/SPSS-and-Tableau-tp5713466p5713467.html Sent from the SPSSX Discussion mailing list archive at Nabble.com. ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD |
Natalie,
If you don't get anything from spss+tableau users, post back and specify how the data file has to be organized (give an example if possible) for tableau and the save file type range (e.g., xls, sav, txt, etc). Gene Maguin -----Original Message----- From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of Spring, Natalie Sent: Friday, June 01, 2012 1:24 PM To: [hidden email] Subject: Re: SPSS and Tableau I don't think I follow. Aggregate the variables or the cases? The original SPSS file has one case per entity and many indicator variables (500k cases, 250ish variables). However, because I have 16 groups (currently binary indicators) to which any entity can belong (and they could belong to any combination of any groups) the presentation in Tableau becomes unwieldy. I can run the necessary data in SPSS, but am looking to use Tableau for reporting because of the ease of uniformly reporting on characteristics of all 16 subgroups. Another emailed me off list suggesting casetovars. The syntax within SPSS isn't the problem, it's the way to alter the layout/construction of the data file that is currently in SPSS (And works...) in such a way that Tableau can properly use it. If anyone else is currently using a combination of SPSS and Tableau it would be most helpful to chat. Natalie -----Original Message----- From: SPSSX(r) Discussion [mailto:[hidden email]] On Behalf Of David Marso Sent: Friday, June 01, 2012 10:21 AM To: [hidden email] Subject: Re: SPSS and Tableau Maybe you need to explore aggregating the file? -- View this message in context: http://spssx-discussion.1045642.n5.nabble.com/SPSS-and-Tableau-tp5713466p5713467.html Sent from the SPSSX Discussion mailing list archive at Nabble.com. ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD |
Administrator
|
In reply to this post by Spring, Natalie
If Tableau supports weighting then you could aggregate to get the counts by person and weight by the reciprocal of the count. Alternatively use the sum of the reciprocals?
say n=2 1/2*2=1 ...
Please reply to the list and not to my personal email.
Those desiring my consulting or training services please feel free to email me. --- "Nolite dare sanctum canibus neque mittatis margaritas vestras ante porcos ne forte conculcent eas pedibus suis." Cum es damnatorum possederunt porcos iens ut salire off sanguinum cliff in abyssum?" |
Free forum by Nabble | Edit this page |