SPSSX Discussion

News from the SPSS Community

Classic

List

Threaded

5 messages Options

Jon K Peck

News from the SPSS Community

Two new extension commands have been posted on the SPSS Community site, and one other has an update.

STATS COXREGR performs Cox regression. It adds some features not available in the built-in COXREG command - principally interval-based censoring using a counting process. It requires the R Essentials.

STATS VALLBLS FROMDATA creates value labels for a set of variables based on values of other variables in the dataset. Variables can be specified by name or via patterns in the names. Conflicts in the labels and duplicate labels are reported. It requires the Python Essentials.

Both extensions include a dialog box interface and traditional SPSS syntax.

STATS ADJUST WIDTHS, which synchronizes string variable sizes across datasets to facilitate merging, has a bugfix for the VARIABLES=ALL case.

The Python Essentials can be installed via the SPSS Community website (www.ibm.com/developerworks/spssdevcentral), or, for V21, via the media for that version. For V22, the Python Essentials are integrated into the main Statistics install.

The R Essentials can be installed via the website.

For Statistics 22, new extensions and updates can be installed from the Utilities > Extension Bundles > Download and Install Extension Bundles menu. For older versions of Statistics, these items need to be downloaded from the website and installed.

Reminder: If you are on Windows 7 or similar, extensions usually need to be installed using Run As Administrator.

We hope that you find these useful.

Jon Peck (no "h") aka Kim
Senior Software Engineer, IBM
[hidden email]
phone: 720-342-5621

Albert-Jan Roskam

Re: News from the SPSS Community

<snip>
>
>STATS VALLBLS FROMDATA creates value
labels for a set of variables based on values of other variables in the
dataset. Variables can be specified by name or via patterns in the
names. Conflicts in the labels and duplicate labels are reported.
It requires the Python Essentials.

Very useful (in fact I also created something like this). What happens if the values are not unique?
e.g.
1 males
1 gents
2 females

Will it (a) silently use "gents" (b) throw some exception?

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

Jon K Peck

Re: News from the SPSS Community

The command produces a table showing the number of conflicts such as your example and the number of duplicate labels for each variable
Duplicates would be, for example,
1 males
2 males
3 females

The conflicts rule is that the first assignment for a variable wins.

Jon Peck (no "h") aka Kim
Senior Software Engineer, IBM
[hidden email]
phone: 720-342-5621

From: Albert-Jan Roskam <[hidden email]>
To: [hidden email],
Date: 01/21/2014 01:48 AM
Subject: Re: [SPSSX-L] News from the SPSS Community
Sent by: "SPSSX(r) Discussion" <[hidden email]>

<snip> > >STATS VALLBLS FROMDATA creates value labels for a set of variables based on values of other variables in the dataset. Variables can be specified by name or via patterns in the names. Conflicts in the labels and duplicate labels are reported. It requires the Python Essentials. Very useful (in fact I also created something like this). What happens if the values are not unique? e.g. 1 males 1 gents 2 females Will it (a) silently use "gents" (b) throw some exception? ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD

Richard Ristow

Re: News from the SPSS Community

In reply to this post by Jon K Peck

At 05:41 PM 1/20/2014, Jon K Peck wrote:

>STATS VALLBLS FROMDATA creates value labels for a set of variables
>based on values of other variables in the dataset. Variables can be
>specified by name or via patterns in the names. Conflicts in the
>labels and duplicate labels are reported. It requires the Python Essentials.

This will be useful; but may I remark that the effect can also be
obtained rather neatly in native SPSS? I posted a solution on the
16th (*). It does not report duplicate or conflicting labels, but
those checks could be made by simple syntax running against dataset
"Summary", which the code creates.

Now, "STATS VALLBLS FROMDATA" is production-grade code, with
well-formatted output and extensive error checking; the code I posted
definitely is not. But while production-grade utilities are great, I
think there's also a point to posting simpler, more readable
solutions. (I submit that the native-SPSS solution is far more
readable than is the code for "STATS VALLBLS FROMDATA". I poured over
its Python for a long time just to find out how its AGGREGATEs are
written.) We don't just want to solve posters' immediate problems; we
want to present techniques that they (and other readers) may adapt
and apply later.

(*) See posting
Date: Thu, 16 Jan 2014 12:47:03 -0500
From: Richard Ristow <[hidden email]>
Subject: Re: value labels from string vars
To: [hidden email]

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD

Jon K Peck

Re: News from the SPSS Community

Richard,

I don't get it. The extension command - which no one has to use - makes the intent of the code clear, produces useful summary output, works on both string and numeric target variables, deals with special situations like embedded quotes, handles missing data and blank labels properly, and allows easy selection of variable sets with regular expressions. What's not to like? Lots of things that can be done easily with extensions can be done with more difficulty, fragility, or less generality with native code, but I don't see that as a reason not to produce and use extensions. A job that contains VALLBLS FROMDATA or similar extensions will be lots more maintainable than one with complex native code and will not fail mysteriously when unhandled situations such as a label with a quote happen to come along.

To each his own.

Jon Peck (no "h") aka Kim
Senior Software Engineer, IBM
[hidden email]
phone: 720-342-5621

From: Richard Ristow <[hidden email]>
To: [hidden email],
Date: 01/22/2014 11:14 PM
Subject: Re: [SPSSX-L] News from the SPSS Community
Sent by: "SPSSX(r) Discussion" <[hidden email]>

At 05:41 PM 1/20/2014, Jon K Peck wrote: >STATS VALLBLS FROMDATA creates value labels for a set of variables >based on values of other variables in the dataset. Variables can be >specified by name or via patterns in the names. Conflicts in the >labels and duplicate labels are reported. It requires the Python Essentials. This will be useful; but may I remark that the effect can also be obtained rather neatly in native SPSS? I posted a solution on the 16th (*). It does not report duplicate or conflicting labels, but those checks could be made by simple syntax running against dataset "Summary", which the code creates. Now, "STATS VALLBLS FROMDATA" is production-grade code, with well-formatted output and extensive error checking; the code I posted definitely is not. But while production-grade utilities are great, I think there's also a point to posting simpler, more readable solutions. (I submit that the native-SPSS solution is far more readable than is the code for "STATS VALLBLS FROMDATA". I poured over its Python for a long time just to find out how its AGGREGATEs are written.) We don't just want to solve posters' immediate problems; we want to present techniques that they (and other readers) may adapt and apply later. (*) See posting Date: Thu, 16 Jan 2014 12:47:03 -0500 From: Richard Ristow <[hidden email]> Subject: Re: value labels from string vars To: [hidden email] ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD